b 411

>> Invalid Option Number

*** DIALOG HOMEBASE (SM) Main Menu ***

Information:

1. Announcements (new files, free connect time, price changes, etc.)
2. Database, Rates, & Command Descriptions

3. Help in Choosing Databases for Your Topic 

4. Customer Services (telephone assistance, training, seminars, etc.) 

5. Product Descriptions 

Connections:

6. DIALOG Menus (SM)

7. DIALOG Business Connection (R), Headlines (SM), Medical Connection (SM)
8. DIALOG SourceOne (SM) Document Delivery

9. Data-Star 

10. Other Online Menu Services & Files (MoneyCenter (R), OAG, TNT, etc.)

/H = Help /L = Logoff /NOMENU = Command Mode 

Enter an option number to view information or to connect to an online 
service. Enter a BEGIN command plus a file number to search a database
(e.g., B1 for ERIC).
?Bb411 

>> Invalid Option Number 

*** DIALOG HOMEBASE (SM) Main Menu ***

Information:

1. Announcements (new files, free connect time, price changes, etc.) 

2. Database, Rates, & Command Descriptions 

3. Help in Choosing Databases for Your Topic 

4. Customer Services (telephone assistance, training, seminars, etc.) 

5. Product Descriptions 

Connections:

6. DIALOG Menus (SM)

7. DIALOG Business Connection (R), Headlines (SM), Medical Connection (SM)

8. DIALOG SourceOne (SM) Document Delivery

9. Data-Star

10. Other Online Menu Services & Files (MoneyCenter (R), OAG, TNT, etc.)

/H = Help /L = Logoff /NOMENU = Command Mode 

Enter an option number to view information or to connect to an online 
service. Enter a BEGIN command plus a file number to search a database 
(e.g., B1 for ERIC).
?b 411 
SYSTEM:HOME

21sep94 07:07:43 User214323 Session D226.2
$0.03 0.002 Hrs FileHomeBase 
$0.03 Estimated cost FileHomeBase 
$0.02 TYMNET

$0.05 Estimated cost this search

$0.31 Estimated total session cost 0.013 Hrs.
File 411:DIALINDEX(R)



DIALINDEX(R)

(c) 1994 Dialog Info.Svcs.

*** DIALINDEX search results display in an abbreviated ***
*** format unless you enter the SET DETAIL ON command. *** 
?sf all 

You have 335 files in your file list. 
(To see banners, use SHOW FILES command) 
?s autocrine (w) motility (w) factor?{or autotaxin or ATX-t
Your SELECT statement is:

s autocrine (w) motility (w) factor?, or autotaxin or ATX. <
 Items   File
     5     5: BIOSIS PREVIEWS(R)_1969-1994/OCT W2
     1     6: NTIS_1964-1994/Oct B2
 Examined 50 files
     2    73: EMBASE_1974-1994/ISS 36
     1    76: Life Sciences Collection_1978-1994/Aug
     2   144: Pascal_1973-1994/Aug
     3   155: MEDLINE(R)_1966-1994/Nov W2
 Examined 100 files
     1   156: TOXLINE(R)_1965-1994/Sep
     3   159: CANCERLIT(R)_1963-1994/Sep
 Examined 150 files
     1   265: Fed. Res. in Progress_1994/Aug
     1   266: Fed. Res. in Progress_1994/Aug
     1   345: INPADOC/Fam. & Legal Stat._1994/UD=9436
     1   351: DERWENT WPI_1981-1994/UD=9430;UA=9426;UM=9417
     1   357: Derwent Biotechnology Abs_1982-1994/Oct B1
     1   358: Current Biotech Abs_1983-1994/Oct
    19   398: CHEMSEARCH(TM)_1957-1994/Aug 13341571RNS
     3   399: CA Search(R)_1967-1994/UD=12110
 Examined 200 files
     4   434: Scisearch(R)_1974-1994/Aug W4
     3   440: Current Contents Search(R)_1990-1994/Aug W2
 Examined 250 files
     1   669: Fed. Register_1988-1994/Sep 20
 Examined 300 files



19 files have one or more items; file list includes 335 files. 



>>>Unrecognizable command. 
?rf 

Your last SELECT statement was: 

S AUTOCRINE (W) MOTILITY (W) FACTOR?. OR AUTOTAXIN OR ATX. -C



Ref    Items   File
N1        19   398: CHEMSEARCH(TM)_1957-1994/Aug 13341571RNS
N2         5     5: BIOSIS PREVIEWS(R)_1969-1994/OCT W2
N3         4   434: Scisearch(R)_1974-1994/Aug W4
N4         3   155: MEDLINE(R)_1966-1994/Nov W2
N5         3   159: CANCERLIT(R)_1963-1994/Sep
N6         3   399: CA Search(R)_1967-1994/UD=12110
N7         3   440: Current Contents Search(R)_1990-1994/Aug W2
N8         2    73: EMBASE_1974-1994/ISS 36
N9         2   144: Pascal_1973-1994/Aug
N10        1     6: NTIS_1964-1994/Oct B2

19 files have one or more items; file list includes 335 files.



- Enter P or PAGE for more - 



?save temp 

Temp SearchSave "TD092" stored
?b Nl«fflISJ«i^£©8 9 35i 

>>> Ranked file range must be entered as "Nrr - Nrr" (e.g., N1-N10). 

>>>".l" is not a valid category or service name 

>>>' n s M is invalid in a filelist., 

>>> Invalid BEGIN command 

?< 

?b NisN10}<£51,3<:5«: 

>>>"3. <5 n is not a valid category or service name 
>>>"<" is invalid in a filelist. 
>>> Invalid BEGIN command 
?b Nl sN10, £BS*$5B8 

> > > " . 3. . 351 1 11 is not a valid category or service name 
21sep94 07:14:31 User214323 Session D226.3
$5.22 0.116 Hrs File411
$5.22 Estimated cost File411
$1.32 TYMNET

$6.54 Estimated cost this search

$6.85 Estimated total session cost 0.129 Hrs.



SYSTEM:OS - DIALOG OneSearch 

File 398:CHEMSEARCH(TM) 1957-1994/Aug 13341571RNS
(c) 1994 Amer. Chem. Soc.
*File 398: Use is subject to the terms of your user/customer agreement.
File 5:BIOSIS PREVIEWS(R) 1969-1994/OCT W2
(c) 1994 BIOSIS
File 434:Scisearch(R) 1974-1994/Aug W4
(c) 1994 Inst for Sci Info
File 155:MEDLINE(R) 1966-1994/Nov W2
(c) format only 1994 Dialog Info.Svcs.
File 159:CANCERLIT(R) 1963-1994/Sep
(c) format only 1994 Dialog Info.Svcs.
*File 159: See HELP NEWS 159 for NCI message.
File 399:CA Search(R) 1967-1994/UD=12110
(c) 1994 American Chemical Society
*File 399: Use is subject to the terms of your user/customer agreement.
File 440:Current Contents Search(R) 1990-1994/Aug W2
(c) 1994 Inst for Sci Info
*File 440: Use Format 19 for contents records (LIMIT /CONT)
Use Formats 2-9 for individual article records (LIMIT /NCONT)
File 73:EMBASE 1974-1994/ISS 36
(c) 1994 Elsevier Science B.V.
*File 73: Truncate EMTREE codes (e.g. DC=C1.120?) for complete retrieval.
See HELP NEWS 73 for explode feature.
File 144:Pascal 1973-1994/Aug
(c) 1994 INIST/CNRS
File 6:NTIS 1964-1994/Oct B2
Comp. & distr. 1994 NTIS, US Dept of Commerce
File 358:Current Biotech Abs 1983-1994/Oct
(c) 1994 Royal Society of Chemistry



Set Items Description 



?e-Cxs 



Ref Items RT Index-term 

E1        0      *.-CXS
E2  1149720      (//BIOCHEMICAL PHENOMENA, METABOLISM, NUTRITI
E3        0   1  ((((DIETHOXYPHOSPHINOTHIOYL)THIO)ACETYL)METHYL
E4        0   1  ((((ISOPROPOXY)METHYLPHOSPHORYLOXY)IMINO)METHY
E5        0   1  ((((2,6 DIETHYLPHENYL)CARBAMOYL)METHYL)IMINO)D
E6        0   1  ((((2,6 DIISOPROPYLPHENYL)CARBAMOYL)METHYL)IMI
E7        0   1  ((((3 BROMO 2,4,6 TRIMETHYLPHENYL)CARBAMOYL)ME
E8        0   1  ((((3 BROMOMESITYL)CARBAMOYL)METHYL)IMINO)DIAC
E9        0   1  ((((4 BUTYLPHENYL)CARBAMOYL)METHYL)IMINO)DIACE



E10
E11
E12



tZi 1 ( < <CARBOXYMETHYL> IMINO) BIS (ETHYLENENITRILO) ) TE 
13 1 ( < ( IODOACETYL) AMINO) ETHYL) 5 NAPHTHYLAM INE 1 S 
13 1 ( ( (PARA CUMENYLCARBAMOYL) METHYL) IMINQ) DI ACETIC 



- Enter P or PAGE for more -


?exs 




Executing TD092

           24783  AUTOCRINE
          143245  MOTILITY
               3  FACTOR?.
               0  AUTOCRINE (W) MOTILITY (W) FACTOR?.
              46  AUTOTAXIN
                  ATX. -C
      S1      46  AUTOCRINE (W) MOTILITY (W) FACTOR?.


?s ATX

      S2     613  ATX
?s s1 or s2

              46  S1
             613  S2
      S3     649  S1 OR S2


?s antibofla 




      S4       0  ANTIBO.D?
?s antibod?

Processing
Processed 10 of 11 files ...
Completed processing all files
      S5 1718826  ANTIBOD?
?s s5 and s3

>>>Unrecognizable Command
?s s5 and s3

         1718826  S5
             649  S3
      S6      73  S5 AND S3


?rd 




>>>Duplicate detection is not supported for File 3 


>>>Records from unsupported files will be retained 



>>> Record 440:4585345 ignored; incomplete bibliographic data, not retained 
in RD set 

>>> Record 440:3418381 ignored; incomplete bibliographic data, not retained 
in RD set 

... examined 50 records (50) 
...completed examining records 

S7 47 RD (unique items) 

?t S7/7/1-47 

7/7/1 (Item 1 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

7060647 BIOSIS Number: 87121168

SPECIFICITY OF ANTIBODIES TO SEA ANEMONE TOXIN III AND IMMUNOGENICITY OF
THE PHARMACOLOGICAL SITE OF ANEMONE AND SCORPION TOXINS

BAHRAOUI E M; EL AYEB M; GRANIER C; BERESS L; ROCHAT H

LABORATOIRE DE BIOCHIMIE, FACULTE DE MEDECINE SECTEUR NORD, BOULEVARD
PIERRE DRAMARD, F-13326 MARSEILLE CEDEX 15, FRANCE.
EUR J BIOCHEM 180 (1). 1989. 55-60. CODEN: EJBCA
Full Journal Title: European Journal of Biochemistry
Language: ENGLISH

Toxin III (ATX III) of the sea anemone (Anemonia sulcata) is a
polypeptide containing 27 amino acid residues. It has no sequence
similarity with other toxins (ATX I and II) from the species, or with
scorpion toxins, although they apparently act in a similar manner by
prolonging action potentials. The specificity of ATX III antibodies was
characterised using ATX III, ATX I, native and chemically modified ATX II
and scorpion .alpha.-toxins. The results obtained suggest that a region of
ATX III, partially or totally overlapping the pharmacological site shared
with ATX I and ATX II, is immunogenic. It includes a guanidino and at least
two carboxylate groups. The corresponding region is not immunogenic in ATX
I and ATX II. Anti-(ATX III) antibodies recognize the similar regions of
ATX I and ATX II and apparently do not recognize scorpion toxins.



7/7/2 (Item 2 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

6478959 BIOSIS Number: 85079480

CELLULAR PATHWAYS FOR REJECTION OF CLASS-I-MHC-DISPARATE SKIN AND TUMOR
ALLOGRAFTS

SMITH D M; STUART F P; WEMHOFF G A; QUINTANS J; FITCH F W

UNIV. CHICAGO, COMMITTEE ON IMMUNOLOGY, DEP. PATHOLOGY, BOX 414, 5841, S.
MARYLAND AVE., CHICAGO, ILLINOIS 60637.

TRANSPLANTATION (BALTIMORE) 45 (1). 1988. 168-175. CODEN: TRPLA
Full Journal Title: TRANSPLANTATION (Baltimore)
Language: ENGLISH

We have investigated the relative roles of the Lyt-2+ and L3T4+ T
lymphocyte subsets in rejection of class-I-MHC-antigen-disparate skin and
tumor allografts. To deplete T cells in vivo, rat anti-Lyt-2 or anti-L3T4
monoclonal antibodies (mAb) were administered to adult-thymectomized (ATX)
recipient mice prior to transplantation. BALB/c (H-2d) recipient mice
rejected the Ia- Sarcoma I (SaI) (H-2a) tissue culture-derived tumor after
depletion of the L3T4+ T cell subset in vivo. In contrast, depletion of the
Lyt-2+ T cell subset permitted lethal tumor growth in all recipient mice.
To determine the role of particular T cell subsets in rejection of Ld
class-I-MHC-antigen-disparate allografts, BALB/c skin was transplanted to
BALB/c-H-2dm2 recipient mice. Skin grafts were rejected by control mice
with a mean survival time (MST) of 14.5 days. The MST of skin grafts for
mice treated with anti-L3T4 mAb was 16.6 days. In contrast, administration
of anti-Lyt-2 mAb alone (MST = > 47 days) or together with anti-L3T4 mAb
(MST = > 50 days) caused prolonged or indefinite graft survival in all
recipient mice. Depletion of specific T cell subsets was confirmed by flow
cytometric analysis and by analysis of T cell function in vitro. These
results suggest that Lyt-2+ T lymphocytes are essential for rejection of
class-I-MHC-disparate allografts; indirect presentation of alloantigen to
L3T4+ T cells may not be necessary for rejection.



7/7/3 (Item 3 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

5418734 BIOSIS Number: 82063537

FUNCTIONAL CHARACTERIZATION OF PRETHYMIC T CELLS COMMITTED TO
ALLOREACTIVITY

BLANK M; GRONAU C; SAUER M; WOTTGE H-U; MUELLER-RUCHHOLTZ W
DEP. IMMUNOL., UNIV. KIEL, BRUNSWIKERSTR. 2-6, D-2300 KIEL, FRG.
TRANSPLANTATION (BALTIMORE) 41 (6). 1986. 759-7#S. CODEN: TRPLA
Full Journal Title: TRANSPLANTATION (Baltimore)
Language: ENGLISH

The results of previous experiments on MHC fully allogeneic bone marrow
transplantation (BMT) in non-thymectomized recipients indicated that
anti-MHC alloreactivity starts to become irreversibly committed at the
prethymic level. This is a matter of some controversy. Since it is possible
that conflicting results depend on the methods chosen, we reexamined our
previous results by applying two new approaches. Adult thymectomized (ATX)
Balb/c mice received a syngeneic fetal thymus either 3 weeks before or 3
weeks after lethal irradiation and reconstitution with C57BL/6 BM incubated
in antiserum. Since monoclonal antibodies such as anti-Thy 1 are of limited
value for investigations of the above type (Thy 1 antigen crosses the
prethymic/thymic border), we used two highly selective, excessively
cytotoxic xenoantisera for incubation of the donor BM - either a specific
anti-T cell serum (SAT) that eliminated only mature T cells, or a specific
antilymphocyte serum (SAL) that reacted additionally with a subset of
prethymic T cells (PTC). In both experimental approaches the results were
similar: (1) Recipients of SAT-BM developed antihost reactivity, in
contrast to recipients of SAL-BM. (2) SAT-BM recipients became
immunodeficient, whereas SAL-BM chimeras were immunocompetent. (3) Late
mortality was observed only following SAT treatment. (4) Preliminary
morphological findings in the lymphoid tissues of BM recipients agree fully
with the functional observations. We conclude that the data confirm our
previous results in nonthymectomized BM recipients - i.e., PTCs initiate
antihost reactivity in MHC fully allogeneic BMT - and PTC commitment is
thymus/thymus factor independent. These conclusions are discussed with
regard to the problems of MHC allogeneic clinical BMT.



7/7/4 (Item 4 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

4437760 BIOSIS Number: 78011583

INJECTION OF MOUSE THYRO GLOBULIN AND-OR ADULT THYMECTOMY DO NOT BREAK
TOLERANCE TO THYRO GLOBULIN DURING THE LUPUS-LIKE GRAFT VS. HOST DISEASE IN
MICE

VAN RAPPARD-VAN DER VEEN F M; KONG Y M; ROSE N R; KIMURA M; GLEICHMANN E
PUBLICATION SECRETARIAT, CENTRAL LAB. NETHERLANDS RED CROSS BLOOD
TRANSFUSION SERV., P.O. BOX 9506, 1006 AK AMSTERDAM, NETHERLANDS.
CLIN EXP IMMUNOL 55 (3). 1984. 525-534. CODEN: CEXIA
Full Journal Title: Clinical and Experimental Immunology
Language: ENGLISH

In contrast to lupus-like autoantibodies such as anti-DNA, autoantibodies
to mouse thyroglobulin (MTg) were not previously detectable in serum of F1
mice suffering from a lupus-like graft vs. host disease (GVHD) (GVH F1).
Possible explanations for this restricted autoantibody formation during the
potent allogeneic stimulation were investigated. The main question was
whether the natural level of circulating MTg was too low to induce the
formation of anti-MTg antibodies in GVH F1 mice. Existence, in the F1 mice
studied, of B cells capable of producing anti-MTg antibodies was
demonstrated by injection of lipopolysaccharide (LPS) and exogeneous MTg.
MTg injected into various F1 mice at the onset of the GVH reaction (GVHR)
failed to overcome the lack of antibody formation to MTg even though the
GVHR led to a severe lupus-like disease. Adult thymectomy (ATx) of either
the recipients, the donors or both also did not break tolerance to MTg
during the GVHR, irrespective of administration of exogeneous MTg. Thus,
neither i.v. injection of MTg nor ATx, designed to remove T suppressor (Ts)
cells, is adequate to enable an autoantibody response to MTg during
lupus-like GVHD. Hence, the non-specific T cell help that causes lupus-like
GVHD seems to be intrinsically insufficient to trigger the Tg reactive B
cells. Globular proteins, such as Tg, may require specific T cell help. In
the presence of only non-specific T help, self-antigens such as DNA seem to
be more apt than globular proteins to provide an effective signal 1 to the
corresponding autoreactive B cells.



7/7/5 (Item 5 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

4408376 BIOSIS Number: 771383703

NATURE OF HISTAMINE INDUCED SUPPRESSOR CELLS ON THE EFFECTOR PHASE OF
DELAYED HYPER SENSITIVITY TO DI NITROFLUORO BENZENE IN MICE

SATO K

THIRD DEP. INTERNAL MED., SCH. MED., UNIV. TOKUSHIMA, TOKUSHIMA.
SHIKOKU ACTA MED 39 (3). 1983 (RECD. 1984). 291-300. CODEN: SKI2A
Full Journal Title: Shikoku Acta Medica
Language: JAPANESE

The nature and role of suppressor cells on the effector phase of delayed
type hypersensitivity (DTH) to 2,4-dinitro-1-fluorobenzene (DNFB) were
investigated in mice using cell transfer experiment. BALB/c mice were
sensitised by high dose (2-5%) DNFB, and spleen cell suspension was made 5
days later. These sensitised donor spleen cells were incubated with 10-4 M
histamine in vitro for 30 min at 37.degree.C, and were transferred i.v. to
the 1% DNFB sensitised recipient mice. DTH was measured by ear swelling 1
day after challenge of 1% DNFB. Suppression induced by the treatment with
histamine was specific to the antigen DNFB. T-suppressor cells (Ts-amp
cells) in the spleen were sensitive to anti-Thy-1 antibody. These cells
could also be passed through a nylon wool column. Suppression disappeared
by pretreatment of donor mice with cyclophosphamide (CY). Thus, histamine
induced DTH Ts-amp cells in the spleen are sensitive to CY. The spleen
cells from donor mice thymectomised at adult (ATX) before sensitisation
with DNFB did not express suppressive activity on DTH. The
histamine-receptor positive Ts-amp cells were present in the spleen cells
of mice which were sensitized to a large dose of DNFB and induced
suppressor T cells to regulate the effector phase of DTH in the presence of
histamine. The role of the suppressor cells was discussed.



7/7/6 (Item 6 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

4084272 BIOSIS Number: 76034123

CHARACTERIZATION OF A CONCANAVALIN A INDUCED AMPLIFIER T CELL WHICH
AUGMENTS IN-VITRO ANTIBODY RESPONSES TO DI NITRO PHENYL FICOLL

FINESILVER A G; BRALEY-MULLEN H

DEP. MICROBIOL., UNIV. MO., COLUMBIA, MO. 65212.
CELL IMMUNOL 75 (2). 1983. 199-213. CODEN: CLIMB
Full Journal Title: Cellular Immunology
Language: ENGLISH

The addition of the T-cell mitogen concanavalin A (Con A) on Day 2 of a
4-day in vitro culture of murine spleen cells with the thymus-independent
(TI) antigen DNP[dinitrophenyl]-Ficoll resulted in significant
enhancement of the direct antitrinitrophenyl (TNP) plaque-forming cell
(PFC) response. This enhancement was mediated by a nylon wool- and
anti-Ig-nonadherent amplifier T cell (TA). TA activity was not eliminated by
in vitro treatment of T cells with anti-Thy 1.2 and complement. TA activity
could be eliminated by pretreatment of mice with antilymphocyte serum (ALS)
in vivo, followed by in vitro treatment of T cells with anti-Thy 1.2 + C. TA
appear to bear a low surface density of Thy-1 antigen. These TA were
relatively resistant to ALS used alone, to cyclophosphamide, and to low
dose in vitro irradiation. TA were still present in the spleen 14 wk after
adult thymectomy (ATx). They were I-J positive and apparently belonged to
the Lyt 1+2- T-cell subset.



7/7/7 (Item 7 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

3986737 BIOSIS Number: 75034096

CORTICO STEROID ACTION ON LYMPHOCYTE SUB POPULATIONS AND HUMORAL IMMUNE
RESPONSE OF AXOLOTL URODELE AMPHIBIAN

TOURNEFIER A

LAB. D'IMMUNOL. COMPAREE, UNIV. PIERRE MARIE CURIE, 9 QUAI SAINT-BERNARD,
75005 PARIS, FR.

IMMUNOLOGY 46 (1). 1982. 155-162. CODEN: IMMUA
Full Journal Title: Immunology
Language: ENGLISH

The effect of in vivo hydrocortisone (HC) treatment on thymocytes,
splenic and blood lymphocytes and on allogeneic and humoral immune
responses were investigated in the axolotl [urodele amphibian]. HC induces a
profound lymphocytopenia in the thymus (83% HC sensitive) and the spleen
(50% HC sensitive) but not in the blood. The density gradient analysis of
HC-treated axolotls showed that thymic cell populations of light density
were more sensitive than populations of high density. The timing of HC
administration in relation to the antigenic challenge is crucial for the
humoral immune response. If HC injection is given 8 days before or on the
same day as injection of horse red blood cells (HRBC), the antibody
response is markedly enhanced. If HC injection occurs 8 days after
injection of HRBC or on the day of the maximum anti-HRBC response, the
antibody response is unchanged. The allograft immune response is not
affected by HC treatment. The parallelism of the enhanced anti-HRBC
response after HC treatment, with the same enhanced response obtained in
adult thymectomized (ATx) animals, as well as in ATx-HC-treated axolotl,
may be explained by the presence of a corticosensitive, T-like population
of suppressor cells in axolotl.



7/7/8 (Item 8 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

3661783 BIOSIS Number: 73054150

SERUM THYMIC FACTOR RESTORES THE ABILITY OF ADULT THYMECTOMIZED MICE TO
BE SUPPRESSED BY HAPTEN MODIFIED SELF

GROUIX B; CHARREIRE J; ERARD D; GALANAUD P; BACH J-F

INSERM U131, 32 RUE DES CARNETS, 92140 CLAMART, FR.

CELL IMMUNOL 64 (1). 1981. 144-149. CODEN: CLIMB

Full Journal Title: Cellular Immunology

Language: ENGLISH

The injection of trinitrophenyl (TNP) conjugated spleen cells (TNP-SC)
into normal recipients induces a specific suppression of the anti-TNP
antibody response to the T-independent antigen TNP polyacrylamide. The
T-cell dependency of this suppression can be shown by S arguments. Splenic 
T cells from TNP-SC-treated mice may suppress the in vitro antibody
response of normal mouse spleen cells. The anti-TNP response of adult
thymectomized (ATX) mice is evidently not suppressed after injection of
TNP-SC. Treatment of such ATX mice with a synthetic analog of serum thymic
factor (FTS) 7-9 wk after thymectomy restores their susceptibility to
TNP-SC-induced suppression.

POLY GENIC CONTROL OF THE IMMUNE RESPONSE TO F ANTIGEN
SILVER D M; LANE D P
Full Journal Title: Immunogenetics

The ability to produce an autoimmune response to F antigen in mice is
under H-2-linked and non-H-2-linked Ir-gene control. There is an absolute
requirement for a k allele at H-2K or I-A to produce anti-F antibodies. Low
and high responsiveness is controlled by a non-H-2-linked Ir gene which
behaves in a similar fashion to Ir-3, in that as the dose of F-antigen is
lowered, low responders behave as high responders and vice versa. This
conversion from low to high responsiveness also occurs within a month after
ATX [adult thymectomy]. Most F1 hybrids derived from (responder .times.
nonresponder) parents bearing identical H-2 types behave as dominant
nonresponders. As a result of ATX, such F1 mice convert to high responders.
This conversion occurs if the animals are not immunised before day 90. If
they receive F antigen prior to that time, they remain nonresponders for
7-9 mo. One F1 combination, AKD2, behaves as a dominant high responder.
Genetic analysis showed that in the presence of a k allele at H-2K or I-A,
a non-H-2-linked Ir gene inherited from the AKR mice determined dominant
responsiveness. No manipulation of the immune response or combination of
genes converted nonresponders lacking a k allele into responders. Such
complex genetic control suggests regulation by a number of independently
segregating loci whose function it is to limit the autoimmune response to F
antigen.

THYMUS DEPENDENT SUPPRESSION OF HELPER FUNCTION IN ADULT XENOPUS-LAEVIS
THE SOUTH AFRICAN CLAWED TOAD

RUBEN L N; METTE S A; COCHRAN S K; EDWARDS B F
THYMUS 2 (1). 1980. 19-26. CODEN: THYMD

Full Journal Title: Thymus

Language: ENGLISH

Carrier-dependent and specific helper function capable of amplifying
anti-hapten (TNP [trinitrophenyl]) responses can be demonstrated in X.
laevis. This amplifying activity appears to be short-lived, since
separation of the carrier (RBC [red blood cell]) priming and hapten-carrier
injections by only 1 wk eliminates the effect of carrier priming. Carrier
enhancement is optimal when the injections are made 2 to 4 days apart.
Surgical removal of both thymuses (ATx) of young adults S days after
carrier priming allows helper function to persist for at least S wk. The
magnitude of the anti-hapten response in ATx animals is substantially
greater than in sham thymectomized (STx) siblings, even when an optimal
immunisation schedule was utilized. Thymus-dependent suppression of an
anti-heterologous erythrocyte response was demonstrated in vitro. Animals
were immunized in vivo and their individual thymuses and spleen halves were
subsequently cultured separately and together. The addition of thymus with
its corresponding spleen substantially reduced the number of splenic
antigen-binding cells and the titer of antibody released into the culture
medium. This relatively primitive amphibian apparently possesses
thymus-derived suppressor cells which regulate humoral immune responses.

ACCELERATION OF AGE RELATED CHANGES IN THE IMMUNITY AND ENDOCRINE ORGANS
BY ADULT THYMECTOMY IMMUNOLOGICAL AND HISTO PATHOLOGICAL STUDIES

HAYASHI Y; HIROKAWA K

Long-lived B6C3F1 female mice were adult thymectomized (aTX) at 6 wk of
age, and the effect of aTX was observed from immunological and
histopathological viewpoints up to 24 mo. of age. Anti-SRBC [sheep red
blood cells] antibody response in aTX group showed an apparently
accelerated decline with age as compared with age-matched control.
Mitogenic responsiveness of spleen and lymph node cells to
phytohemagglutinin (PHA) and concanavalin (Con) A in the aTX group showed a
slightly accelerated decline with age, although statistically it was not
significant except for the responsiveness of spleen cells to PHA
[phytohemagglutinin]. Thymic function to induce helper T [thymus-derived]
cells was maintained throughout life, although it declined with age, but
its function to induce PHA- and Con A-responsive T cells disappeared in an
early stage of life. Histopathologically, age related changes of the
endocrine organs were apparently accelerated in the aTX group, suggesting
that the normal function of endocrine organs appeared to have correlation
with normal function of the thymus throughout life. The aTX increased the
incidence of spontaneously occurring reticulum cell sarcoma in aged mice,
supporting the concept of immunological surveillance.

THE SUPPRESSION OF THE DEVELOPMENT OF EXPERIMENTAL THYROIDITIS AND
RELATED CELLULAR IMMUNITY IN THE GUINEA-PIG FOLLOWING ADULT THYMECTOMY

HOJO K; HIRAMINE C

The role of the thymus in the pathogenesis of experimental allergic
thyroiditis (EAT) in the guinea pig was studied after the lapse of a
sufficiently long period following adult thymectomy. Female Hartley guinea
pigs thymectomized or sham-operated at 10-12 wk of age were immunized by a
single injection of homologous thyroid extract in complete Freund's
adjuvant (CFA) 7 or more mo. after operation. The animals were sacrificed
at 4 or 5 wk after immunization. Adult thymectomized and subsequently
sensitized (ATx-sensitized) animals showed a markedly depressed ability to
develop thyroiditis, 18 of the 21 animals thyroid lesions which could be
graded as slight or lower than sham-operated controls. Delayed skin
reaction and macrophage migration inhibitory factor production of lymph
node cells to thyroid antigen were reduced in the ATx-sensitized animals.
Enhancement of migration was noted in some of the animals. Anti-thyroid
hemagglutinating antibody was not detected in about a quarter of
ATx-sensitized animals, in the remainder of which the titers were at the
level similar to that of the sham-operated controls. There was a
substantial decrease in the percentage of rosette-forming cells (T [thymus
derived] cells) on unsensitized ATx animals 14 mo. after operation. The
suppression of the development of thyroiditis and related cellular immunity
may be a reflection of a decline in peripheral T cell population during the
long term after thymectomy. EAT in the guinea pig may be a thymus dependent
disease.

SUB POPULATIONS OF SPLENIC THYMUS DERIVED CELLS REGULATING AN ANTI HAPTEN
ANTIBODY RESPONSE PART 1 HELPER AND AMPLIFIER CELLS

MUIRHEAD D Y; CUDKOWICZ G

There was a pronounced quantitative difference between the helper
activities of B6C3F1 splenic T [thymus derived] cells sensitised with
unmodified vs. modified antigens of SRBC [sheep red blood cells]. Modified
SRBC induced the greater helper activity which was measured by the
magnitude of an anti-TNP [trinitrophenol] response (Ig [immunoglobulin] M
and IgG) elicited in vivo by virgin B [bone marrow derived] lymphocytes.
Antigen modification was produced by conjugating SRBC with the hapten or
simply by incubating SRBC in cacodylate buffer. There were restrictions
with respect to erythrocyte species and mouse strains for this differential
priming to occur. The relatively poor performance of SRBC-primed T
lymphocytes was apparently not due to suppressor T cells, but rather to
activation of only 1 of 2 identified T cell subpopulations required for
full helper activity. Unmodified SRBC activated a subpopulation of helper
cells characterized as sensitive to elimination by ATS [anti-thymocyte
serum] and long-lived after ATx [adult thymectomy] but failed to activate
in B6C3F1 mice a second subpopulation of amplifier cells resistant to
elimination by ATS and short-lived after ATx. Modified SRBC activated
helper and amplifier cells. Under appropriate conditions these subsets of T
cells were strongly synergistic in promoting anti-hapten antibody formation
especially of the IgG class. The involvement of 2 distinct types of T
lymphocytes in the positive regulation of antibody responses raises
interesting and novel questions concerning the sequence of events in the
triggering of B cells and the subsequent development of the response.
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REGULATION OF ANTI HAPTEN ANTIBODY RESPONSE BY CHEMICALLY MODIFIED 
CARRIER ANTIGEN PREFERENTIALLY PROVOKING DELAYED TYPE HYPER SENSITIVITY 
PART £ DELAYED TYPE HYPER SENSITIVITY REACTIVITY AND CARRIER SPECIFIC 
SUPPRESSION OF ANTI DI NITRO PHENOL ANTIBODY RESPONSE INDUCED BY PRIMING 
WITH DODECANOYL BOVINE SERUM ALBUMIN ARE MEDIATED BY FUNCTIONALLY DISTINCT 
THYMUS DERIVED CELL SUB POPULATIONS 

MACHIDA A; KUMAZAWA Y; MIZUNOE K
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Language: ENGLISH

Experiments were carried out to determine whether the cell populations 
involved in DTH and the suppression of antibody response are identical. The 
effects of 4 treatments, i.e., adult thymectomy (ATx), X-irradiation,
anti-mouse thymocyte serum (ATS) and hydrocortisone (HC) on the induction
of DTH and on the carrier-specific suppression of antibody response were
observed in mice immunized with chemically modified antigen, dodecanoyl-BSA
[bovine serum albumin] (d-BSA), emulsified with complete Freund's adjuvant
(CFA). DTH induced by immunization with d-BSA remained constant in adult
thymectomized mice, whereas the suppression of antibody response was not
inducible in these animals. Injection of low doses of ATS caused the
depression of DTH in mice primed with d-BSA, but did not affect the
suppressive activities of their spleen cells. Sublethal X-irradiation 1 wk
prior to d-BSA priming inhibited the generation of suppressor cells but did
not affect the generation of cells mediating DTH. The suppressive effect
was also abrogated by sublethal X-irradiation given 2 days after
immunization with DNP [dinitrophenylated]-BSA (14 days after priming with
d-BSA). The treatment of animals with HC 2 days before the footpad
challenge or immunization with DNP-BSA depressed the ability of animals to
induce DTH and the suppression of antibody response. However, the latter
was more sensitive to HC than the former. d-BSA-primed spleen cells were
capable of suppressing anti-DNP response, but not of inducing
DTH-reactivity upon transfer to recipient mice. DTH-reactivity and the
carrier-specific suppression of anti-hapten antibody response induced by 
injection of d-BSA are apparently mediated by different cell populations.
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Abstract: Control animals and rats infected 90 days earlier, by inoculation
of 1 x 10(6) trypomastigotes of Trypanosoma cruzi at weaning, were
subjected to adult thymectomy (ATx) or sham operation (S-ATx) and
assessed 3 months later for the presence of myocardial lesions and
levels of lymph node and spleen T-cell populations. Chronic focal
myocarditis (CFM) developed in 78% and 84% of S-ATx or ATx infected
rats, respectively. While the two groups of infected rats did not
differ as to the occurrence of myocardial lesions, large foci of CFM
were more prevalent in ATx infected rats. Chronic T. cruzi (Tc)
infection resulted in decreased CD4+ and increased CD8+ lymph node and
spleen cells, with CD8+ lymphocytes being lowered to normal values in
the spleen of the ATx infected group. It is suggested that ATx might
act by interfering with a down-regulating immunoregulatory mechanism,
leading to an exacerbation of autoimmune reactions believed to be
involved in the generation of myocardial damage.
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Islet cell transplantation in type I diabetes mellitus: evaluation of
humoral immune response.

Kondratiev YY; Sadovnikova NV; Petrova GN; Fedotov VP; Bljumkin VN
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Four males and three females ranging in age from 20 to 35 years and 
afflicted with complicated Type 1 diabetes for more than 8 years underwent
islet cell allotransplantation (ATx, 6 cases) and xenotransplantation (XTx,
1 case). Precultured islet cells derived from human or bovine fetal
pancreata were injected into the m. rectus abdominis. Immunosuppression was
not applied. Plasma C-peptide and islet cell surface antibodies (ICSA) were
continually measured both before and until the twentieth week following
islet cell transplantation. All recipients were subdivided as "responsive"
(RR, 3 males) or "non-responsive" (NRR, 1 male and 3 females), according to
the dynamics of their ICSA levels. All 3 RR (1 XTx and 2 ATx) showed a peak
of ICSA two weeks after cell injection. Subsequent ICSA levels had the
tendency to either diminish or increase. Heterogeneity of preoperative
antibody level, especially in NRR, was also observed. No associations
between ICSA and ATx or XTx, age at diabetes onset, or duration of the
disease were found. Only one RR with XTx had a reduced daily insulin
requirement and a significant C-peptide response similar to the dynamics of
ICSA levels. A greater mass of available bovine islet cells might be
responsible for this effect. 
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Temporal changes of suppressor T lymphocytes and cytotoxic T lymphocytes 
in syngeneic murine malignant gliomas.
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The temporal activities of suppressor T lymphocytes (Ts) and cytotoxic T 
lymphocytes (CTL) were investigated in a syngeneic murine malignant glioma 
(a methylcholanthrene-induced ependymoblastoma of C57BL/6 mouse origin,
203-glioma). After the b.c. tumor inoculation, it was suggested that both
Ts and CTL were generated with target specificity against 203-glioma cells,
because neither Ts nor CTL activity were seen against syngeneic EL4
(benzpyrene-induced thymoma), allogeneic P815 (methylcholanthrene-induced
mastocytoma of DBA/2 mouse origin), or YAC-1 (Moloney leukemia-induced
T-cell lymphoma of A/Sn mouse origin), but only against 203-glioma. It was
found that the generation of Ts preceded that of CTL and that the turnover
was faster; furthermore, Ts were generated in the thymus and spleen, while
CTL were distributed in regional lymph nodes and spleen. Surface marker
analysis revealed that only Lyt-1-.2.3+ T-cells participated in suppressor
responses in contrast to both Lyt-1-.2.3+ and Lyt-1+.2.3+ T-cells
participating in cytotoxic responses. The effects of adult thymectomy (ATx)
on the changes of the immunized T-cell subsets were also investigated. In
mice thymectomized 3 weeks previously, the Ts activity was abrogated,
whereas the CTL activity increased markedly and Lyt-1+.2.3+ T-cells were
not detected. The results suggest that CTL or their precursors bearing
Lyt-1+.2.3+ phenotype and Ts bearing Lyt-1-.2.3+ phenotype are short-lived
lymphocytes. Accordingly, it is suggested that in tumor-bearing mice
short-lived Ts are generated earliest with target specificity and, due to
the reciprocal relationships between Ts and CTL activities, may have a
modulating influence on CTL; furthermore, ATx may alter the patterns of
generation of the precursor T-cells and Ts. 
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T-cell recruitment regulated 
role in immune response. 

Koga Y; Taniguchi K; Nomoto K 
Immunobiology (GERMANY, WEST) 
0171-2985 Journal Code: GH3
Languages: ENGLISH
Document type: JOURNAL ARTICLE
The dynamics of the number
prostaglandin E2
Corynebacterium

increased but decreased to lower than the normal level thereafter
absolute number of T cells in the spleen began to increase after the
level dropped. But such an increase of T cells was not observed in ATx
challenged with C. parvum. Moreover, replenishing the mice with exogenous
PGE2 in the period of low PGE2 halted selectively the increase of T cells
in the spleen. This enlarged T cell subset responded to PHA, expressed
Lyt-1+2+, and was sensitive to PGE2. And this T-cell subpopulation exerted
a suppressive effect on antibody response in low PG environment, but lost
its inhibitory effect in high PG milieu. These results suggested that an
immature T cell subset is recruited from the thymus in a low PG state and
participates as regulator cells in immune response at peripheral lymphoid 
organs. 
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Injection of mouse thyreoglobulin and/or adult thymectomy do not break 
tolerance to thyreoglobulin during the lupus like graft versus host disease 
in mice. 

van Rappard-van der Veen FM; Kong YM; Rose NR; Kimura M; Gleichmann E
Clin Exp Immunol (ENGLAND) Mar 1984, 55 (3) p525-34, ISSN 0009-9104
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In a previous paper (Gleichmann, van Elven & van der Veen, 1982), it had
been reported that, in contrast to lupus like autoantibodies such as
anti-DNA, autoantibodies to mouse thyroglobulin (MTg) were not detectable
in serum of F1 mice suffering from a lupus like graft versus host disease
(GVHD) (GVH F1). In the present paper, possible explanations for this
restricted autoantibody formation during the potent allogeneic stimulation
were investigated. The main question was whether the natural level of
circulating MTg was too low to induce the formation of anti-MTg antibodies
in GVH F1 mice. Existence, in the F1 mice studied, of B cells capable of
producing anti-MTg antibodies was demonstrated by injection of
lipopolysaccharide (LPS) and exogenous MTg. However, MTg injected into
various F1 mice at the onset of the GVH reaction (GVHR) failed to overcome
the lack of antibody formation to MTg even though the GVHR led to a severe
lupus like disease. Furthermore, adult thymectomy (ATx) of either the
recipients, the donors, or both also did not break tolerance to MTg during
the GVHR, irrespective of administration of exogenous MTg. Thus, neither
intravenous injection of MTg nor ATx, designed to remove T suppressor (TS)
cells, is adequate to enable an autoantibody response to MTg during lupus
like GVHD. Hence, the non-specific T cell help that causes lupus like GVHD
seems to be intrinsically insufficient to trigger the Tg reactive B cells.
We suggest that globular proteins, such as Tg, require specific T cell
help. In the presence of only non-specific T help, self-antigens such as
DNA seem to be more apt than globular proteins to provide an effective
signal 1 to the corresponding autoreactive B cells.
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Differentiation and maturation of thy-1 negative bone marrow cells. I. 
Effects of the thymus and radioresistant helper functions on the maturation 
of precursor cells specific for heterologous erythrocytes. 

Gondo H; Taniguchi K; Kubo C; Nomoto K

J Clin Lab Immunol (ITALY) Sep 1983, IS 
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Document type: JOURNAL ARTICLE 

The effects of adult thymectomy (ATx) and preimmunization on the
differentiation of cells responsible for delayed footpad reaction (DFR),
plaque forming cells (PFC) and cell mediated lympholysis (CML) were
examined in lethally irradiated and thy-1 negative bone marrow cell
reconstituted C57BL/6 recipients. ATx reduced the degrees of immune
responses in irradiated and reconstituted mice. When the recipients had
been preimmunized, all of DFR, PFC and CML became detectable even in
irradiated and reconstituted ATx mice. Preimmunization also evoked early
maturation of precursor cells for CML. Therefore, it was suggested that
radioresistant helper effects, presumably in the presence of antigens,
could promote the differentiation and maturation of T cell precursors in
bone marrow in the absence of the thymus. We also demonstrated differences
in restoration periods of such responses after lethal irradiation and
reconstitution. One or two weeks following irradiation and reconstitution,
DFR was first detectable. On the other hand, the generation of PFC was
detected later than 2 weeks after bone marrow cell reconstitution, and it
took over 4 weeks for thy-1 negative bone marrow cells to raise CML. T
cells responsible for DFR may have lower dependency on the thymus than
those for PFC and CML. 
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Characterization of a concanavalin A-induced amplifier T cell which 
augments in vitro antibody responses to DNP-Ficoll.
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The addition of the T-cell mitogen concanavalin A (Con A) on Day 2 of a
4-day in vitro culture of murine spleen cells with the thymus-independent
(TI) antigen DNP39-Ficoll resulted in significant enhancement of the direct
antitrinitrophenyl (TNP) plaque-forming cell (PFC) response. This
enhancement was mediated by a nylon wool- and
antiimmunoglobulin-nonadherent amplifier T cell (TA). TA activity was not
eliminated by in vitro treatment of T cells with anti-Thy 1.2 and
complement (C). TA activity could be eliminated by pretreatment of mice
with antilymphocyte serum (ALS) in vivo, followed by in vitro treatment of
T cells with anti-Thy 1.2 + C. Thus, TA appear to bear a low surface
density of Thy-1 antigen. These TA were relatively resistant to ALS used
alone, to cyclophosphamide, and to low dose in vitro irradiation. TA were
still present in the spleen 14 weeks after adult thymectomy (ATx). They
were I-J positive and apparently belonged to the Lyt 1+2- T-cell subset.
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Polygenic control of the immune response to F antigen.
Silver DM; Lane DP
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The ability to produce an autoimmune response to F antigen in mice is 
under H-2-linked and non-H-2-linked Ir-gene control. There is an absolute
requirement for a k allele at H-2K or I-A in order to produce anti-F
antibodies. Low and high responsiveness is controlled by a non-H-2-linked
Ir gene which behaves in a similar fashion to Ir-3, in that as the dose of
F-antigen is lowered, low responders behave as high responders and vice
versa. This conversion from low to high responsiveness also occurs within
a month after ATX. Most F1 hybrids derived from (responder X nonresponder)
parents bearing identical F-types behave as dominant nonresponders. As a
result of ATX, such F1 mice convert to high responders. This conversion
occurs if the animals are not immunized before day 90. If they receive F
antigen prior to that time, they remain nonresponders for 7-9 months. One
F1 combination, AKD2, behaves as a dominant high responder. Genetic
analysis showed that the presence of a k allele at H-2K or I-A and a
non-H-2-linked Ir gene inherited from the AKR mice determined dominant
responsiveness. No manipulation of the immune response or combination of
genes converted nonresponders lacking a k allele into responders. Such
complex genetic control suggests regulation by a number of independently
segregating loci whose function it is to limit the autoimmune response to F
antigen.
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Regulation of contact sensitivity to DNFB in the mouse: effects of adult
thymectomy and thymic factor.

Erard D; Charreire J; Auffredou MT; Galanaud P; Bach JF

J Immunol (UNITED STATES) Oct 1979, 123 (4) p1573-6, ISSN 0022-1767
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The contact sensitivity response to DNFB is decreased after adult
thymectomy (ATX). This response decreases to 50% of the control response of
normal age-matched mice as soon as 3 weeks after ATX and is not further
depressed 9 to 16 weeks after ATX. These results suggest that two T cell
subsets of different lifespan are involved in the anti-DNFB response. A
circulating thymic factor (FTS) is able to restore the contact sensitivity
response to DNFB when injected 3 to 9 weeks after ATX but not 16 weeks
later. By contrast, FTS has a depressive effect on the contact sensitivity
response to DNFB of normal mice through a cyclophosphamide-sensitive T cell
subset. These results suggest that FTS regulates DNFB contact sensitivity
by acting on a cyclophosphamide-sensitive T cell subset, still present 9
weeks after ATX but absent after 16 weeks. Thus although the T cell defect,
causing a depression of the contact sensitivity reaction to DNFB is
quantitatively similar 3 and 16 weeks after ATX, its nature is probably
different.
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Masugi nephritis in T cell-depleted state. Influence of adult thymectomy 
and anti-thymocyte serum.

Oite T
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The effects of surgical removal of the thymus or the administration of
antiserum to thymus-derived lymphocytes on the development of Masugi
nephritis were investigated in Wistar rats. While thymectomy at weaning
(ATx) had no significant effect on the humoral antibody response to
injected rabbit IgG, repeated injections of rabbit anti-rat thymocyte serum
(ATS) suppressed it remarkably. Neither treatment showed morphological
evidence that glomerular inflammation and injury were suppressed in the
autologous phase of Masugi nephritis. Glomerular lesions in rats receiving
the nephrotoxic serum (NTS) one month after ATx (ATx+NTS) appeared to be
more severe in hypercellularity, mitotic counts, the amount of deposited
fibrin-related substances and crescent formation than those in other
nephritic groups. Morphological study of ATS+NTS rats revealed that the
glomerular changes were nearly equal to those of NTS-injected control rats,
in spite of markedly suppressed humoral response to injected rabbit IgG and
the absence of host IgG along the glomerular capillary walls.
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Regulation de l'hypersensibilite de contact par
de la thymectomie a l'age adulte chez la Souris.

Erard D; Charreire J; Auffredou MT; Galanaud
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6 weeks after adult thymectomy (ATx) in the Mouse, the contact
sensitivity reaction to dinitrofluorobenzene (DNFB) is enhanced. 4 to 7
months after ATx, this reaction is deeply, but incompletely depressed,
whereas the concomitant antibody response is not affected. These results
suggest that both suppressor and amplifier T lymphocytes, the life
span of which is different after ATx, are involved in the regulation of
contact sensitivity. The effect of circulating thymic factor on this
reaction suggests that this factor acts exclusively, at least in short
treatments, on the suppressor function.
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Subpopulations of splenic T cells regulating an antihapten antibody
response. I. Helper and amplifier cells.

Muirhead DY; Cudkowicz G

J Immunol (UNITED STATES) Feb 1978, 120 (2) p579-85, ISSN 0022-1767
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There was a pronounced quantitative difference between the helper
activities of B6C3F1 splenic T cells sensitised with unmodified vs modified
antigens of SRBC. Modified SRBC induced the greater helper activity which
was measured by the magnitude of an anti-TNP response (IgM and IgG)
elicited in vivo by virgin B lymphocytes. Antigen modification was produced
by conjugating SRBC with the hapten or simply by incubating SRBC in
cacodylate buffer. There were restrictions with respect to both erythrocyte
species and mouse strains for this differential priming to occur. The
relatively poor performance of SRBC-primed T lymphocytes was apparently not
due to suppressor T cells, but rather to activation of only one of two
identified T cell subpopulations required for full helper activity.
Unmodified SRBC activated a subpopulation of "helper" cells characterized
as sensitive to elimination by ATS and long-lived after ATx, but failed to
activate in B6C3F1 mice a second subpopulation of "amplifier" cells
resistant to elimination by ATS and short-lived after ATx. In contrast,
modified SRBC activated both helper and amplifier cells. Under appropriate
conditions these subsets of T cells were strongly synergistic in promoting
antihapten antibody formation especially of the IgG class. The involvement
of two distinct types of T lymphocytes in the positive regulation of
antibody responses raises interesting and novel questions concerning the
sequence of events in the triggering of B cells and the subsequent
development of the response.
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Immunity to sporozoite-induced malaria infection in mice. I. The effect
of immunization of T and B cell-deficient mice.

Chen DH; Tigelaar RE; Weinbaum FI
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The cellular basis of immunity to sporozoites was investigated by examining
the effect of immunization of T and B cell-deficient C57BL/6N X BALB/c AnN
F1 (BLCF1) mice compared to immunocompetent controls. Immunization of T
cell-deficient (ATX-BM-ATS) BLCF1 mice with x-irradiated sporozoites did
not result in the generation of protective immunity. The same immunization
protocols protected all immunocompetent controls. In contrast, B
cell-deficient (micron-suppressed) BLCF1 mice were protected by
immunization in the majority of cases. The absence of detectable serum
circumsporozoite precipitins or sporozoite neutralizing activity in the
micron-suppressed mice that resisted a sporozoite challenge suggests a
minor role for these humoral factors in protection. These data demonstrate
a preeminent role for T cells in the induction of protective immunity in
BLCF1 mice against a P. berghei sporozoite infection.



7/7/28 (Item 13 from file: 155)

DIALOG(R)File 155:MEDLINE(R)

(c) format only 1994 Dialog Info. Svcs. All rts. reserv.

02732639 75139639

Studies on the thymic dependence of the immunoglobulin classes of the
mouse (38570).
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The influence of the thymus on serum immunoglobulin (Ig) concentration was
studied by a comparison of serum Ig levels in congenitally athymic (nu/nu)
mice versus control littermate heterozygotes and adult thymectomized,
irradiated, bone marrow reconstituted mice (ATx plus B) versus adult
thymectomized, irradiated mice reconstituted with bone marrow and thymus
(ATx plus BT). In the former group IgG1, IgA and IgG2a were 8%, 17% and
31% of controls. IgM levels were increased (340%) compared to controls.
When ATx plus B mice were compared with nonirradiated controls,
significant depressions were noted in serum IgG1 and IgM. The only
significant decrease in serum Ig levels between ATx plus B and ATx plus BT
was in IgG1. These results are discussed in terms of the effects of thymic
influence, residual T lymphocyte population differences between the two
groups, and the effect of irradiation.
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Cellular cooperation during in vivo anti-hapten antibody responses. III. 
The helper cell activity of activated thymocytes, of spleen cells treated 
with anti-theta serum, and of spleen cells from anti-thymocyte
serum-treated or adult thymectomized donors. 

Janeway CA Jr.
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An adoptive secondary anti-2,4-dinitrophenyl (DNP) antibody response
involving T-B cell collaboration has been studied. In particular, attempts
have been made to affect the unexpectedly steep log dose-response curve
obtained when graded numbers of helper cells are transferred to irradiated
recipients given a fixed number of B cells (premium effect). A variety of
means were used to alter helper cell activity, and this activity was then
measured quantitatively, as was the ability of the helper cells present
after these treatments to give a premium effect. It was shown that
activated T cells are approximately twice as active as spleen cells in
helper activity and give a comparable premium effect. Graded doses of
anti-theta serum plus complement markedly reduce the helper activity of
spleen cells without affecting the premium effect given by the residual
cells. Treatment of primed cell donors with limited doses of heterologous
anti-mouse thymocyte serum (ATS) before transfer does not affect B cell
activity, but readily inactivates helper cells, again without affecting the
premium effect given by the residual cells. Adult thymectomy (ATx) of
helper cell donors before priming with carrier led initially to increased
helper activity relative to age-matched control donors. This increase may
reflect the loss of nonspecific suppressor T cells from spleens shortly
after ATx. Late after ATx, there was about a 2-fold decrease in helper
activity, probably reflecting a loss of helper cell precursors. At no time
was there any change in the premium effect. In view of the failure of any
of the techniques used to abolish the premium effect given by helper cells
in this response, it seems likely that this premium effect is due to the
cooperative interaction of two very similar types of mature T cell.
Alternatively, the premium effect observed here may result from the
by independent factors.
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IMMUNOHISTOCHEMICAL DETECTION OF BREAST
PRETRANSPLANT BONE MARROW (MEETING ABSTRACT) 
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Detection of small numbers of breast cancer cells 
assessing the efficacy of purging of tumor from bone
autologous transplantation (ATx). We have used a mixture of
antibodies, including 260F9, 520C9, and 317G5 (Baxter Corp),
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immunohistochemical assay to identify breast tumor cells in bone marrow. In
a series of experiments in which CAMA breast tumor cells were added to
normal marrow, tumor cells were consistently detected in bone marrows at a
concentration of one tumor cell per 10(6) normal bone marrow cells. Tumor
cells were also detected in buffy coats of marrows from patients undergoing
ATx. Among 51 specimens examined, tumor cells were found by
immunohistochemical staining of buffy coats in six specimens which had been
tumor-negative by conventional light microscopy. Immunohistochemical
staining of bone marrow buffy coats may more accurately reflect the
presence or absence of tumor in bone marrow than conventional light
microscopic examination of bone core sections.
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EFFECT OF ADULT THYMECTOMY ON TUMOUR IMMUNITY IN MICE
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The effect of adult thymectomy (ATx) on (1) the growth of the P815
mastocytoma in DBA/2J mice, (2) the ability of mice to generate cytotoxic
cells specific for P815, and (3) the ability of tumor-bearing animals to
generate suppressor cells specific for this tumor was studied. Spleen cells
from adult mice that had been Tx 8 wk previously demonstrated a severely
impaired primary cytotoxic response to P815 tumor cells, whereas their
cytotoxic responses to allogeneic cells (C57BL/6) and to non-H-2 antigens
(BALB/c) and their ability to form a primary antibody response to sheep RBC
were unimpaired. Suppressor T cells, specific for P815 cells, appeared
after 4-8 days in the thymuses of animals inoculated with P815 cells. No
differences in tumor growth between ATx animals and sham-operated controls
were observed, and the Tx tumor-bearing animals and untreated controls had
equal levels of specific suppressor cells in their lymph nodes. Severely
thymocyte-deprived mice that had been Tx, irradiated, and reconstituted
with either marrow or spleen cells 8 wk before tumor implantation succumbed
more rapidly to metastatic tumor than did all control animals. The data
suggest that tumor immunity and temporary tumor containment, at least with
this tumor line, depend on the presence of an intact immune system. (SB
Refs)
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Immune competence in 90Sr-exposed, adult thymectomized and
antilymphocyteglobulin-treated CBA mice. II. Reticuloendothelial phagocytic
function and in vitro mitogen responsiveness of spleen cells 

Bierke P. 

The Unit of Experimental Pathology, Faculty of Veterinary Medicine, 
Swedish University of Agricultural Sciences, Uppsala Sweden 

ACTA ONCOL. (Sweden), 1990, 29/5 (fclS-feSl) CODEN: ACTOE ISSN:
0284-186X

LANGUAGES: English

The hypothesis that immune system failure plays a role in the development
of radiation induced tumours was recently investigated experimentally.
Young adult CBA mice, intact or immunocompromised by adult thymectomy (ATx)
and/or antilymphocyteglobulin (ALG) treatment, were exposed to single doses
of 90Sr, after which tumour development was monitored. To evaluate the
experimental results required knowledge about the immunological
responsiveness of the mice. The present paper contributes to that knowledge
by reporting on the in vitro responsiveness of lymphoid cells to mitogens
(LPS, PHA, Con-A) and the in vivo phagocytic functioning of the
reticuloendothelial system (RES), measured as the rate of clearance of
125I-albumin micro-aggregates in peripheral blood. 90Sr, ATx and
ALG-treatments, separately and in combination, suppressed mitogenic
lymphoid cell activation, whereas the RES phagocytic function remained
unchanged, except in response to 90Sr+ALG treatment, which seemed to 
slightly inhibit phagocytic activity.
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Radiostrontium-induced oncogenesis and the role of immunosuppression. II.
Influence of 90Sr dose, adult thymectomy and antilymphocyteglobulin
treatment on the development of lympho-reticular and extraskeletal
neoplastic lesions in CBA mice

Bierke P.; Nilsson A.
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Pathology, Faculty of Veterinary Medicine, Swedish University of 
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The significance of depressed immune function for the development and 
progression of tumours induced by 90Sr (mainly osteosarcomas and malignant
lymphomas) was investigated in a series of experiments by comparing the
tumour responses in normal mice with those in immunocompromised mice. The
present paper (part II) reports on lympho-reticular (LR) and extraskeletal
neoplastic lesions in male CBA/SU mice after exposure to different single
doses of 90Sr with or without additional immunosuppression by adult
thymectomy (ATx) and/or prolonged anti-lymphocyteglobulin (ALG) treatments.
Neoplastic lesions in bone were reported in part I. The status of the
animal's immune system and responsive ability were examined in parallel
experiments. The tumour yields were analysed in relation to the dosage of
90Sr and the immunosuppressive treatments employed. Although the incidences
and latency times of induced tumours were clearly dose-dependent, they were
never significantly influenced by ATx/ALG treatments. Thus, no substantial
support was gained for the theory that the immune system plays a
controlling or modifying role in 90Sr carcinogenesis. The results, which
are in agreement with the bone tumour responses, suggest that 90Sr induced
tumours either do not express the antigens necessary for immune rejection
or that the decline in immune responsiveness induced by ATx/ALG was of
little consequence for tumour development and spread. The pathogenesis of
90Sr induced malignant lymphomas (MLs) and their immunophenotypes are
discussed.
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Radiostrontium-induced oncogenesis and the role of immunosuppression. I.
Influence of 90Sr dose, adult thymectomy and antilymphocyteglobulin
treatment on the development of neoplastic and preneoplastic lesions in the
skeleton of CBA mice.

Bierke P.; Nilsson A.
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Ionizing irradiation by incorporated strontium-90 exerts two major 
effects: it induces tumours (mainly osteosarcomas and lymphoreticular
tumours) and depresses the immune system. The interrelation between these
functions, i.e. the significance of decreased immunological responsiveness
in the oncogenic process, remains unclear. The influence of the 90Sr dose
and the role of immune modulation on the tumour yield were investigated in
young adult CBA mice. The animals were exposed to different single doses of
90Sr and, in addition, some groups were subjected to long-term unspecific
immune suppression by adult thymectomy (ATx) and/or prolonged
antilymphocyteglobulin (ALG) treatment. The present paper (part I) reports
on the effects of the treatments on bone tumour responses as reflected by
incidence, multiplicity, latency time, histologic characteristics and
growth behaviour. The histogenesis of osteosarcomas, as evidenced
morphologically by preneoplastic and early neoplastic growth, is
illustrated and discussed. The results demonstrate a positive dose-response
relationship for osteosarcomas, in which the relative incidences of the
various osteosarcoma subtypes were differentially affected. Thus,
well-differentiated tumours were gradually replaced by less differentiated
types as the dose decreased. A correlation was also observed between the
incidence of osteosarcomas and that of assumed preneoplastic lesions in the
same bones and sites. Immune suppression by ATx and/or ALG did not
distinctly alter the neoplastic or preneoplastic responses at any
dose-level of 90Sr.
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Influence of sup 9sup 0Sr, adult thymectomy and antilymphocyteglobulin on
haematopoietic tissues and peripheral blood leucocytes in CBA mice 

Bierke P. 
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The role of long-time immune suppression in carcinogenesis induced by the 
long-lived emitter sup 9sup 0Sr, is investigated in an ongoing study. The 
experimental design is based on the assumption that impaired immune 
responsiveness, by other means than sup 9sup 0Sr, might increase the 
neoplastic response in exposed individuals, and thus reflect a protective 
function, if existing. Intercomparison is made of the tumour yield in mice
exposed to different single doses of sup 9sup 0Sr and simultaneously
subjected or not to long-term immune suppression by adult thymectomy (ATx)
and/or antilymphocyteglobulin (ALG) treatment. Information on the general
condition and responsiveness of the immune system, in the respective
models, during tumour expectancy time, is essential for a conclusive
evaluation of the results. To meet these demands the present paper reports
on histopathologic alterations in immune organs and changes in white blood
cell counts, induced by the different combinations of sup 9sup 0Sr, ATx and
ALG treatment. The results confirm the prediction that ATx + ALG is an
efficient and, with respect to the purpose of the study, suitable treatment
for additive long-term depression of the immune system in sup 9sup 0Sr
irradiated mice, evidenced in particular by increased depletion of 
monomorphonuclear cells (MNC) in lymphoid organs and peripheral blood. 
Subsequent reports will deal with functional immune parameters. 
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6016443 EMBASE No: 86011503 

Cell-mediated lympholysis (CML) to allogeneic and trinitrophenyl
(TNP)-modified cells: Re-evaluation of the role of the thymus in CML

Gondo H.; Tokuda N.; Taniguchi K.; Nomoto K.

Department of Immunology, Medical Institute of Bioregulation, Kyushu
University 69, Fukuoka JAPAN

J. CLIN. LAB. IMMUNOL. (SCOTLAND), 1985, 17/3 (131-136) CODEN: JLIMD
LANGUAGES: ENGLISH

The role of the thymus in cell-mediated lympholysis (CML) to allogeneic 
and trinitrophenyl (TNP)-modified cells was reevaluated using adult
thymectomized (ATx) and neonatal thymectomised (NTx) mice. CML to
TNP-modified cells was used as the model of the major histocompatibility
complex (MHC)-restricted CML. CML to TNP-modified cells, but not to
allogeneic cells, was reduced at late stages after ATx and remained low even
in NTx-11 (thymectomy at 11 days of age) mice. Interleukin-2 (IL-2)
producing activity in such mice was lower than that in the normal controls,
while allogeneic CML was maintained at the same level as seen in the
controls. The addition of exogenous IL-2 to in vitro culture led to a
restoration of the generation of CML to TNP-modified cells in neonatal
thymectomised mice. These results suggested that thymus-dependency differs
in the 2 forms of CML and that the CML to TNP-modified cells showed a
higher thymus-dependency than did the allogeneic CML. Moreover, the
activity of helper T cells may exert a direct influence on CML to 
TNP-modified cells, as compared to allogeneic cells. 
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5631613 EMBASE No: 841S7284

Influence of sup 9sup 0Sr, [...] and antilymphocyteglobuline (ALG)
on T-cells in mouse peripheral [...]

Bierke P.; Gidlund M.

Unit of Radiologic Oncology, [...] of Stockholm, S-172 46 Sundbyberg
SWEDEN

ACTA RADIOL. (SWEDEN), 1984, 23/1 (61-64) CODEN: AROND
SERIES: SER. ONCOL. RADIAT. THER. PHYS. BIOL.
LANGUAGES: ENGLISH

Three groups of male CBA mice were treated by (1) sup 9sup 0Sr injection
(14.8 kBq/g body weight i.p.), (2) adult thymectomy (ATx) +
antilymphocyteglobuline (ALG), and (3) sup 9sup 0Sr + ATx + ALG,
respectively, and one untreated group served as a control. The relative
number of T-lymphocytes was determined in peripheral blood mononuclear
cells of individual animals using a dye exclusion cytotoxicity assay with
antisera against the T-cell surface membrane marker Thy 1.2 and complement.
Total and differential leucocyte counts were also performed. sup 9sup 0Sr
irradiation decreased the total number of leucocytes irrespective of type,
and the combined treatment of ATx and ALG decreased mainly mononuclear
cells and particularly T-cells. The most advanced T-cell depletion in
peripheral blood was found in the sup 9sup 0Sr + ATx and ALG treated group
with a 97 per cent reduction as compared with untreated controls. ATx + ALG
thus proved to be useful for blood T-cell depletion in mice treated
simultaneously with sup 9sup 0Sr, and might provide a valuable tool in
investigations on the possible role of cell-mediated immune response in
radiation-induced oncogenesis, with particular emphasis on selective
depletion within the monomorphonuclear compartment.
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2584389 EMBASE No: 81242583

Serum thymic factor (FTS) restores the ability of adult thymectomized
mice to be suppressed by hapten-modified self

Grouix B.; Charreire J.; Erard D.; et al.

INSERM U 13, 92140 Clamart FRANCE

CELL. IMMUNOL. (USA), 1981, 64/1 (144-149) CODEN: CLIMB
LANGUAGES: ENGLISH

The injection of trinitrophenyl (TNP) conjugated spleen cells (TNP-SC)
into normal recipients induces a specific suppression of the anti-TNP
antibody response to the T-independent antigen TNP polyacrylamide. The
T-cell dependency of this suppression is shown by two arguments. (i)
Splenic T cells from TNP-SC-treated mice suppress the in vitro antibody
response of normal mouse spleen cells. (ii) The anti-TNP response of adult
thymectomized (ATX) mice is not suppressed after injection of TNP-SC.
Treatment of such ATX mice with a synthetic analog of serum thymic factor
(FTS) 7-9 weeks after thymectomy restores their susceptibility to
TNP-SC-induced suppression.
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2064526 EMBASE No: 80201845

Trinitrobenzene sulfonic acid effects in two amphibian model systems

Mette S.A.; Ruben L.N.

Dept. Biol., Reed Coll., Portland, Ore. 97202 USA

CELL. IMMUNOL. (USA), 1980, 53/2 (298-306) CODEN: CLIMB
LANGUAGES: ENGLISH

The induction of hapten-specific tolerance was investigated in two
amphibia, Notophthalmus viridescens and Xenopus laevis. Responses to
trinitrophenylated (TNP)-Ficoll and TNP-lipopolysaccharide (LPS), as well
as to horse erythrocytes (HRBC) were examined in both species, following an
intraperitoneal injection of 2,4,6-trinitrobenzenesulfonic acid (TNBS). The
less evolutionary advanced newt, Notophthalmus, failed to respond to all
three immunogens after TNBS administration. While Xenopus became completely
tolerant upon challenge with TNP-Ficoll and partially tolerant with
TNP-LPS, full capacity to respond to HRBC was retained. Therefore, specific
tolerance was induced in Xenopus, but not in Notophthalmus. The tolerance
with TNP-Ficoll in the toad, Xenopus, was short lived and return to
responsiveness appeared to be related inversely to levels of TNP protein in
the sera of TNBS-treated animals. The thymic dependence of this tolerance
could not be determined, because adult thymectomy (ATx) abrogated the
response to TNP-Ficoll in control nontolerized animals. Responses to
TNP-LPS and HRBC were unaffected by ATx. These data, in conjunction with
TNBS-induced differential tolerance to the TNP moiety, suggest
carrier-dependent hapten-specific B-cell heterogeneity in the toad which
differs in certain ways from that recently described for murine systems.
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(c) 1994 Elsevier Science B.V. All rts. reserv.

1O51B04 EMBASE No: 78SS3fefe5

The involvement of Tsub 1 and Tsub 2 lymphocytes in primary and secondary
delayed type hypersensitivity responsiveness

Van Der Kwast Th.H.; Benner R.

Dept. Cell Biol. Genet., Erasmus Univ., Rotterdam NETHERLANDS

ANN. IMMUNOL. (FRANCE), 1977, 128C/6 CODEN: ANIMC
LANGUAGES: ENGLISH SUMMARY LANGUAGES: FRENCH

The contribution of short-lived and long-lived T lymphocytes (Tsub 1 and
Tsub 2 lymphocytes respectively) to primary and secondary delayed type
hypersensitivity (DTH) responsiveness was studied by means of thymectomy of
adult mice (ATx) with sheep red blood cells as the antigen. Within 1 mth
after ATx a fall of the primary and secondary DTH responsiveness was found
of 20% and 50% respectively. This reduction can be attributed to the loss
of short-lived T cells. ATx 8 mth prior to the administration of the
priming dose caused a increase of the primary and the secondary DTH
responsiveness of about 50% and 65% respectively. These data suggest that
both the short-lived Tsub 1 and the long-lived Tsub 2 precursor cells
account for the occurrence of cells mediating the primary DTH response and
memory cells responsible for the secondary DTH response. The contribution
of both populations of precursor cells in primary and secondary DTH
responsiveness appeared to be only proportionally different.
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Regulation of anti-hapten antibody response by chemically modified
carrier antigen preferentially provoking delayed-type hypersensitivity
(DTH). II. DTH-reactivity and carrier-specific suppression of anti-DNP
antibody response

Machida A.; Kumazawa Y.; Mizunoe K.

Dept. Immunol., Kitasato Inst., Tokyo JAPAN

MICROBIOL. IMMUNOL. (JAPAN), 1977, 21/8 (439-449) CODEN: MIIMD
LANGUAGES: ENGLISH

Experiments were carried out to determine whether or not the cell
populations involved in DTH and in the suppression of antibody response are
identical. The effects of 4 treatments, i.e., adult thymectomy (ATx), X
irradiation, anti mouse thymocyte serum (ATS) and hydrocortisone (HC) on
the induction of DTH and on the carrier specific suppression of antibody
response were observed in mice immunised with chemically modified antigen,
dodecanoyl BSA (d BSA), emulsified with complete Freund's adjuvant (CFA),
with the following results: (1) DTH induced by immunization with d BSA
remained constant in adult thymectomised mice, whereas the suppression of
antibody response was not inducible in these animals. (2) Injection of low
doses of ATS caused the depression of DTH in mice primed with d BSA, but
did not affect the suppressive activities of their spleen cells. (3)
Sublethal X irradiation 1 wk prior to d BSA priming inhibited the
generation of suppressor cells but did not affect the generation of cells
mediating DTH. The suppressive effect was also abrogated by sublethal X
irradiation given 2 days after immunization with DNP BSA (14 days after
priming with d BSA). (4) The treatment of animals with HC 2 days before the
footpad challenge or immunisation with DNP BSA depressed the ability of
animals to induce both DTH and the suppression of antibody response.
However, the latter was more sensitive to HC than the former. In addition
to these results, it was also found that d BSA primed spleen cells were
capable of suppressing anti DNP response, but not of inducing DTH
reactivity upon transfer to recipient mice. These results suggest that DTH
reactivity and the carrier specific suppression of anti hapten antibody
response induced by injection of d BSA are mediated by different cell
populations.
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9940E6 EMBASE No: 7S16>4£51

Functional heterogeneity among the T derived lymphocytes of the mouse.
VII. Conversion of T1 cells to T2 cells by antigen

Araneo B.A.; Marrack P.; Kappler J.W.

Div. Immunol., Dept. Microbiol., Univ. Rochester Cancer Cent., Sch. Med.,
Univ. Rochester, N.Y. 14642 USA

J. IMMUNOL. (USA), 1977, 119/2 (765-771) CODEN: JOIMA
LANGUAGES: ENGLISH

The T1 subpopulation of peripheral T cells was defined in mice by its
short half life, insensitivity to anti-thymocyte sera (ATS) in vivo, and
slow kinetics of response to antigen. The T2 subpopulation was defined by
its long lifetime, elimination by ATS in vivo, and rapid response to
antigen. Mice containing only T1-type T cells were constructed by adult
thymectomy (ATx) followed immediately by the elimination of T2 cells by ATS
treatment. Immunization of these mice with SRBC led to the production of
memory helper cells in the T2 subpopulation. This process depended on the
presence of T1 cells and for the most part required SRBC immunisation,
although a few SRBC-specific T2 cells reappeared in the mice in the absence
of antigen. We conclude that T1 cells can give rise to T2 cells in an
antigen-driven step and that the 2 populations correspond to virgin and
memory T cells, respectively.
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884107 EMBASE No: 78050486

Biochemical characterisation of a serum thymic factor

Bach J.F.; Dardenne M.; Pleau J.M.; Rosa J.

INSERM U 25, Hop. Necker, Paris FRANCE

NATURE (LOND.) (ENGLAND), 1977, 266/5597 (55-57) CODEN: NATUA
LANGUAGES: ENGLISH

The authors presented direct evidence for the presence in normal blood of
a thymus dependent nonapeptide of molecular weight about 900, which adds to
the list of biologically active peptides which could be hormones. The high
activity of the synthetic peptide in the rosette assay of the peptide
synthesized on the basis of the amino acid sequence supports the
specificity of its action. Its activity in various systems suggests that
the natural peptide is involved in T cell differentiation. These include theta
conversion in vitro and in vivo, enhancement of the generation of
alloantigen reactive cytotoxic T cells in Tx mice, induction of suppressor
T cells in NZB mice assayed using antibody production against polyvinyl
pyrrolidone, enhancement of mitogen response in nude mice in vitro and ATx
rats in vivo, and normalisation of the abnormally high level of autologous
erythrocyte binding cells in ATx mice.
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South African psychiatrists' criteria for predicting dangerousness

Zabow T.; Cohen A.

Forensic Psychiatry Unit, University of Cape Town, Valkenberg Hospital,
Observatory, Cape Town South Africa

MED. LAW (South Africa), 1993, 12/3-5 (417-430) CODEN: MELAD ISSN:
0723-1393

LANGUAGES: English SUMMARY LANGUAGES: English

The prediction of dangerousness has emerged as one of the most
controversial issues in forensic psychiatry. It is a value-laden and
ambiguous concept which has not been adequately defined and operationalized
by the law or psychiatry. The validity and reliability of psychiatric
predictions of dangerousness have been brought seriously into question in
the past few decades. The Booysen Commission appointed in South Africa
relates to the problem. An exploratory survey to ascertain the variables
which South African psychiatrists perceive as influencing their decisions
about dangerousness and to compare these findings with those of previous
research was undertaken. One hundred and thirty-eight psychiatrists were
included in a survey by means of a questionnaire. Findings that clinicians'
decisions were significantly influenced by patients' criminal/violent
history were consistent with previous research. The article further
evaluates the responses to the questionnaire variables and the degree as
well as a study of detained 'dangerous' patients to assess accuracy of
evaluations in practice.
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Cellular cooperation during in vivo anti hapten antibody responses. III. 
The helper cell activity of activated thymocytes, of spleen cells treated 
with anti omega serum, and of spleen cells from anti thymocyte serum 
treated or adult thymectomized donors

Janeway C.A. Jr.

Lab. Immunol., NIAID, NIH, Bethesda, Md. 20014 USA

J. IMMUNOL. (USA), 1975, 114/4 (1408-1414) CODEN: JOIMA
LANGUAGES: ENGLISH

An adoptive secondary anti 2,4 dinitrophenyl (DNP) antibody response
involving T B cell collaboration was studied. In particular, attempts were
made to affect the unexpectedly steep log dose response curve obtained when
graded numbers of helper cells are transferred to irradiated recipients
given a fixed number of B cells (premium effect). A variety of means were
used to alter helper cell activity, and this activity was then measured
quantitatively, as was the ability of the helper cells present after these
treatments to give a premium effect. It was shown that activated T cells
are approximately twice as active as spleen cells in helper activity and
give a comparable premium effect. Graded doses of anti theta serum plus
complement markedly reduce the helper activity of spleen cells without
affecting the premium effect given by the residual cells. Treatment of
primed cell donors with limited doses of heterologous anti mouse thymocyte
serum before transfer does not affect B cell activity, but readily
inactivates helper cells, again without affecting the premium effect given
by the residual cells. Adult thymectomy (ATx) of helper cell donors before
priming with carrier led initially to increased helper activity relative to
age matched control donors. This increase may reflect the loss of
nonspecific suppressor T cells from spleens shortly after ATx. Late after
ATx, there was about a 2 fold decrease in helper activity, probably
reflecting a loss of helper cell precursors. At no time was there any
change in the premium effect. In view of the failure of any of the
techniques used to abolish the premium effect given by helper cells in this
response, it seems likely that this premium effect is due to the
cooperative interaction of 2 very similar types of mature T cell.
Alternatively, the premium effect observed here may result from the
interaction of 2 activities of a single type of T cell which is mediated by
independent factors.
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£1 £1216 8 EMBASE No: 75000046

Immunologic tolerance to a hapten. III. Induction of tolerance to
trinitrophenyl in B cells in various differentiation states

Fidler J.M.; Golub E.S.

Dept. Biol. Sci., Purdue Univ., Lafayette, Ind. 47907 USA

J. IMMUNOL. (USA), 1974, 112/5 (1691-1899) CODEN: JOIMA
LANGUAGES: ENGLISH

The induction of tolerance to the TNP hapten in populations of B cells in
various states of differentiation was studied with the free, reactive
hapten trinitrobenzenesulfonic acid (TNBS). An adoptive transfer system was
used in which tolerance was induced in an adoptively transferred B cell
source before the addition of reconstituting thymus cells and immunization
with hapten carrier. Bone marrow cells and cells from fetal liver were
rendered tolerant to trinitrophenyl (TNP) in this manner, showing that
tolerance can be induced in cells from the primary B cell precursor and
stem cell source of both the adult and fetal mouse. Tolerance was induced
in normal spleen cells after adoptive transfer and in spleen cells from
adult thymectomized x-irradiated bone marrow reconstituted mouse (ATXBM)
mice (B cell spleens) after transfer to adult thymectomized x-irradiated
mouse (ATX) hosts. These experiments show the induction of tolerance to TNP
in B cells in another state of differentiation (peripheral B cells).
Tolerance could be induced in B cell sources depleted of T cells, indicating
that immunologic tolerance to TNP may be induced in B cells in the absence
of T cells. The possibility of residual TNBS affecting the thymus cells
used for reconstitution was ruled out by transferring fetal liver cells to
a secondary host after tolerance induction in a primary adoptive host, and
also by rendering ATXBM mice tolerant before the transfer of their spleen
cells to an adoptive host. These maneuvers remove the B cells from the
environment used for tolerance induction, and indicate that TNBS did not
act upon reconstituting thymus cells. The data indicate that tolerance to
TNP can be induced in early differentiation states of the B cell.
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9/3/1 (Item 1 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 150979-04-7
MOLECULAR FORMULA: Unknown
CA NAME(S):

HP=Peptide (9CI)

SB=(Val-Asn-Val-Ile-Ser-Gly-Pro-Ile-Phe-Asp-Tyr-Asp-Tyr-Asp-Gly-Leu-
Xaa-Asp-Thr-Glu-Asp-Lys)
SYNONYMS: Autotaxin fragment (human) 



9/3/2 (Item 2 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 150979-03-6
MOLECULAR FORMULA: Unknown
CA NAME(S):

HP=Peptide (9CI) 

SB= (Thr-Phe-Pro-Asn-Leu-Tyr-Val-Xaa-Ala-Gln-Gly-Leu-Tyr-Trp-Ser) 
SYNONYMS: Autotaxin fragment (human) 



9/3/3 (Item 3 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 150979-02-5
MOLECULAR FORMULA: Unknown
CA NAME(S):

HP=Peptide (9CI) 

SB= (Pro-Glu-Glu-Val-Thr-Xaa-Pro-Asn-Tyr-Leu) 
SYNONYMS: Autotaxin fragment (human) 



9/3/4 (Item 4 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 150979-01-4
MOLECULAR FORMULA: Unknown
CA NAME(S):

HP=Peptide (9CI) 

SB= (Pro-Xaa-Leu-Asp-Val-Tyr-Lys) 
SYNONYMS: Autotaxin fragment (human) 



9/3/5 (Item 5 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 150236-72-9
MOLECULAR FORMULA: C74H99N13O19
RING SYSTEM DATA:

(03) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(04) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Proline (9CI)

SB=L-tyrosylglycyl-L-phenylalanyl-L-leucyl-L-phenylalanyl-L-prolyl-L-
prolyl-L-tyrosyl-L-leucyl-L-seryl-L-seryl-L-seryl-
SYNONYMS: Autotaxin fragment (human) 



9/3/6 (Item 6 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147977-73-9
MOLECULAR FORMULA: C54H86N14O15
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(01) (nr=02; sr=5,6; ar=C4N.01-C6.01; fr=NC4.01-C6.01; ir=333-151-57)
CA NAME(S):

HP=L-Lysine (9CI)

SB=N2-(N-(N-(N-(N-(N-(N-(1-(N2-(N-glycylglycyl)-L-glutaminyl)-L-prolyl)-
L-leucyl)-L-tryptophyl)-L-isoleucyl)-L-threonyl)-L-alanyl)-L-
threonyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/7 (Item 7 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-63-2
MOLECULAR FORMULA: C54H82N12O20
RING SYSTEM DATA:

(02) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(01) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Asparagine (9CI)

SB=N2-(N-(N-(1-(N-(N-(N-(N-(1-(N-L-seryl-L-tyrosyl)-L-prolyl)-L-.alpha.-
glutamyl)-L-isoleucyl)-L-leucyl)-L-threonyl)-L-prolyl)-L-alanyl)-
L-.alpha.-aspartyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/8 (Item 8 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-62-1
MOLECULAR FORMULA: C63H96N16O14
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(01) (nr=01; sr=5; ar=C3N2.01; fr=NCNC2.01; ir=16-195-24)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Tyrosine (9CI)

SB=N-(N-(N-(N-(1-(N2-(N-(N-(N-(N-L-histidyl-L-leucyl)-L-leucyl)-L-
tyrosyl)glycyl)-L-arginyl)-L-prolyl)-L-alanyl)-L-valyl)-L-leucyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/9 (Item 9 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-61-0
MOLECULAR FORMULA: C26H40N6O9
RING SYSTEM DATA:

(01) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Threonine (9CI)

SB=N-(N-(N2-(N-L-tyrosyl-L-leucyl)-L-asparaginyl)-L-alanyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/10 (Item 10 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-60-9
MOLECULAR FORMULA: C33H46N6O8
RING SYSTEM DATA:

(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Phenylalanine (9CI)

SB=N-(N-(N2-(N-L-valyl-L-leucyl)-L-asparaginyl)-L-tyrosyl)-
SYNONYMS: Autotaxin fragment (human)



9/3/11 (Item 11 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-59-6
MOLECULAR FORMULA: C48H67N13O16
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C3N2.01; fr=NCNC2.01; ir=16-195-24)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Serine (9CI)

SB=N-(N-(N-(N-(N2-(N-(N-(N-L-glutaminyl-L-tyrosyl)-L-leucyl)-L-
histidyl)-L-glutaminyl)-L-tyrosyl)glycyl)-L-seryl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/12 (Item 12 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-58-5
MOLECULAR FORMULA: C113H180N28O38
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Arginine (9CI)

SB=L-threonyl-L-.alpha.-glutamyl-L-phenylalanyl-L-leucyl-L-seryl-L-
asparaginyl-L-tyrosyl-L-leucyl-L-threonyl-L-asparaginyl-L-valyl-L-
.alpha.-aspartyl-L-.alpha.-aspartyl-L-isoleucyl-L-threonyl-L-
leucyl-L-valyl-L-prolylglycyl-L-threonyl-L-leucylglycyl-
SYNONYMS: Autotaxin fragment (human) 



9/3/13 (Item 13 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-57-4
MOLECULAR FORMULA: C68H101N17O21
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C3N2.01; fr=NCNC2.01; ir=16-195-24)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Arginine (9CI)

SB=N2-(N-(N-(N-(N-(N-(N-(N-(N-(N-(N-L-.alpha.-aspartyl-L-isoleucyl)-
L-.alpha.-glutamyl)-L-histidyl)-L-leucyl)-L-threonyl)-L-seryl)-L-
leucyl)-L-.alpha.-aspartyl)-L-phenylalanyl)-L-phenylalanyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/14 (Item 14 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-56-3
MOLECULAR FORMULA: C82H123N19O23S
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(03) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Lysine (9CI)

SB=L-valyl-L-asparaginyl-L-seryl-L-methionyl-L-glutaminyl-L-threonyl-
L-valyl-L-phenylalanyl-L-valylglycyl-L-tyrosylglycyl-L-prolyl-L-
threonyl-L-phenylalanyl-
SYNONYMS: Autotaxin fragment (human)



9/3/15 (Item 15 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-55-2
MOLECULAR FORMULA: C56H80N12O17
RING SYSTEM DATA:

(02) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Tyrosine (9CI)

SB=N-(N-(N2-(N-(N2-(N-(N-(1-(1-L-seryl-L-prolyl)-L-prolyl)-L-
phenylalanyl)-L-.alpha.-glutamyl)-L-asparaginyl)-L-isoleucyl)-L-
asparaginyl)-L-leucyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/16 (Item 16 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-54-1
MOLECULAR FORMULA: C53H73N11O17
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(01) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
(01) (nr=02; sr=5,6; ar=C4N.01-C6.01; fr=NC4.01-C6.01; ir=333-151-57)
CA NAME(S):

HP=L-Isoleucine (9CI)

SB=N-(N-(N-(N2-(N-(1-(N-(N-L-tyrosyl-L-.alpha.-aspartyl)-L-valyl)-L-
prolyl)-L-tryptophyl)-L-asparaginyl)-L-.alpha.-glutamyl)-L-
threonyl)-
SYNONYMS: Autotaxin fragment (human)



9/3/17 (Item 17 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-53-0
MOLECULAR FORMULA: C21H36N6O10
CA NAME(S):

HP=L-Serine (9CI)

SB=N-(N-(N-(N-L-glutaminyl-L-alanyl)-L-.alpha.-glutamyl)-L-valyl)-
SYNONYMS: Autotaxin fragment (human) 



9/3/18 (Item 18 from file: 398) 

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-52-9
MOLECULAR FORMULA: C32H44N6O7
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C4N.01; fr=NC4.01; ir=16-136-1)
(02) (nr=01; sr=6; ar= fr=C6.01; ir=46-150-18)
CA NAME(S):

HP=L-Lysine (9CI)

SB=N2-(N-(N-(1-L-tyrosyl-L-prolyl)-L-alanyl)-L-phenylalanyl)-
SYNONYMS: Autotaxin fragment (human)



9/3/19 (Item 19 from file: 398)

DIALOG(R) File 398: CHEMSEARCH(TM)
(c) 1994 Amer. Chem. Soc. All rts. reserv.

CAS REGISTRY NUMBER: 147960-51-8
MOLECULAR FORMULA: C32H44N10O8
RING SYSTEM DATA:

(01) (nr=01; sr=5; ar=C3N2.01; fr=NCNC2.01; ir=16-195-24)
(01) (nr=02; sr=5,6; ar=C4N.01-C6.01; fr=NC4.01-C6.01; ir=333-151-57)
CA NAME(S):

HP=L-Asparagine (9CI)

SB=N2-(N-(N-(N-(N-L-tryptophyl-L-histidyl)-L-valyl)-L-alanyl)-L-
alanyl)-
SYNONYMS: Autotaxin fragment (human)



9/3/20 (Item 1 from file: 5)

DIALOG(R) File 5: BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

11147913 BIOSIS Number: 97347913

Partial cDNA cloning of the autocrine cell motility factor, autotaxin
(ATX)

Murata J; Arestad A; Liotta L A; Stracke M L
Natl. Inst. Health, Natl. Cancer Inst., Bethesda, MD 20892, USA
FASEB Journal 8 (7). 1994. A1445.

Full Journal Title: 85th Annual Meeting of the American Society for
Biochemistry and Molecular Biology, Washington, D.C., USA, May 21-25, 1994.
FASEB Journal

ISSN: 0892-6638

Language: ENGLISH



9/3/21 (Item 2 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

11147796 BIOSIS Number: 97347796

Glycosylation of the tumor cell motility factor, autotaxin, is not
required for activity

Stracke M L; Levine M; Arestad A; Liotta L A

Natl. Inst. Health, Natl. Cancer Inst., Bethesda, MD 20892, USA
FASEB Journal 8 (7). 1994. A1424.

Full Journal Title: 85th Annual Meeting of the American Society for
Biochemistry and Molecular Biology, Washington, D.C., USA, May 21-25, 1994.
FASEB Journal

ISSN: 0892-6638

Language: ENGLISH




9/3/22 (Item 3 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

11053808 BIOSIS Number: 97253808

Characterization of a new scatter factor (CL4-SA) produced by rat mammary
tumor cells

Kopdag H; Hoelzel F; Scherdin U

Inst. Physiol. Chem., Univ. Hosp., Hamburg-Eppendorf, GER
Journal of Cancer Research and Clinical Oncology 120 (SUPPL.). 1994.
R114.

Full Journal Title: 21st National Cancer Congress of the German Cancer
Society, Hamburg, Germany, March 7-11, 1994. Journal of Cancer Research
and Clinical Oncology

ISSN: 0171-5216

Language: ENGLISH

Document Type: CONFERENCE PROCEEDINGS



9/3/23 (Item 4 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

10406739 BIOSIS Number: 96006739

GRANULOCYTE-MACROPHAGE COLONY-STIMULATING FACTOR INDUCES HUMAN
MELANOMA-CELL MIGRATION
KOHN E C; HOLLISTER G H; DIPERSIO J D; WAHL S; LIOTTA L A; SCHIFFMANN E
BLDG. 10, ROOM 2A33, NATL. CANCER INST., BETHESDA, MD 20892, USA.
INT J CANCER 53 (6). 1993. 968-972. CODEN: IJCMA
Full Journal Title: International Journal of Cancer
Language: ENGLISH



9/3/24 (Item 5 from file: 5)

DIALOG(R)File 5:BIOSIS PREVIEWS(R)
(c) 1994 BIOSIS. All rts. reserv.

9093687 BIOSIS Number: 93078687

IDENTIFICATION PURIFICATION AND PARTIAL SEQUENCE ANALYSIS OF AUTOTAXIN A
NOVEL MOTILITY-STIMULATING PROTEIN

STRACKE M L; KRUTZSCH H C; UNSWORTH E J; ARESTAD A; CIOCE V; SCHIFFMANN E;
LIOTTA L A

LAB. PATHOL., NATIONAL CANCER INST., NATIONAL INST. HEALTH, BUILDING 10,
ROOM 2A33, BETHESDA, MD. 20892.

J BIOL CHEM 267 (4). 1992. 2524-2529. CODEN: JBCHA
Full Journal Title: Journal of Biological Chemistry
Language: ENGLISH



9/3/25 (Item 1 from file: 155)

DIALOG(R)File 155:MEDLINE(R)
(c) format only 1994 Dialog Info.Svcs. All rts. reserv.

08903820 94218820

The role of autotaxin and other motility stimulating factors in the
regulation of tumor cell motility.
Stracke M; Liotta LA; Schiffmann E
Laboratory of Pathology, National Cancer Institute, National Institutes
of Health, Bethesda, MD 20892.
Symp Soc Exp Biol (ENGLAND) 1993, 47 p197-214, ISSN 0081-1386
Journal Code: VGF
Languages: ENGLISH
Document type: JOURNAL ARTICLE
?e au=-£ttsfeeke^em
>>>Unrecognizable Command
?e au=stracke, m


Ref   Items  Index-term
E1        3  AU=STRACKE, K. J.
E2        1  AU=STRACKE, K. H.
E3        0 *AU=STRACKE, M
E4        7  AU=STRACKE, M.
E5        1  AU=STRACKE, MARICA
E6        9  AU=STRACKE, MARKUS
E7        2  AU=STRACKE, MARY
E8       11  AU=STRACKE, MARY L.
E9        2  AU=STRACKE, MICHAEL
E10       5  AU=STRACKE, R.
E11       1  AU=STRACKE, ROBERT J.
E12       2  AU=STRACKE, ROLAND

Enter P or PAGE for more


?s e4 or e7
>>>One or more prefixes are unsupported
>>>  or undefined in one or more files.
          7  AU=STRACKE, M.
          2  AU=STRACKE, MARY
S10       9  AU="STRACKE, M." OR AU="STRACKE, MARY"








>>> Unrecognizable Command 


?ds 






Set     Items    Description
S1         46    AUTOCRINE(W)MOTILITY(W)FACTOR? OR AUTOTAXIN
S2        613    ATX
S3        649    S1 OR S2
S4          0    ANTIBO.D?
S5    1718826    ANTIBOD?
S6         73    S5 AND S3
S7         47    RD (unique items)
S8         26    RD S1 (unique items)
S9         25    S8 NOT S7
S10         9    AU="STRACKE, M." OR AU="STRACKE, MARY"

?rd s10
>>>Duplicate detection is not supported for File 398.
>>>Records from unsupported files will be retained in the RD set.
...completed examining records
S11         5    RD S10 (unique items)
?s s11 not s8
            5    S11
           26    S8
S12         4    S11 NOT S8
?s s12 not s7
            4    S12
           47    S7
S13         4    S12 NOT S7


<?t -Csl3/ 

>>>Unrecognizable Command
?t s13/3/1-4



13/3/1 (Item 1 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

121079613 CA: 121(7)79613c JOURNAL
The role of autotaxin and other motility stimulating factors in the
regulation of tumor cell motility
AUTHOR(S): Stracke, Mary L.; Liotta, Lance A.; Schiffmann, Elliott
LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA
JOURNAL: Symp. Soc. Exp. Biol. DATE: 1993 VOLUME: 47 NUMBER: CELL
BEHAVIOUR PAGES: 197-214 CODEN: SSEBA9 ISSN: 0081-1386 LANGUAGE:
English



13/3/2 (Item 2 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

110190136 CA: 110(21)190136j JOURNAL
Biochemical mechanisms of tumor invasion and metastases
AUTHOR(S): Liotta, L. A.; Wewer, U.; Rao, N. C.; Schiffmann, E.; Stracke,
M.; Guirguis, R.; Thorgeirsson, U.; Muschel, R.; Sobel, M.
LOCATION: Lab. Pathol., NIH, Bethesda, MD, 20892, USA
JOURNAL: Adv. Exp. Med. Biol. DATE: 1988 VOLUME: 233 NUMBER: Cancer
Metastasis: Biol. Biochem. Mech. Clin. Aspects PAGES: 161-9 CODEN:
AEMBAP ISSN: 0065-2598 LANGUAGE: English



13/3/3 (Item 1 from file: 6)

DIALOG(R)File 6:NTIS
Comp. & distr. 1994 NTIS, US Dept of Commerce. All rts. reserv.

1491515 NTIS Accession Number: TIB/A90-815SE/XAB

Belastungs- und Beulversuche an axialsymmetrisch belasteten
Rotationsschalen aus Metall im elastisch-plastischen Bereich zur
Ueberpruefung nichtlinearer Rechenprogramme. (Load and buckling tests and
axisymmetrically loaded metal shells of revolution in the elastic-plastic
range for the verification of nonlinear computer programs)

Stracke, M.; Duesing, H.; Krysik, R.; Schmidt, H.

Gesamthochschule Essen (Germany, F.R.). Fachbereich 10 - Bauwesen.

Corp. Source Codes: 049146012; 9900856

Sep 86 293p

Languages: German

Journal Announcement: GRAI9101

In German. Forschungsberichte aus dem Fachbereich Bauwesen, Universitaet
- Gesamthochschule Essen, no. 38.
NTIS Prices: PC E07



13/3/4 (Item 2 from file: 6)

DIALOG(R)File 6:NTIS
Comp. & distr. 1994 NTIS, US Dept of Commerce. All rts. reserv.

1308743 NTIS Accession Number: TIB/A88-80048/XAB

Beulversuche an laengsnahtgeschweissten staehlernen Kreiszylinderschalen
unter Aussendruck im elastisch-plastischen Bereich. (Buckling tests on
longitudinally welded cylindrical steel shells under external pressure in
the elasto-plastic range)

Stracke, M.; Schmidt, H.

Gesamthochschule Essen (Germany, F.R.). Fachbereich 10 - Bauwesen.
Corp. Source Codes: 049146012
Dec 84 10p.

Languages: German
Journal Announcement: GRAI8810

In German. Forschungsberichte aus dem Fachbereich Bauwesen, Universitaet
Essen, Gesamthochschule, no. 28.

NTIS Prices: PC E07
?e ainet} tbptta, 1 
>>>Unrecognizable Command
?e au.=liotta, l



Ref   Items  Index-term
E1        0 *AU.=LIOTTA, L
E2        1  AU=A CRIEPI, N. S.
E3        1  AU=A. B. SIDDIQUE
E4        1  AU=A. FENG
E5        1  AU=A. WULF, WM.
E6        1  AU=A-DAVIDSON, REVATHI
E7        3  AU=A-GRIVAS, D.
E8        1  AU=A-GRIVAS, DIMITRI
E9        1  AU=A-H TAYEL, M.
E10       1  AU=A-KWIE JAP TJOEN SAN,
E11       3  AU=A-MONEIM, M. T.
E12       1  AU=A, SHAIFER

Enter P or PAGE for more
?e au=liotta, l

Ref   Items  Index-term
E1        1  AU=LIOTTA, JUDITH J.
E2        1  AU=LIOTTA, K.
E3        0 *AU=LIOTTA, L
E4       12  AU=LIOTTA, L.
E5       72  AU=LIOTTA, L. A.
E6        1  AU=LIOTTA, L. J.
E7        8  AU=LIOTTA, LANCE
E8      164  AU=LIOTTA, LANCE A.
E9        1  AU=LIOTTA, LANCE ALLEN
E10       8  AU=LIOTTA, LOUIS J.
E11       1  AU=LIOTTA, LOUIS JAMES
E12       1  AU=LIOTTA, MARIO

Enter P or PAGE for more
?s e4 or e5 or eS o£re8 6t?re9 
>>>Term "OR" in invalid position
?s e4 or e5 or e7 or e8 or e9 
>>>One or more prefixes are unsupported 
>>>  or undefined in one or more files.
         12  AU=LIOTTA, L.
         72  AU=LIOTTA, L. A.
          8  AU=LIOTTA, LANCE
        164  AU=LIOTTA, LANCE A.
          1  AU=LIOTTA, LANCE ALLEN
S14     257  AU="LIOTTA, L." OR AU="LIOTTA, L. A." OR AU="LIOTTA,
             LANCE" OR AU="LIOTTA, LANCE A." OR AU="LIOTTA, LANCE
             ALLEN"

?rd 

>>>Duplicate detection is not supported for File 398. 

>>>Records from unsupported files will be retained in the RD set. 

...examined 50 records (50)
...examined 50 records (100)
...examined 50 records (150)
...examined 50 records (200)
...examined 50 records (250)
...completed examining records
S15     247  RD (unique items)

?ds 



Set     Items    Description
S1         46    AUTOCRINE(W)MOTILITY(W)FACTOR? OR AUTOTAXIN
S2        613    ATX
S3        649    S1 OR S2
S4          0    ANTIBO.D?
S5    1718826    ANTIBOD?
S6         73    S5 AND S3
S7         47    RD (unique items)
S8         26    RD S1 (unique items)
S9         25    S8 NOT S7
S10         9    AU="STRACKE, M." OR AU="STRACKE, MARY"
S11         5    RD S10 (unique items)
S12         4    S11 NOT S8


S13 
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Identification, puri f icat i on, and partial sequence analysis of autotaxin, 
a novel motility-stimulating protein

AUTHOR(S): Stracke, Mary L.; Krutzsch, Henry C.; Unsworth, Edward J.;
Arestad, Anders; Cioce, Vittoria; Schiffmann, Elliott; Liotta, Lance A.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: J. Biol. Chem. DATE: 1992 VOLUME: 267 NUMBER: 4 PAGES:
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(c) 1994 American Chemical Society. All rt,s'« reserv, 

121079330 CA: 121(7)79330f JOURNAL

High-affinity .alpha.-thrombin binding to platelet glycoprotein
Ib.alpha.: identification of two binding domains

AUTHOR(S): Gralnick, Harvey R.; Williams, Sybil; McKeown, Laurie P.;
Hansmann, Kristin; Fenton, John W., II; Krutzsch, Henry

LOCATION: Hematology Service, National Institutes Health, Bethesda, MD,
20892, USA

JOURNAL: Proc. Natl. Acad. Sci. U. S. A. DATE: 1994 VOLUME: 91
NUMBER: 14 PAGES: 6334-8 CODEN: PNASA6 ISSN: 0027-8424 LANGUAGE:
English



31/3/2 (Item 2 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

120262913 CA: 120(21)262913s JOURNAL

A sequence-specific, single-strand binding protein activates the far
upstream element of c-myc and defined a new DNA-binding motif

AUTHOR(S): Duncan, Robert; Bazar, Leonard; Michelotti, Greg; Tomonaga,
Takeshi; Krutzsch, Henry; Avigan, Mark; Levens, David

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: Genes Dev. DATE: 1994 VOLUME: 8 NUMBER: 4 PAGES: 465-88
CODEN: GEDEEP ISSN: 0890-9369 LANGUAGE: English



31/3/3 (Item 3 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

120208113 CA: 120(17)208113t JOURNAL

Apolipoprotein E: a potent inhibitor of endothelial and tumor cell
proliferation

AUTHOR(S): Vogel, Tikva; Guo, Neng Hua; Guy, Rachel; Drezlich, Nina;
Krutzsch, Henry C.; Blake, Diane A.; Panet, Amos; Roberts, David D.

LOCATION: Lab. Pathol., Natl. Inst. Health, Bethesda, MD, 20892, USA

JOURNAL: J. Cell. Biochem. DATE: 1994 VOLUME: 54 NUMBER: 3 PAGES:
299-308 CODEN: JCEBD5 ISSN: 0730-2312 LANGUAGE: English



31/3/4 (Item 4 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

119262228 CA: 119(25)262220t JOURNAL

Modulation of endothelial cell proliferation, adhesion, and motility by
recombinant heparin-binding domain and synthetic peptides from the type I
repeats of thrombospondin

AUTHOR(S): Vogel, Tikva; Guo, Neng Hua; Krutzsch, Henry C.; Blake, Diane
A.; Hartman, Jacob; Mendelovitz, Simona; Panet, Amos; Roberts, David D.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: J. Cell. Biochem. DATE: 1993 VOLUME: 53 NUMBER: 1 PAGES:
74-84 CODEN: JCEBD5 ISSN: 0730-2312 LANGUAGE: English



31/3/5 (Item 5 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

119198234 CA: 119(19)198234w JOURNAL

The purification and characterisation of an extremely thermostable
.alpha.-amylase from the hyperthermophilic archaebacterium Pyrococcus
furiosus

AUTHOR(S): Laderman, Kenneth A.; Davis, Bradley R.; Krutzsch, Henry C.;
Lewis, Marc S.; Griko, Y. V.; Privalov, Peter L.; Anfinsen, Christian B.
LOCATION: Dep. Biol., Johns Hopkins Univ., Baltimore, MD, 21218, USA
JOURNAL: J. Biol. Chem. DATE: 1993 VOLUME: 268 NUMBER: 32 PAGES:
24394-401 CODEN: JBCHA3 ISSN: 0021-9258 LANGUAGE: English



31/3/6 (Item 6 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

119175324 CA: 119(17)175324m JOURNAL

Specific binding of heterogeneous ribonucleoprotein particle protein K to
the human c-myc promoter, in vitro

AUTHOR(S): Takimoto, Masato; Tomonaga, Takeshi; Matunis, Michael; Avigan,
Mark; Krutzsch, Henry; Dreyfuss, Gideon; Levens, David

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: J. Biol. Chem. DATE: 1993 VOLUME: 268 NUMBER: 24 PAGES:
18249-58 CODEN: JBCHA3 ISSN: 0021-9258 LANGUAGE: English



31/3/7 (Item 7 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

119023356 CA: 119(3)23356k CONFERENCE PROCEEDING

TIMP-2: identification and characterization of a new member of the
metalloproteinase inhibitor family

AUTHOR(S): Stetler-Stevenson, William G.; Krutzsch, Henry C.; Liotta,
Lance A.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA
JOURNAL: Matrix Metalloproteinases Inhib., Proc. Matrix Metalloproteinase
Conf. EDITOR: Birkedal-Hansen, Henning (Ed), DATE: 1992 PAGES: 299-306
CODEN: 58AQAX LANGUAGE: English MEETING DATE: 890000 PUBLISHER:
Fischer, Stuttgart, Germany



31/3/8 (Item 8 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

119005737 CA: 119(1)5737f JOURNAL

Inhibition of fibronectin binding and fibronectin-mediated cell adhesion
to collagen by a peptide from the second type I repeat of thrombospondin

AUTHOR(S): Sipes, John M.; Guo, Neng Hua; Negre, Eric; Vogel, Tikva;
Krutzsch, Henry C.; Roberts, David D.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: J. Cell Biol. DATE: 1993 VOLUME: 121 NUMBER: 2 PAGES: 469-77
CODEN: JCLBA3 ISSN: 0021-9525 LANGUAGE: English



31/3/9 (Item 9 from file: 399)

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

118250600 CA: 118(25)250600h JOURNAL
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AUTHOR (S): Rohan, Patricia J. ; Dav^ Paula; MosUaluk, Christopher A. ; 

Kearns, Mary; Krutzsch, Henry; Siebenlist, Ulrich; Kelly, Kathleen

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA
JOURNAL: Science (Washington, D. C., 1883-) DATE: 1993 VOLUME: 259
NUMBER: 5102 PAGES: 1763-6 CODEN: SCIEAS ISSN: 0036-8075 LANGUAGE:
English



31/3/10 (Item 10 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

118142727 CA: 118(15)142727w JOURNAL

N-isopropyliodoacetamide in the reduction and alkylation of proteins:
Use in microsequence analysis

AUTHOR(S): Krutzsch, Henry C.; Inman, John K.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, USA
JOURNAL: Anal. Biochem. DATE: 1993 VOLUME: 209 NUMBER: 1 PAGES:
109-16 CODEN: ANBCA2 ISSN: 0003-2697 LANGUAGE: English



31/3/11 (Item 11 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

118140848 CA: 118(15)140848n JOURNAL

Cloning and characterization of a novel human cDNA that has DNA
similarity to the conserved region of the collagenase gene family

AUTHOR(S): Templeton, Nancy Smyth; Rodgers, Lisa A.; Levy, Anna T.; Ting,
Kai Li; Krutzsch, Henry C.; Liotta, Lance A.; Stetler-Stevenson, William

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: Genomics DATE: 1992 VOLUME: 12 NUMBER: 1 PAGES: 175-6
CODEN: GNMCEP ISSN: 0888-7543 LANGUAGE: English



31/3/12 (Item 12 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

117209613 CA: 117(21)209613f JOURNAL

A monomeric von Willebrand factor fragment, Leu-504-Ser-728, inhibits von
Willebrand factor interaction with glycoprotein Ib-IX

AUTHOR(S): Gralnick, Harvey R.; Williams, Sybil; McKeown, Laurie; Kramer,
Wendy; Krutzsch, Henry; Gorecki, Marian; Pinet, Amos; Garfinkel, Leonard I.
LOCATION: Hematol. Serv., Natl. Inst. Health, Bethesda, MD, 20892, USA
JOURNAL: Proc. Natl. Acad. Sci. U. S. A. DATE: 1992 VOLUME: 89
NUMBER: 17 PAGES: 7880-4 CODEN: PNASA6 ISSN: 0027-8424 LANGUAGE:
English



31/3/13 (Item 13 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

117189539 CA: 117(19)189539f JOURNAL

Heparin-binding peptides from the type I repeats of thrombospondin.
Structural requirements for heparin binding and promotion of melanoma cell
adhesion and chemotaxis

AUTHOR(S): Guo, Neng Hua; Krutzsch, Henry C.; Negre, Eric; Zabrenetzky,
Vivian S.; Roberts, David D.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: J. Biol. Chem. DATE: 1992 VOLUME: 267 NUMBER: 27 PAGES:
19349-55 CODEN: JBCHA3 ISSN: 0021-9258 LANGUAGE: English
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DIALOG ( R) Fi 1 e 399:CA Search(R) 

(c) 1994 American Chemical Society. All rts. reserv. 



117106652 CA: 117(11)106652p JOURNAL

Interactions of a laminin-binding peptide from a 33-kDa protein related
to the 67-kDa laminin receptor with laminin and melanoma cells are
heparin-dependent

AUTHOR(S): Guo, Neng Hua; Krutzsch, Henry C.; Vogel, Tikva; Roberts,
David D.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA
JOURNAL: J. Biol. Chem. DATE: 1992 VOLUME: 267 NUMBER: 25 PAGES:
17743-7 CODEN: JBCHA3 ISSN: 0021-9258 LANGUAGE: English



31/3/15 (Item 15 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

117085872 CA: 117(9)85872t JOURNAL

A proteolytically sensitive region common to several rat liver
cytochromes P450: effect of cleavage on substrate binding

AUTHOR(S): Tsokos, Dimitris C.; Omata, Yoshiaki; Robinson, Richard C.;
Krutzsch, Henry C.; Gelboin, Harry V.; Friedman, Fred K.

LOCATION: Lab. Mol. Carcinog., Natl. Cancer Inst., Bethesda, MD, 20892,
USA

JOURNAL: Biochemistry DATE: 1992 VOLUME: 31 NUMBER: 31 PAGES: 7155-9
CODEN: BICHAW ISSN: 0006-2960 LANGUAGE: English
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(c) 1994 American Chemical Society. All rts. reserv. 



117063782 CA: 117(7)63782m PATENT
Human megakaryocyte colony-stimulating factor
methods

INVENTOR(AUTHOR): Murphy, Martin J.; Parch
Erickson-Miller, Connie L.; Dai, Wei; Zha
Krutzsch, Henry

LOCATION: USA

ASSIGNEE: Hipple Cancer Research Center
PATENT: PCT International ; WO 9200319 A1
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116£3£948 CA: 116 (£3)£3£948g /JOURNAL 
Heparin- and sulfatide-binding peptides from the type

thrombospondin promote melanoma cell adhesion

AUTHOR(S): Guo, Neng Hua; Krutzsch, Henry C.; Negre,

Blake, Diane A.; Roberts, David D.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda,
JOURNAL: Proc. Natl. Acad. Sci. U. S. A. DATE: 1992
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Eric; Vogel, Tikva; 
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USA 
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116101599 CAs 116(11) 101599w JOURNAL 

Higher-order complex formation between the 72-kilodalton type IV
collagenase and tissue inhibitor of metalloproteinases-2

AUTHOR(S): Kleiner, David E., Jr.; Unsworth, Edward J.; Krutzsch, Henry
C.; Stetler-Stevenson, William G.

LOCATION: Lab. Pathol., Natl. Cancer Inst., Bethesda, MD, 20892, USA

JOURNAL: Biochemistry DATE: 1992 VOLUME: 31 NUMBER: 6 PAGES: 1665-

CODEN: BICHAW ISSN: 0006-2960 LANGUAGE: English



31/3/19 (Item 19 from file: 399) 

DIALOG(R)File 399:CA Search(R)
(c) 1994 American Chemical Society. All rts. reserv.

115227417 CA: 115(21)227417h JOURNAL

Reducing chemical background noise in automated protein sequencers
AUTHOR(S): Fransworth, Vince; Carson, Wulf; Krutzsch, Henry
LOCATION: Porton Instrum., Tarzana, CA, 91356, USA

JOURNAL: Pept. Res. DATE: 1991 VOLUME: 4 NUMBER: 4 PAGES: 245-51
CODEN: PEREEO ISSN: 1040-5704 LANGUAGE: English
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(c> 1994 American Chemical Sociei 
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ABSTRACT: 

Indirect method of detecting elastase-modified or cleaved antithrombin
(ATx) in the presence of intact antithrombin (AT-III). The inventive
method includes a modified ELISA using a detergent to alter the intact
AT-III. Cleaved AT-III is generated in human plasma, then an ELISA is
performed in the presence of a detergent.
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compositions and methods for inhibiting gene expression are 
The compounds comprise oligonucleotide sequences of from about 
200 bases having a diol at either or both termini. Preferred

polyalkyleneglycols, preferably polyethyleneglycols.

ical compositions comprising the compounds and a 

cally acceptable carrier and methods of inhibiting gene 
in mammals comprising administering such compounds are also 

Methods for inhibiting nuclease cleavage of compounds are also 
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ABSTRACT: 

A gasket for sealing a channel of a mounting cup to a container bead
including at least one protrusion at an end of the gasket for retaining
the mounting cup in position on a container bead prior to clinching. Six
such protrusions can be equidistantly positioned about the gasket. A
gasketed mounting cup including such protrusions is also disclosed. A
method for manufacturing such a gasketed mounting cup is disclosed as
well.
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ABSTRACT: 

A composition useful to produce a golf ball cover of a blend of 
ethylene-methyl acrylate and an ionomer resin and a compatibilizer. The
composition produces a cover material having hardness and feel comparable 
to Balata, but having improved resilience, durability and cut resistance 
over Balata. 
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ABSTRACT: 

Unique digital codes are encoded on a video signal, the codes are 
retrieved at receivers and precise information concerning the time of 
occurrence, length, nature and quality of a monitored broadcast at a 
frame by frame level, is generated. The codes are inserted on scan lines
of the video, and vary either on a field-to-field or frame-to-frame 
basis. The code has a repeating first part having a unique program 
material identifier indicating the time, date and place of encoding, and 
has a second portion that varies in a predetermined non-repeating 
sequence which varies along the entire length of the tape, thereby 
uniquely identifying each frame of the video program material. Also 
encoded upon successive frames is a cyclic counter code with a count 
corresponding to the sequence of the identifier data on successive 
frames. When the video signal is processed by a receiver, the first 
portion identifier data from the various frames is mapped into selected 
memory locations in accordance with the count of the frame as determined 
by the second portion. Odd and even fields are encoded with complementary 
bit sequences to assist in processing the encoded data. Whenever the
frame sequence is interrupted a data packet is generated representative 
of the condition encountered. The data packets are accumulated in log 
files in a memory in the receiver. The log files are transmitted to a 
data center, as is a copy of the encoded tape. Reports concerning the 
broadcast are generated. 
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ABSTRACT: 

A subscriber terminal apparatus for a television in an in-band 
subscription television system is provided. The subscriber terminal 
includes a receiver for receiving a television signal including video, 
audio, and data information. A selector selects a channel of the
television signal. A memory stores a plurality of barker screens 
providing messages regarding one or more channels of the television 
signal. An on-screen display control circuit controls the display of the 
barker screens on the television. A processor retrieving a barker screen 
from the memory supplies the retrieved screen to the on-screen display 
control circuit if the barker screen provides a message regarding a 
selected channel. Barker screen information may also be obtained from a 
dedicated data channel or a six megahertz video barker channel. 
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ABSTRACT: 

A pulse generator for generating pulses for modulation onto a carrier or 
subcarrier of a composite television signal is provided. The pulse 
generator includes a memory such as an EPROM for storing one or more 
waveshapes. Each stored waveshape is defined by a sequence of addressable 
values representing the amplitude of the waveshape as a function of time. 
A selecting circuit such as a microprocessor selects one of the 
waveshapes in the memory. A counting circuit responsive to a clock signal 
controls the address lines of the memory to read the amplitude values
corresponding to the selected waveshape from the memory. The amplitude
values are supplied to a digital to analog converter to convert the 
amplitude values to an analog pulse. The pulse may then be filtered to 
remove clock noise. The resultant signal is supplied to an amplitude 
modulator for modulating the signal onto a carrier or subcarrier of a 
composite television signal. 
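
The read-out path described in this abstract (a stored waveshape addressed by a counter, then converted to analog levels) can be illustrated with a minimal Python sketch. The table contents, the names WAVESHAPES, read_pulse and to_analog, and the 8-bit sample width are all hypothetical choices made for illustration; none of them comes from the patent.

```python
from typing import Dict, List

# Hypothetical waveshape table standing in for the EPROM contents:
# each entry is a sequence of amplitude values indexed by address.
WAVESHAPES: Dict[str, List[int]] = {
    "square": [0, 255, 255, 255, 255, 0],
    "ramp": [0, 51, 102, 153, 204, 255],
}


def read_pulse(shape: str, table: Dict[str, List[int]] = WAVESHAPES) -> List[int]:
    """Step a counter through the addresses of the selected waveshape."""
    samples = table[shape]          # the "selecting circuit" picks one waveshape
    out: List[int] = []
    for address in range(len(samples)):   # one address per clock tick
        out.append(samples[address])
    return out


def to_analog(values: List[int], full_scale: float = 1.0, bits: int = 8) -> List[float]:
    """Idealized digital-to-analog conversion (assumed 8-bit samples)."""
    top = (1 << bits) - 1
    return [full_scale * v / top for v in values]


pulse = to_analog(read_pulse("ramp"))
```

Filtering of clock noise and modulation onto the carrier are left out; the sketch covers only the table look-up and conversion steps.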
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ABSTRACT s 

A subscriber terminal includes a receiver for receiving a television 
signal including video, audio, and data information. A channel of the 
television signal may be selected for display on the television. A memory 
stores channel identification information such as channel identifiers. 
The channel identifiers include at least one display character. A 
processor establishes a relationship between channel identification 
information and channel numbers associated with channels of the 
television signal. An on-screen display control circuit controls the 
display of character information on the television and has the capability 
of overlaying the channel number and the channel identification 
information on the video portion of a selected channel displayed on the 
television for a predetermined period of time. Program identification 
information such as program titles may also be displayed. Alternatively 
or additionally, the channel identification information may be displayed 
on a display such as an LED display of a subscriber terminal.
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ABSTRACT: 

A hydrophone analog signal data acquisition, A/D conversion and data
transmission system includes a first-stage signal processing subsystem which
provides digital representations of the hydrophone analog signal, which
in turn are signal processed for transmission in the form of data packets
by a second stage signal processing subsystem (40). Subsystem 40 includes
a plurality of Data Multiplexer/FIFO units (48), including corresponding
selectively acting data unit accumulators, each accumulator having a
plurality of inputs coupled to output channels of the first-stage signal
processing subsystem for receiving digital representations of hydrophone
analog signals. Each data unit accumulator includes a first buffer (48-2)
for storing information that includes a digital representation of the
analog hydrophone signal, an identification of a hydrophone that
generated the acoustic information, and a time that the acoustic
information is received from the hydrophone. Each data unit accumulator
further includes an input interface that is operable during the first
period for receiving an alert signal with a hydrophone analog signal,
indicating that the associated source has data available. The input
interface compares a current state of the alert signal to a previous
state for detecting an occurrence of the assertion of the alert signal.
The data unit accumulator also receives and stores a unit of data from a
data source having an asserted alert signal, and is responsive to the
storage of the unit of data therein, during the first period, to receive
and store, during the second period, other information associated with
the unit of data stored during the first period.
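
The alert-detection step in this abstract (comparing the current state of the alert signal with its previous state) amounts to rising-edge detection. The following Python sketch shows that idea only; the class name AlertInput and the boolean sampling model are assumptions made for illustration and are not taken from the patent.

```python
from typing import List


class AlertInput:
    """Hypothetical input interface that flags a newly asserted alert signal."""

    def __init__(self) -> None:
        self.previous = False   # state of the alert line at the last poll

    def poll(self, current: bool) -> bool:
        """Return True only when the line goes from de-asserted to asserted."""
        asserted = current and not self.previous
        self.previous = current
        return asserted


def detect_assertions(samples: List[bool]) -> List[bool]:
    """Run a sequence of sampled alert states through one input interface."""
    interface = AlertInput()
    return [interface.poll(state) for state in samples]


edges = detect_assertions([False, True, True, False, True])
```

A level that stays asserted across several polls is reported once, which is why the third sample in the usage line does not count as a new assertion.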
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ABSTRACT: 

Coated abrasive material having a combination backing member. The 
combination backing member has as a bottom member a conventional backing 
member substrate used in the manufacture of coated abrasive material such 
as cylinder paper coated with a polymeric layer. The polymer layer 
provides a relatively smooth surface for application of the maker coat 
during manufacture of the coated abrasive material. The coated abrasive 
member can be used in diverse applications such as the fine finishing of 
particle board and offhand grinding of automobile body seams. 
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ABSTRACT: 

This invention relates to a compatibilized rubber composition and a
process for compatibilizing dissimilar rubber blends comprising blending
an ethylene/acrylate/acrylic acid terpolymer with 2 or more different
rubbers, selected from the group including, but not limited to, EPR,
EPDM, CR, NBR, SBR and NR.
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ABSTRACT: 

A method of controlling the allocation of a population of remote units 
among a plurality of groups of remote units is provided. Each remote unit 
has a digital identifier respectively associated therewith. A maximum and 
a minimum average number of remote units per group is fixed. The remote 
units are assigned to the groups of remote units in accordance with the 
respective digital identifiers. The average number of remote units per 
group is then determined as remote units are assigned thereto. Next, the 
average number of remote units per group is compared to the fixed maximum 
number of remote units per group. The above steps are repeated while the 
average number of remote units per group is less than or equal to the 
fixed maximum number of remote units per group. The number of groups is 
changed such that the average number of remote units per group is between 
the fixed maximum and minimum number of remote units per group if the 
average number of remote units per group exceeds the maximum number of 
remote units per group. 
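
The allocation steps described in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the patented method: the modulo mapping from identifier to group, the midpoint rule in `choose_group_count`, and all function and parameter names are assumptions introduced here.

```python
import math


def choose_group_count(unit_count, min_avg, max_avg):
    """Pick a group count whose average size lies between min_avg and max_avg.

    Hypothetical helper: the abstract only says the number of groups is
    changed so the average falls between the bounds; the midpoint choice
    is an assumption.
    """
    lowest = math.ceil(unit_count / max_avg)    # fewest groups with avg <= max
    highest = math.floor(unit_count / min_avg)  # most groups with avg >= min
    if lowest > highest:
        raise ValueError("no group count satisfies both bounds")
    return (lowest + highest) // 2


def allocate(identifiers, num_groups, min_avg, max_avg):
    """Assign units to groups by identifier, regrouping when the average
    number of units per group exceeds max_avg."""
    groups = [[] for _ in range(num_groups)]
    for count, ident in enumerate(identifiers, start=1):
        # Assumed mapping: group index is the identifier modulo the group count.
        groups[ident % num_groups].append(ident)
        if count / num_groups > max_avg:
            num_groups = choose_group_count(count, min_avg, max_avg)
            groups = [[] for _ in range(num_groups)]
            for seen in identifiers[:count]:
                groups[seen % num_groups].append(seen)
    return groups


groups = allocate(list(range(100, 130)), num_groups=2, min_avg=4, max_avg=8)
print(len(groups), [len(g) for g in groups])
```

With 30 consecutive identifiers and bounds of 4 and 8, the sketch starts with two groups and regroups twice as the average crosses the maximum.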
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Cable television radio frequency subscriber data 
transmission apparatus and RF return method 
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Automatic frequency selection in a bi-directional cable 
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Signalling scheme for controlling data encryption device 
in an electronic fund transaction processing system 
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Assembly, especially for a beverage-vending machine, with 

a container for the storage, cooling and carbonating of 
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Method of reducing wearout in a non-volatile memory with 
double buffer 
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Interface circuit for interfacing between asynchronous 
data in start/stop format and synchronous data 
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L4: 36 of 48 



L4: 37 of 48

US PAT NO:     4,535,084 [IMAGE AVAILABLE]            L4: 38 of 48
TITLE:         Certain 4-(2-hydroxyethylthiomethyl)pyridines and
               derivatives thereof having immunoregulatory activity

US PAT NO:     4,500,535 [IMAGE AVAILABLE]            L4: 39 of 48
TITLE:         Method of regulating the immune response with pyridine
               derivatives

US PAT NO:     4,371,871 [IMAGE AVAILABLE]            L4: 40 of 48
TITLE:         Alert message communication system

US PAT NO:     4,371,696 [IMAGE AVAILABLE]            L4: 41 of 48
TITLE:         Certain pyridine methylthio acetaldehyde derivatives and
               non-cyclic and cyclic acetals thereof

US PAT NO:     4,354,241 [IMAGE AVAILABLE]            L4: 42 of 48
TITLE:         Programmable electronic real-time load controller
               providing for adaptation of load control in response to
               varying environmental conditions

US PAT NO:     4,246,263 [IMAGE AVAILABLE]            L4: 43 of 48
TITLE:         Antiinflammatory and immunoregulatory pyrimidines, their
               method of use and pharmaceutical compositions

US PAT NO:     4,191,953 [IMAGE AVAILABLE]            L4: 44 of 48
TITLE:         Intrusion sensor and aerial therefor

US PAT NO:     4,138,718 [IMAGE AVAILABLE]            L4: 45 of 48
TITLE:         Numerical control system with downloading capability

US PAT NO:     3,939,148 [IMAGE AVAILABLE]            L4: 46 of 48
TITLE:         Process for preparing 1,3,5,7-tetranitro-1,3,5,7-
               tetraazacyclooctane

US PAT NO:     3,766,370 [IMAGE AVAILABLE]            L4: 47 of 48
TITLE:         ELEMENTARY FLOATING POINT CORDIC FUNCTION PROCESSOR AND
               SHIFTER

US PAT NO:     3,632,883 [IMAGE AVAILABLE]            L4: 48 of 48
TITLE:         TELECOMMUNICATION EXCHANGE WITH TIME DIVISION MULTIPLEX

=> d l4 43 bib,ab

US PAT NO:     4,246,263 [IMAGE AVAILABLE]            L4: 43 of 48
DATE ISSUED:   Jan. 20, 1981
TITLE:         Antiinflammatory and immunoregulatory pyrimidines, their
               method of use and pharmaceutical compositions
INVENTOR:      Joseph G. Lombardino, Groton, CT
               Charles A. Harbert, Groton, CT
ASSIGNEE:      Pfizer Inc., New York, NY (U.S. corp.)
APPL-NO:       06/085,011
DATE FILED:    Oct. 15, 1979
ART-UNIT:      121
PRIM-EXMR:     Henry R. Jiles
ASST-EXMR:     Robert T. Bond
LEGAL-REP:     Connolly and Hutz

US PAT NO:     4,246,263 [IMAGE AVAILABLE]            L4: 43 of 48

ABSTRACT:

A series of 4-(2-hydroxyethylthiomethyl)pyridines and related compounds,
and their pharmaceutically acceptable acid addition salts, having
antiinflammatory and immunoregulatory activity are disclosed. Preferred
compounds include 4-(2-hydroxyethylthiomethyl)pyridine itself, as well as
4-(2-hydroxyphenylthiomethyl)pyridine, 4-(2-hydroxy-1-
propylthiomethyl)pyridine, 4-(3-hydroxy-2-butylthiomethyl)pyridine,
4-(2-hydroxyethylthiomethyl)pyrimidine, the acetate esters corresponding
to the above compounds, and 4-(2,3-dihydroxy-1-propylthiomethyl)pyridine.

=> log y 

U.S. Patent & Trademark Office LOGOFF AT BB:49:31 ON 21 SEP 94



=> fil reg; s whvar | pldvyk | ypafk | qaevs | peevtrpnyl | ydvpwneti |
vppfeniely | ggqplwitatk | vnsmqtvfvgygptfk | diehltsldffr |
teflsnyltnvddit/sqsp
L1 35 WHVAR | PLDVYK | YPAFK | QAEVS | PEEVTRPNYL | YDVPWNETI | VPPFENIELY | G
GQPLWITATK | VNSMQTVFVGYGPTFK | DIEHLTSLDFFR | TEFLSNYLTNVDDIT/S
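
The query above asks for any record whose sequence contains at least one of eleven peptide fragments. A minimal sketch of that kind of "any of these subsequences" test is shown below; it is an illustration only, assumes plain exact substring matching, and does not reproduce how the REGISTRY /SQSP field is actually searched. The function name and the example sequence are hypothetical.

```python
# The eleven fragments from the query, written without OCR spacing.
QUERY = (
    "whvar|pldvyk|ypafk|qaevs|peevtrpnyl|ydvpwneti|vppfeniely|"
    "ggqplwitatk|vnsmqtvfvgygptfk|diehltsldffr|teflsnyltnvddit"
)


def matching_fragments(sequence, query=QUERY):
    """Return the query fragments (upper case) that occur in `sequence`."""
    seq = sequence.upper()
    return [frag.upper() for frag in query.split("|") if frag.upper() in seq]


# Hypothetical sequence containing two of the fragments.
print(matching_fragments("MKQAEVSXXWHVARGG"))
```

A record would count as a hit whenever the returned list is non-empty, which mirrors the OR semantics of the `|` separator.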

=> fil ca; s l1 and (autotox? or auto tox? or atx# or amf# or
autocrin?(1w)factor# or motility(1w)autocrin? and antibod?)/ab,bi
FILE 'CA' ENTERED AT 14:13:21 ON 21 SEP 94 
USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT 
COPYRIGHT (C) 1994 AMERICAN CHEMICAL SOCIETY (ACS) 

FILE COVERS 1967 - 17 Sep 1994 (940917/ED) VOL 121 ISS 12 

To help control your online searching costs, consider using the
HCA File when conducting SmartSELECT searches with large 
numbers of terms. 

THE BASIC INDEX NOW INCLUDES ABSTRACT TEXT. SEE NEWS FOR DETAILS. 



             27 L1
             47 AUTOTOX?/AB
             67 AUTOTOX?/BI
           3856 AUTO/AB
         172776 TOX?/AB
              0 AUTO TOX?/AB
                  ((AUTO(W)TOX?)/AB)
          16925 AUTO/BI
         283181 TOX?/BI
              1 AUTO TOX?/BI
                  ((AUTO(W)TOX?)/BI)
            130 ATX#/AB
            135 ATX#/BI
            270 AMF#/AB
            309 AMF#/BI
           3082 AUTOCRIN?/AB
           3260 AUTOCRIN?/BI
         443319 FACTOR#/AB
         524327 FACTOR#/BI
            616 AUTOCRIN?(1W)FACTOR#
           9802 MOTILITY/AB
          12013 MOTILITY/BI
           3082 AUTOCRIN?/AB
           3260 AUTOCRIN?/BI
              2 MOTILITY(1W)AUTOCRIN?
         157273 ANTIBOD?/AB

L2            1 L1 AND (AUTOTOX? OR AUTO TOX? OR ATX# OR AMF# OR AUTOCRIN?
                (1W)FACTOR# OR MOTILITY(1W)AUTOCRIN? AND ANTIBOD?)/AB,BI
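
The search terms above rely on truncation symbols and a proximity operator. The sketch below models them under stated assumptions: `?` is taken to mean any number of trailing characters, `#` zero or one character, `!` exactly one character, and `(nW)` "in this order with at most n words in between". These readings follow common STN usage but are assumptions here, and the function names are hypothetical.

```python
import re


def stn_term_to_regex(term):
    """Translate a single search term with truncation symbols into a regex."""
    parts = []
    for ch in term:
        if ch == "?":
            parts.append(r"\w*")   # assumed: unlimited truncation
        elif ch == "#":
            parts.append(r"\w?")   # assumed: zero or one character
        elif ch == "!":
            parts.append(r"\w")    # assumed: exactly one character
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.IGNORECASE)


def within_words(text, first, second, n):
    """True if `first` is followed by `second` with at most n words between."""
    words = re.findall(r"\w+", text)
    a, b = stn_term_to_regex(first), stn_term_to_regex(second)
    for i, word in enumerate(words):
        if a.fullmatch(word):
            for j in range(i + 1, min(i + 2 + n, len(words))):
                if b.fullmatch(words[j]):
                    return True
    return False


print(within_words("Autotaxin (ATX), an autocrine factor", "autocrin?", "factor#", 1))
```

Under these assumptions, the phrase "an autocrine factor" from the retrieved abstract satisfies `AUTOCRIN?(1W)FACTOR#`, while the reversed word order does not.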

=> d .bevstr; sel hit l2 rn

L2 ANSWER 1 OF 1 CA COPYRIGHT 1994 ACS
AN 119:218353 CA
TI Autotaxin: motility stimulating protein useful in cancer diagnosis
IN Stracke, Mary; Liotta, Lance A.; Schiffmann, Elliott; Krutzsch,
Henry
PA United States Dept. of Health and Human Services, USA
SO U. S. Pat. Appl., 61 pp. Avail. NTIS Order No. PAT-APPL-7-822 043.
CODEN: XAXXAV
PI US 822043 A0 930101
AI US 92-822043 920117
DT Patent
LA English
AB Autotaxin (ATX), an autocrine factor
produced by A2058 human melanoma cells, is purified and
characterized. ATX appears to be a glycosylated protein,
has a pI of 7.7, and has a mol. wt. of 125 kDa on SDS-PAGE under
reducing conditions. Purified ATX is active in the
picomolar range. Peptide fragments of ATX were sequenced:
no significant homologies to these peptides were found when
searching GenBank, EMBL, SWISS-PROT, or GenPept protein databases.
IT 147960-52-9, Autotaxin fragment (human) 147960-53-0,
Autotaxin fragment (human) 147960-54-1, Autotaxin
fragment (human) 147960-56-3, Autotaxin fragment (human)
147960-57-4, Autotaxin fragment (human) 147960-58-5,
Autotaxin fragment (human) 147977-73-9, Autotaxin
fragment (human)
(fragment of human autotaxin)

E1 THROUGH E7 ASSIGNED
=> fil reg; s e1-e7

FILE 'REGISTRY' ENTERED AT 14:14:00 ON 21 SEP 94 

USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT 

COPYRIGHT (C) 1994 American Chemical Society (ACS) 

STRUCTURE FILE UPDATES: 17 SEP 94 HIGHEST RN 157748-89-5 
DICTIONARY FILE UPDATES: 20 SEP 94 HIGHEST RN 157748-89-5 



TSCA INFORMATION NOW CURRENT THROUGH MAY 1994 

Please note that search-term pricing does apply when 
conducting SmartSELECT searches. 

1 147960-52-9/BI 

(147960-52-9/RN) 
1 147960-53-0/BI 

(147960-53-0/RN) 
1 147960-54-1/BI 

(147960-54-1/RN) 
1 147960-56-3/BI 

(147960-56-3/RN) 
1 147960-57-4/BI 

(147960-57-4/RN) 
1 147960-58-5/BI 

(147960-58-5/RN) 
1 147977-73-9/BI 

(147977-73-9/RN) 

L3 7 (147960-52-9/BI OR 147960-53-0/BI OR 147960-54-1/BI OR
147960-56-3/BI OR 147960-57-4/BI OR 147960-58-5/BI OR
147977-73-9/BI)

=> d 1-7 .bevreg; fil caprev; s l1

L3 ANSWER 1 OF 7 REGISTRY COPYRIGHT 1994 ACS
RN 147977-73-9 REGISTRY

L-prolyl]-L-leucyl]-L-tryptophyl]-L-isoleucyl]-L-threonyl]-L-alanyl-
L-threonyl]- (9CI) (CA INDEX NAME)

OTHER NAMES:

CN Autotaxin fragment (human)
SQL 11

MF C54 H86 N14 O15

L3 ANSWER 2 OF 7 REGISTRY COPYRIGHT 1994 ACS 
RN 147960-58-5 REGISTRY 

CN =I^? i ! line ' L " tl ? re ? n y 1 - L -- al P ha --g lut ^yl-L-phenylalanyl-L-leucyl-L- 
seryl-L-asparagmyl-L-tyrosyl-L-leucyl-L-threonyl-L-asparaginyl-L- 
valy l-L- . alpha . -asparty 1-L- . alpha . -aspartyl-L-isoleucyl-L-threonyl-L- 
iSSftSS? y ^"Wycyl-I^oonyl-t-leucylgly^i- (9^>° ?CA 

OTHER NAMES: 

CN Autotaxin fragment (human) 
SQL 23 

MF C113 H180 N28 O38

L3 ANSWER 3 OF 7 REGISTRY COPYRIGHT 1994 ACS 
RN 147960-57-4 REGISTRY 

isoleucyl)-L-.alpha.-glutamyl]-L-histidyl]-L-leucyl]-L-threonyl]-L-
seryl]-L-leucyl]-L-.alpha.-aspartyl]-L-phenylalanyl]-L-phenylalanyl]
(9CI) (CA INDEX NAME)
OTHER NAMES:

CN Autotaxin fragment (human)
SQL 12

MF C68 H101 N17 O21

L3 ANSWER 4 OF 7 REGISTRY COPYRIGHT 1994 ACS 

S "Ly^ine'l-va^-^ 
£hKony1-L-va^yl-L-ph^ 

p^o?^-L-threo^yl-L°phenylalanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN Autotaxin fragment (human) 
SQL 16 

MF C82 H123 N19 O23 S

L3 ANSWER 5 OF 7 REGISTRY COPYRIGHT 1994 ACS 

L-valyl]-L-prolyl]-L-tryptophyl]-L-asparaginyl]-L-.alpha.-glutamyl]
L-threonyl]- (9CI) (CA INDEX NAME)
OTHER NAMES:

CN Autotaxin fragment (human)
SQL 9

MF C53 H73 N11 O17

L3 ANSWER 6 OF 7 REGISTRY COPYRIGHT 1994 ACS 

S "USEE'S- [^N-L-gluta.inyl-L-alanyl, -L-. alpha . -glutamyl ]-L- 

valyl]- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN Autotaxin fragment (human) 
SQL 5 

MF C21 H36 N6 O10

L3 ANSWER 7 OF 7 REGISTRY COPYRIGHT 1994 ACS 

Si iil^^^-L-^^^a-L-tyrosyl-L-prolyD-L-alanyH-L-phenylaianyl,. 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN Autotaxin fragment (human) 
SQL 5 

MF C32 H44 N6 O7

FILE 'CAPREVIEWS' ENTERED AT 14:14:33 ON 21 SEP 94 
USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT
COPYRIGHT (C) 1994 American Chemical Society (ACS) 

FILE COVERS CURRENT RECORDS AND IS UPDATED DAILY 
FILE LAST UPDATED: 20 SEP 1994 (940920/ED) 



Some records may not contain a volume, issue, and page 
number, because they are from advance abstracts of 
articles accepted for publication in journals published 
by the ACS. These abstracts will appear first in 
Advance ACS Abstracts. Please enter HELP ACS JOURNALS to 
see a list of the advance ACS journal titles. 

To help control your online searching costs, consider using the
HCAPREVIEWS File when conducting SmartSELECT searches with large
numbers of terms.



L4 0 L1

=> fil reg; s l1 not l3

FILE 'REGISTRY' ENTERED AT 14:14:49 ON 21 SEP 94

USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT 

COPYRIGHT (C) 1994 American Chemical Society (ACS) 

STRUCTURE FILE UPDATES: 17 SEP 94 HIGHEST RN 157748-89-5 
DICTIONARY FILE UPDATES: 20 SEP 94 HIGHEST RN 157748-89-5 
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conducting SmartSELECT searches. 

L5 28 L1 NOT L3

=> fil ca; s l5

FILE 'CA' ENTERED AT 14:15:01 ON 21 SEP 94

USE IS SUBJECT TO THE TERMS OF YOUR CUSTOMER AGREEMENT
COPYRIGHT (C) 1994 AMERICAN CHEMICAL SOCIETY (ACS) 

FILE COVERS 1967 - 17 Sep 1994 (940917/ED) VOL 121 ISS 12 

To help control your online searching costs, consider using the
HCA File when conducting SmartSELECT searches with large
numbers of terms.

THE BASIC INDEX NOW INCLUDES ABSTRACT TEXT. SEE NEWS FOR DETAILS. 

L6 26 L5 

=> s l6 not l2

L7 26 L6 NOT L2 
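
The L-number arithmetic in this part of the session (L5 = L1 NOT L3, L7 = L6 NOT L2) behaves like set difference on answer sets. A small sketch with placeholder record identifiers illustrates why 35 answers minus 7 gives 28; the integer IDs are hypothetical stand-ins, not real Registry Numbers.

```python
# Placeholder answer sets: L1 has 35 records, L3 is a 7-record subset of L1.
l1 = set(range(1, 36))
l3 = set(range(1, 8))

# "L1 NOT L3" keeps the records in L1 that are not in L3.
l5 = l1 - l3

print(len(l1), len(l3), len(l5))
```

The same reasoning explains why L7 keeps all 26 answers of L6: the single record in L2 is not among them, so the difference removes nothing.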



=> d 1-26 .bev; sel hit l7 1-26 rn



L7 ANSWER 1 OF 26 CA COPYRIGHT 1994 ACS
AN
TI Human intestinal VIP receptor: cloning and functional expression of
two cDNA encoding proteins with different N-terminal domains
SO Biochem. Biophys. Res. Commun. (1994), 200(2), 769-76
CODEN: BBRCA9; ISSN: 0006-291X
AU Couvineau, Alain; Rouyer-Fessard, Christiane; Darmoul, Dalila;
Maoret, Jean Jose; Carrero, Isabel; Ogier-Denis, Eric; Laburthe,
Marc
PY 1994

L7 ANSWER 2 OF 26 CA COPYRIGHT 1994 ACS 

TI A transforming fragment within the direct repeat region of human
herpesvirus type 6 that transactivates HIV-1
SO Oncogene (1994), 9(4), 1167-75 

^o^^S^^i*^ KasfcancHi Fatah; Conner, 
Jay; Berneman, Zwi; Frenkel, Niza; Rosenthal, Leonard J. 
PY 1994 

L7 ANSWER 3 OF 26 CA COPYRIGHT 1994 ACS 

TI Sequencing and functional analysis of a 32,560 bp segment on the 

left arm of yeast chromosome II. Identification of 26 open reading 
frames, including the KIP1 and SEC17 genes 

SO Yeast (1993), 9(12), 1355-71 

*, SK^f^S" L££TSoh»-, vieren.eeXs , Fafcienne; Dubois. 

Evelyne; Messenguy, Francine 
PY 1993 

L7 ANSWER 4 OF 26 CA COPYRIGHT 1994 ACS 

TI Molecular cloning and expression of cDNA for vasoactive intestinal

polypeptide (VIP) receptor of rat 
SO Jpn. Kokai Tokkyo Koho, 14 pp. 

CODEN: JKXXAF 
IN Osada, Juichi; Ishihara, Takeshi 
AI JP 92-26607 920213 
PI JP 05255394 A2 931005 Heisei 
PY 1993 

L7 ANSWER 5 OF 26 CA COPYRIGHT 1994 ACS

TI Hsp90 homolog from Madagascar periwinkle (Catharanthus roseus): cDNA
sequence, regulation of protein expression and location in the
endoplasmic reticulum
SO Plant Mol. Biol. (1993), 23(3), 583-94
AU S ^"^runf ^cr™; Eichel, Johannes; Vetter, Hans
Peter; Schroeder, Joachim
PY 1993

L7 ANSWER 6 OF 26 CA COPYRIGHT 1994 ACS
AN 119:263762 CA
TI The partial 3'-conserved segment duplications in the integrons In6
from pSa and In7 from pDGO100 have a common origin
SO Plasmid (1993), 30(1), 39-50
CODEN: PLSMDX; ISSN: 0147-619X
AU Stokes, H. W.; Tomaras, C.; Parsons, Yvonne; Hall, Ruth M.
PY 1993



L7 ANSWER 7 OF 26 CA COPYRIGHT 1994 ACS
AN 119:247982 CA
TI Humanized monoclonal antibodies against human interleukin-4
SO PCT Int. Appl., 115 pp.
CODEN: PIXXD2
IN Abrams, John S.; Dalie, Barbara; Le, Hung V.; Miller, Kenneth;
Murgolo, Nicholas J.; Nguyen, Hanh; Pearce, Michael; Tindall,
Stephen; Zavodny, Paul J.
AI WO 93-US1301 930218
PI WO 9317106 A1 930902
PY 1993



L7 ANSWER 8 OF 26 CA COPYRIGHT 1994 ACS
AN 119:152250 CA
TI Cloning and functional expression of a human neuroendocrine
vasoactive intestinal peptide receptor
SO Biochem. Biophys. Res. Commun. (1993), 193(2), 546-53
CODEN: BBRCA9; ISSN: 0006-291X
AU Sreedharan, Sunil P.; Patel, Derek R.; Huang, Jin Xing; Goetzl,
Edward J.
PY 1993



L7 ANSWER 9 OF 26 CA COPYRIGHT 1994 ACS
AN 119:111597 CA
TI A pathogen-induced gene of barley encodes a HSP90 homolog showing
striking similarity to vertebrate forms resident in the endoplasmic
reticulum
SO Plant Mol. Biol. (1993), 21(6), 1097-108
CODEN: PMBIDB; ISSN: 0167-4412
AU Walther-Larsen, Haidee; Brandt, Jakob; Collinge, David B.;
Thordal-Christensen, Hans
PY 1993



L7 ANSWER 10 OF 26 CA COPYRIGHT 1994 ACS
AN 119:43542 CA
TI The primary structure of a protein containing a putative iron-sulfur
[6Fe-6S] prismane cluster from Desulfovibrio vulgaris
(Hildenborough)
SO Eur. J. Biochem. (1992), 208(2), 435-42
CODEN: EJBCAI; ISSN: 0014-2956
AU Stokkermans, Jack P. W. G.; Pierik, Antonio J.; Wolbert, Ronnie B.
G.; Hagen, Wilfred R.; Van Dongen, Walter M. A. M.; Veeger, Cees

L7 ANSWER 11 OF 26 CA COPYRIGHT 1994 ACS

TI The primary structure of a protein containing a putative iron-sulfur
[6Fe-6S] prismane cluster from Desulfovibrio desulfuricans (ATCC
27774)
SO Biochim. Biophys. Acta (1992), 1132(1), 83-7
AU Stokkermans, Jack P. W. G.; Van den Berg, Willy A. M.; Van Dongen,
Walter M. A. M.; Veeger, Cees
PY 1992

L7 ANSWER 12 OF 26 CA COPYRIGHT 1994 ACS

TI Cytochrome P-450terp. Isolation and purification of the protein and
cloning and sequencing of its operon
SO J. Biol. Chem. (1992), 267(20), 14193-203
CODEN: JBCHA3; ISSN: 0021-9258
AU Peterson, Julian A.; Lu, Jui Yun; Geisselsoder, Janet;
Graham-Lorence, Sandra; Carmona, Cynthia; Witney, Frank; Lorence,
Matthew C.
PY 1992

L7 ANSWER 13 OF 26 CA COPYRIGHT 1994 ACS

TI Complete nucleotide sequence of the Actinomyces viscosus T14V
sialidase gene: presence of a conserved repeating sequence among
strains of Actinomyces spp
SO Infect. Immun. (1993), 61(1), 109-16
CODEN: INFIBR; ISSN: 0019-9567
AU Yeung, Maria K.
PY 1993

L7 ANSWER 14 OF 26 CA COPYRIGHT 1994 ACS 

TI Functional expression and tissue distribution of a novel receptor 

for vasoactive intestinal polypeptide 
SO Neuron (1992), 8(4), 811-19 

CODEN: NERNET; ISSN: 0896-6273

AU Ishihara, Takeshi; Shigemoto, Ryuichi; Mori, Kensaku; Takahashi,

Kenji; Nagata, Shigekazu 
PY 1992 

L7 ANSWER 15 OF 26 CA COPYRIGHT 1994 ACS

TI Six nodulation genes of nod box locus 4 in Rhizobium meliloti are
involved in nodulation signal production: nodM codes for
D-glucosamine synthetase
SO Mol. Gen. Genet. (1991), 228(1-2), 113-24
CODEN: MGGEAE; ISSN: 0026-8925
AU Baev, Nedelcho; Endre, Gabriella; Petrovics, Gyorgy; Banfalvi,
Zsofia; Kondorosi, Adam
PY 1991

L7 ANSWER 16 OF 26 CA COPYRIGHT 1994 ACS 
AN 118:2872 CA 

TI Cloning, sequencing and expression of the sialidase gene from 

Actinomyces viscosus DSM 43798 
SO Biol. Chem. Hoppe-Seyler (1991), 372(12), 1065-72 

CODEN: BCHSEI; ISSN: 0177-3593 
AU Henningsen, Michaela; Roggentin, Peter; Schauer, Roland 
PY 1991 

L7 ANSWER 17 OF 26 CA COPYRIGHT 1994 ACS 
AN 117:68366 CA 

TI Chimeric and complementarity-determining region-grafted 

anti-carcinoembryonic antigen antibodies and their production 

SO PCT Int. Appl., 70 pp. 
CODEN: PIXXD2 

IN Adair, John Robert; Bodmer, Mark William; Mountain, Andrew; Owens, 

Raymond John 
AI WO 91-GB1108 910705 
PI WO 9201059 Al 920123 
PY 1992 

L7 ANSWER 18 OF 26 CA COPYRIGHT 1994 ACS 
AN 116:1417 CA 

TI The regions of sequence variation in caulimovirus gene VI 
SO Virology (1991), 182(2), 830-4 
CODEN: VIRLAX; ISSN: 0042-6822 
AU Sanger, Margaret; Daubert, Steve; Goodman, Robert M. 
PY 1991 

L7 ANSWER 19 OF 26 CA COPYRIGHT 1994 ACS 
AN 114:57704 CA 

TI The primary structure of DNA binding protein II from the extreme 

thermophilic bacterium Thermus thermophilus 
SO FEBS Lett. (1990), 273(1-2), 59-62 

CODEN: FEBLAL; ISSN: 0014-5793 
AU Zierer, Rainer; Choli, Dora 
PY 1990 

L7 ANSWER 20 OF 26 CA COPYRIGHT 1994 ACS 
AN 113:93447 CA 

TI The complete cDNA and polypeptide sequences of human erythroid 

. alpha . -spectrin 
SO J. Biol. Chem. (1990), 265(8), 4434-43 

CODEN: JBCHA3; ISSN: 0021-9258 
AU Sahr, Kenneth E. ; Laurila, Pekka; Kotula, Leszek; Scarpa, Alphonse 

L.; Coupal, Elaine; Leto, Thomas L. ; Linnenbach, Alban J.; 

Winkelmann, John C. ; Speicher, David W. ; et al. 
PY 1990 

L7 ANSWER 21 OF 26 CA COPYRIGHT 1994 ACS 
AN 113:92067 CA 

TI Structural and functional analysis of the mini-circle, a 



transposable element of Streptomyces coelicolor A3 (2) 
SO Mol. Microbiol. (1989), 3(10), 1307-18 

CODEN: MOMIEE; ISSN: 0950-382X 
AU Henderson, D. J.; Lydiate, D. J.; Hopwood, D. A. 
PY 1989 

L7 ANSWER 22 OF 26 CA COPYRIGHT 1994 ACS 

TI Characterization of the Schizosaccharomyces pombe ral2 gene 
implicated in activation of the rasl gene product 

SO Mol. Cell. Biol. (1989), 9(12), 5617-22 

CODEN: MCEBD4; ISSN: 0270-7306

AU Fukui, Yasuhisa; Miyake, Sanae; Satoh, Misako; Yamamoto, Masayuki

PY 1989 

L7 ANSWER 23 OF 26 CA COPYRIGHT 1994 ACS 

TI Rat P-45017.alpha. from testis: characterization of a full-length
cDNA encoding a unique steroid hydroxylase capable of catalyzing
both .DELTA.4- and .DELTA.5-steroid-17,20-lyase reactions

SO Mol. Endocrinol. (1989), 3(6), 968-75

CODEN: MOENEN; ISSN: 0888-8809

AU Fevold, H. Richard; Lorence, Matthew C.; McCarthy, John L.; Trant,
John M.; Kagimoto, Masaaki; Waterman, Michael R.; Mason, J. Ian

PY 1989 

L7 ANSWER 24 OF 26 CA COPYRIGHT 1994 ACS 

TI Rat testis P-45017.alpha. DNA: the deduced amino acid sequence,
expression and secondary structural configuration

SO Biochem. Biophys. Res. Commun. (1988), 157(2), 705-12

CODEN: BBRCA9; ISSN: 0006-291X

AU Namiki, Mikio; Kitamura, Masaya; Buczko, Ellen; Dufau, Maria L.

PY 1988 

L7 ANSWER 25 OF 26 CA COPYRIGHT 1994 ACS 

TI Pullulanase C gene of Klebsiella and its cloning and sequencing

SO Jpn. Kokai Tokkyo Koho, 12 pp.

CODEN: JKXXAF

IN Murooka, Yoshikatsu; Takizawa, Noboru; Katsuragi, Nobuhiro

AI JP 87-78355 870331 

PI JP 63245676 A2 881012 Showa 

PY 1988 

L7 ANSWER 26 OF 26 CA COPYRIGHT 1994 ACS 

TI Entire nucleotide sequence of the pullulanase gene of Klebsiella

aerogenes W70
SO J. Bacteriol. (1987), 169(5), 2301-6

CODEN: JOBAAY; ISSN: 0021-9193
AU Katsuragi, Nobuhiro; Takizawa, Noboru; Murooka, Yoshikatsu

PY 1987 
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DICTIONARY FILE UPDATES: 20 SEP 94 HIGHEST RN 157748-89-5 

TSCA INFORMATION NOW CURRENT THROUGH MAY 1994 

Please note that search-term pricing does apply when 
conducting SmartSELECT searches. 



1 110071-62-0/BI 

(110071-62-0/RN) 
1 110071-63-1/BI 

(110071-63-1/RN) 
1 121938-40-7/BI 

(121938-40-7/RN) 
1 147477-78-9/BI 

(147477-78-9/RN) 
1 121763-54-0/BI 

(121763-54-0/RN) 
1 127831-57-6/BI 

(127831-57-6/RN) 
1 128771-18-6/BI 

(128771-18-6/RN) 
1 128909-10-4/BI 

(128909-10-4/RN) 
1 131571-34-1/BI 

(131571-34-1/RN) 
1 137800-99-8/BI 

(137800-99-8/RN) 
1 142661-57-2/BI 

(142661-57-2/RN) 
1 144130-50-7/BI 

(144130-50-7/RN) 
1 144813-79-6/BI 

(144813-79-6/RN) 
1 147446-26-2/BI 

(147446-26-2/RN) 
1 147883-85-0/BI 

(147883-85-0/RN) 
1 148349-17-1/BI 

(148349-17-1/RN) 
1 148591-55-3/BI 

(148591-55-3/RN) 
1 149408-27-5/BI 



(149408-27-5/RN) 
1 150138-10-6/BI 

(150138-10-6/RN) 
1 151066-62-5/BI 

(151066-62-5/RN) 
1 151552-72-6/BI 

(151552-72-6/RN) 
1 152745-57-8/BI 

(152745-57-8/RN) 
1 154363-46-9/BI 

(154363-46-9/RN) 
1 155980-72-6/BI 

(155980-72-6/RN) 
1 156288-65-2/BI 

(156288-65-2/RN) 
1 156288-66-3/BI 

(156288-66-3/RN)

L8 26 (110071-62-0/BI OR 110071-63-1/BI OR 121938-40-7/BI OR
147477-78-9/BI OR 121763-54-0/BI OR 127831-57-6/BI OR
128771-18-6/BI OR 128909-10-4/BI OR 131571-34-1/BI OR
137800-99-8/BI OR 142661-57-2/BI OR 144130-50-7/BI OR
144813-79-6/BI OR 147446-26-2/BI OR 147883-85-0/BI OR
148349-17-1/BI OR 148591-55-3/BI OR 149408-27-5/BI OR
150138-10-6/BI OR 151066-62-5/BI OR 151552-72-6/BI OR
152745-57-8/BI OR 154363-46-9/BI OR 155980-72-6/BI OR
156288-65-2/BI OR 156288-66-3/BI)

=> d 1-26 .bevreg; fil hom

L8 ANSWER 1 OF 26 REGISTRY COPYRIGHT 1994 ACS

RN 156288-66-3 REGISTRY

CN Protein (human clone hIVR5 vasoactive intestinal peptide
receptor-related precursor reduced) (9CI) (CA INDEX NAME)

OTHER NAMES:

CN Protein (human clone hIVR5 vasoactive intestinal peptide
receptor-related precursor)

SQL 495

MF Unspecified
CI MAN

L8 ANSWER 2 OF 26 REGISTRY COPYRIGHT 1994 ACS

RN 156288-65-2 REGISTRY

CN Receptor, vasoactive intestinal polypeptide (human clone hIVR8
precursor reduced) (9CI) (CA INDEX NAME)

OTHER NAMES:

CN Vasoactive intestinal peptide/PACAP receptor (human clone hIVR8
precursor)
SQL 460

MF Unspecified
CI MAN



L8 ANSWER 3 OF 26 REGISTRY COPYRIGHT 1994 ACS 
RN 155980-72-6 REGISTRY 



CN Protein (human herpes virus 6 strain U1102 clone pNF1022 331-amino 

acid reduced) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN Genbank x73675-derived protein 
SQL 331 

MF Unspecified 
CI MAN 

L8 ANSWER 4 OF 26 REGISTRY COPYRIGHT 1994 ACS 
RN 154363-46-9 REGISTRY 

CN Protein (Saccharomyces cerevisiae clone .alpha. 1006. 13 gene YBL0511 

reduced) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN gene 6 YBL051?) :harOinyCeS Cerevisiae chrom °some II clone .alpha. 1006. 13 
SQL 418 

MF Unspecified 
CI MAN 



L8 ANSWER 5 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 152745-57-8 REGISTRY 

CN (i«r in ( cf !^ c N^r nthus roseus heat shock precursor reducea> 

OTHER NAMES: 

CN Protein HSP90 (Catharanthus roseus precursor) 

SQL 817 ' 

MF Unspecified 

CI MAN 



CN 



L8 ANSWER 6 OF 26 REGISTRY COPYRIGHT 1994 ACS 
RN 151552-72-6 REGISTRY 

(CA t ?NS E X P N^f d PDG01 °° integr ° n In7 3 41-amino acid reduced) (9CI) 
SQL 341 

MF Unspecified 

CI MAN 

L8 ANSWER 7 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 151066-62-5 REGISTRY 

CN INDEX NAME NOT YET ASSIGNED 

SQL 140 

MF Unspecified 

CI MAN 

L8 ANSWER 8 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 150138-10-6 REGISTRY 

° N (ZSTUT I^EX S N^ ******** an done LHT21 

OTHER NAMES: 

CN VIP receptor (human HT29 cell clone LHT21) 

SQL 457 ' 

MF Unspecified 

CI MAN 



L8 ANSWER 9 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Protein HSP 90 (barley clone pBT6-1 precursor reduced) (9CI) (CA
INDEX NAME)
OTHER NAMES:
CN Heat-shock protein GRP 94 homolog (Hordeum vulgare clone pBT6-1
precursor)
SQL 809

MF Unspecified
CI MAN

L8 ANSWER 10 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Protein (Desulfovibrio vulgaris clone pJSP9 prismane iron-sulfur
core-containing reduced) (9CI) (CA INDEX NAME)
SQL 553

MF Unspecified
CI MAN

L8 ANSWER 11 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Protein (Desulfovibrio desulfuricans clone pWBPS1 prismane
iron-sulfur core-containing reduced) (9CI) (CA INDEX NAME)
SQL 545

MF Unspecified
CI MAN

L8 ANSWER 12 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Cytochrome P 450 (Pseudomonas clone pT3/pT1 isoform terp reduced)
(9CI) (CA INDEX NAME)
SQL 428

MF Unspecified
CI MAN

L8 ANSWER 13 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Receptor, vasoactive intestinal polypeptide (rat clone pV19
precursor reduced) (9CI) (CA INDEX NAME)
SQL 459

MF Unspecified
CI MAN

L8 ANSWER 14 OF 26 REGISTRY COPYRIGHT 1994 ACS

CN Neuraminidase (Actinomyces viscosus clone pMY450-1 gene nanH
precursor reduced) (9CI) (CA INDEX NAME)
OTHER NAMES:
CN Sialidase (Actinomyces viscosus strain T14V clone pMY450-1 precursor
reduced)
SQL 901

MF Unspecified
CI MAN



L8 ANSWER 15 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 144813-79-6 REGISTRY 

CN Neuraminidase (Actinomyces viscosus strain DSM 43798 reduced) (9CI) 

(CA INDEX NAME) 

SQL 913 

MF Unspecified 

CI MAN 

L8 ANSWER 16 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 144130-50-7 REGISTRY 

CN Protein (Rhizobium meliloti clone pD85 gene nolF reduced) (9CI) (CA 

INDEX NAME) 

SQL 367 

MF Unspecified 

CI MAN 

L8 ANSWER 17 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 142661-57-2 REGISTRY

CN 1-145-Immunoglobulin G 1 (human-mouse clone pAL45 gHl-A5B7 

.gamma. 1 -chain anti-antigen CEA reduced) (9CI) (CA INDEX NAME) 

SQL 146 

MF Unspecified 

CI MAN 

L8 ANSWER 18 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 137800-99-8 REGISTRY 

CN Protein IBMP (figwort mosaic virus clone DxS reduced) (9CI) (CA 

INDEX NAME) 

SQL 512 

MF Unspecified 

CI MAN 

L8 ANSWER 19 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 131571-34-1 REGISTRY 

CN Protein II (Thermus thermophilus strain HB8 DNA-binding) (9CI) (CA 

INDEX NAME) 

SQL 95 

MF C458 H781 N127 O129 S

CI MAN 

L8 ANSWER 20 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 128909-10-4 REGISTRY 

CN Spectrin (human clone . alpha. 3/ . alpha. 37/ .alpha. 7 .alpha. -subunit 

precursor reduced) (9CI) (CA INDEX NAME) 

SQL 2426 

MF Unspecified 

CI MAN 

L8 ANSWER 21 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 128771-18-6 REGISTRY 

CN Protein (Streptomyces coelicolor strain 3(2) minicircle element 



122-amino acid) (9CI) (CA INDEX NAME) 

SQL 122 

MF Unspecified 

CI MAN 

L8 ANSWER 22 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 127831-57-6 REGISTRY 

CN Protein (Schizosaccharomyces pombe gene ral2 reduced) (9CI) (CA 

INDEX NAME) 

SQL 611 

MF Unspecified 

CI MAN 

L8 ANSWER 23 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 121938-40-7 REGISTRY 

CN Cytochrome P 450 (rat clone NA isoform 17. alpha, protein moiety 

reduced) (9CI) (CA INDEX NAME) 

SQL 507 

MF Unspecified 

CI MAN 

L8 ANSWER 24 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 121763-54-0 REGISTRY 

CN 2-1077-Pullulanase (Klebsiella pneumoniae protein moiety reduced) , 

N-(l-oxohexadecyl)- (9CI) (CA INDEX NAME) 

SQL 1076 

MF Unspecified 

CI MAN 

L8 ANSWER 25 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 110071-63-1 REGISTRY 

CN Pullulanase (Klebsiella pneumoniae protein moiety reduced) (9CI) 

(CA INDEX NAME) 

SQL 1077 

MF Unspecified 

CI MAN 

L8 ANSWER 26 OF 26 REGISTRY COPYRIGHT 1994 ACS 

RN 110071-62-0 REGISTRY 

CN Pullulanase (Klebsiella pneumoniae precursor protein moiety reduced) 

(9CI) (CA INDEX NAME) 

SQL 1096 

MF Unspecified 

CI MAN 



FILE 'HOME' ENTERED AT 14:16:45 ON 21 SEP 94

##### User: stic7!shears
##### Title: us-08-249-182-1.res
##### stic7
##### Printed: Wed 16:26 Sep 21 1994
##### Job number: MT66 1-7-87

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-08-249-182-1.res made by on Wed 21 Sep 94 12:06:27-PDT.

Query sequence being compared: US-08-249-182-1 (1-5)
Number of sequences searched: 5543
Number of scores above cutoff: 1605

Results of the initial comparison of US-08-249-182-1 (1-5) with:
File : /home/shears/loring/lorin«.pep



[Histogram: NUMBER OF SEQUENCES (logarithmic axis, ticks at 5, 10, 50,
100, 500, 1000, 5000, 10000) plotted against SCORE (0 to 4), with STDEV
marks at 1, 2 and 3]



PARAMETERS

Similarity matrix         Unitary    K-tuple                2
Mismatch penalty          1          Joining penalty        20
Gap penalty               1.00       Window size            5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40         Alignments to save     15
Optimized scores to save  0          Display context        50



SEARCH STATISTICS

Scores:      Mean      Median    Standard Deviation
             0         1         0.95

Times:       CPU              Total Elapsed
             00:00:10.85      00:00:19.00

Number of residues: 482836
Number of scores above cutoff: 1605

The scores below are sorted by initial score.
Significance is calculated based on initial score. 
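
The "Sig." column appears to express each initial score as a number of standard deviations above the mean, as the "standard deviations above mean" separators in the listing suggest. A minimal sketch of that calculation follows; the formula is an assumption inferred from the printout, not taken from FastDB documentation, and the function name is hypothetical.

```python
def significance(score, mean, std_dev):
    """Number of standard deviations by which `score` exceeds `mean`."""
    return (score - mean) / std_dev


# Printed statistics: mean 0, standard deviation 0.95.
for score, reported in [(4, 4.23), (3, 3.17)]:
    estimate = significance(score, mean=0, std_dev=0.95)
    print(score, round(estimate, 2), reported)
```

The estimates come out slightly below the reported 4.23 and 3.17, which would be consistent with the printed mean and standard deviation having been rounded for display.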

A 100% identical sequence to the query sequence was not found.
The list of best scores is:

                                                     Init.  Opt.
    Sequence Name  Description                Length Score Score   Sig. Frame

          **** 4 standard deviations above mean ****
 1. R37443   Autotaxin peptide ATX 18.             6     4     4   4.23     0
 2. R42697   p16 of nef peptide of HIV-1.         35     4     4   4.23     0
 3. R42698   p17 of nef peptide of HIV-1.         65     4     4   4.23     0
 4. R38893   Nef protein of HIV-1.               206     4     4   4.23     0

          **** 3 standard deviations above mean ****
 5. R34888   Human TSH residues 61-75.            15     3     3   3.17     0
 6. R30061   Human PMP sequence used to ra        16     3     3   3.17     0
 7. P30110   Sequence of VP1 capisid prote        18     3     3   3.17     0
 8. P90011   Synthetic peptide pol-1 corre        18     3     3   3.17     0
 9. R36792   Prion protein region A subfra        19     3     3   3.17     0
10. R37300   E.coli shiga-like toxin segme        20     3     3   3.17     0
11. P30108   Sequence of VP1 capisid prote        20     3     3   3.17     0
12. P7o462   Sequence of C. trachomatis se        22     3     3   3.17     0
13. DOO A CO Sequence of C. trachomatis se        22     3     3   3.17     0
14. P98454   Sequence of C. trachomatis se        22     3     3   3.17     0
15. P98450   Sequence of C. trachomatis se        22     3     3   3.17     0
16. P98446   Sequence of C. trachomatis se        22     3     3   3.17     0
17. P98442   Sequence of C. trachomatis se        22     3     3   3.17     0
18. P98466   Sequence of C. trachomatis se        22     3     3   3.17     0
19. R41085   HTLV-I and HTLV-II peptide I-        23     3     3   3.17     0
20. R11555   Native HIV gp160 peptide (2)         24     3     3   3.17     0
21. R30757   HIV discriminatory peptide, H        27     3     3   3.17     0
22. R3S030   Ovine prion protein region E         30     3     3   3.17     0
23. R38027   Bovine prion protein region E        30     3     3   3.17     0
24. R31565   HTLV-II epitope, HTLV-II env-        30     3     3   3.17     0
25. R31547   HTLV-I epitope, HTLV-I env-4.        31     3     3   3.17     0
26. R31546   HTLV-I epitope, HTLV-I env-3.        31     3     3   3.17     0
27. P30122   Sequence of VP1 capsid protei        31     3     3   3.17     0
28. P30107   Sequence of VP1 capisid prote        31     3     3   3.17     0
29. P30106   Sequence of VP1 capisid prote        31     3     3   3.17     0
30. P90947   Peptide 240.                         31     3     3   3.17     0
31. R39873   C peptide RV-C6, residues 119        34     3     3   3.17     0
32. R44497   Sequence of the HIV-1 epitope        35     3     3   3.17     0
33. R39893   Lipopeptide TPRV-C6.                 37     3     3   3.17     0
34. R41092   HTLV-I and HTLV-II peptide II        40     3     3   3.17     0
35. P90948   Peptide 242.                         41     3     3   3.17     0
36. R3S503   P. aeruginosa pilin protein s        53     3     3   3.17     0
37. R33877   Polypeptide p380.LG comprisin        57     3     3   3.17     0
38. B42575   Abl heavy chain variable reg        101     3     3   3.17     0
39. R30763   Heavy chain variable domain o       120     3     3   3.17     0
40. R47104   Human NT-4 encode by genomic        132     3     3   3.17     0

1. US-08-249-182-1 (1-5) 

R37443 Autotaxin peptide ATX 18. 

ID R37443 standard; peptide; 6 AA. 

AC R37443; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 18. 



KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW immunostains; disease outcome prediction; therapy choice; 

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M. 

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 18. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 6 AA; 

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 1 H; 

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 0 S; 0 T; 1 W; 0 Y; 1 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 4.23 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X   X 
WHVAR 
|||| 
WHVAAN 
X X 
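
The Matches, Mismatches and Residue Identity figures for an ungapped alignment like the one above can be recomputed by hand. The sketch below assumes that identity is the number of matching positions divided by the length of the query segment, which agrees with the 4 of 5 = 80% printed here; the helper name `ungapped_identity` is hypothetical and is not part of FastDB.

```python
def ungapped_identity(query: str, target_window: str) -> tuple[int, int, float]:
    """Return (matches, mismatches, percent identity) for two equal-length segments."""
    if len(query) != len(target_window):
        raise ValueError("ungapped comparison needs equal-length segments")
    matches = sum(q == t for q, t in zip(query, target_window))
    mismatches = len(query) - matches
    return matches, mismatches, 100.0 * matches / len(query)


# Query residues 1-5 (WHVAR) against the first five residues of R37443 (WHVAAN).
matches, mismatches, identity = ungapped_identity("WHVAR", "WHVAA")
print(matches, mismatches, f"{identity:.0f}%")  # prints: 4 1 80%
```

The same call with a window that shares three residues with the query gives the 60% reported for the hits in the lower significance band.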



2. US-08-249-182-1 (1-5) 

R42697 p16 of nef peptide of HIV-1. 

ID R42697 standard; protein; 35 AA. 

AC R42697; 

DT 10-NOV-1993 (first entry) 

DE p16 of nef peptide of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

FH Key Location/Qualifiers 

FT Modified_site 35 

FT /note= "Cys(acetamidomethyl)" 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Page 3; 15pp; English. 

CC The peptide is expressed in vivo in HIV infected patients before 

CC detectable amts. of p25, gp110 and gp41 are expressed. Thus, it 

CC can be used in assays for early detection of HIV. 

CC It can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptide may be used in viral vaccines. 

SQ Sequence 35 AA; 

SQ 2 A; 4 R; 1 N; 3 D; 0 B; 1 C; 0 Q; 5 E; 0 Z; 1 G; 3 H; 

SQ 0 I; 3 L; 1 K; 1 M; 3 F; 2 P; 1 S; 0 T; 1 W; 1 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:15-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 4.23 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

                    X   X 
                    WHVAR 
                     |||| 
GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
        10        20    X    30 



3. US-08-249-182-1 (1-5) 

R42698 p17 of nef peptide of HIV-1. 

ID R42698 standard; protein; 65 AA. 

AC R42698; 

DT 10-NOV-1993 (first entry) 

DE p17 of nef peptide of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

FH Key Location/Qualifiers 

FT Modified_site 1 

FT /note= "Cys(acetamidomethyl)" 

FT Modified_site 65 

FT /note= "Cys(acetamidomethyl)" 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Page 3; 15pp; English. 

CC The peptide is expressed in vivo in HIV infected patients before 

CC detectable amts. of p25, gp110 and gp41 are expressed. Thus, it 

CC can be used in assays for early detection of HIV. 

CC It can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptide may be used in viral vaccines. 

SQ Sequence 65 AA; 

SQ 3 A; 4 R; 3 N; 4 D; 0 B; 2 C; 0 Q; 9 E; 0 Z; 2 G; 5 H; 

SQ 0 I; 7 L; 4 K; 1 M; 3 F; 5 P; 3 S; 1 T; 1 W; 2 Y; 6 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:15-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 4.23 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

                                                  X   X 
                                                  WHVAR 
                                                   |||| 
CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
        10        20        30        40        50    X    60 



4. US-08-249-182-1 (1-5) 

R38893 Nef protein of HIV-1. 

ID R38893 standard; Protein; 206 AA. 

AC R38893; 

DT 10-NOV-1993 (first entry) 

DE Nef protein of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Fig 2; 15pp; English. 

CC The nef protein comprises peptides which are expressed in vivo in HIV 

CC infected patients before detectable amts. of p25, gp110 and gp41 are 

CC expressed. Thus, they can be used in assays for early detection of HIV. 

CC They can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptides may be used in viral vaccines. 

SQ Sequence 206 AA; 

SQ 17 A; 13 R; 6 N; 10 D; 0 B; 3 C; 6 Q; 19 E; 0 Z; 16 G; 9 H; 

SQ 4 I; 17 L; 10 K; 4 M; 7 F; 15 P; 11 S; 10 T; 7 W; 7 Y; 15 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:15-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 4.23 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

                                                  X   X 
                                                  WHVAR 
                                                   |||| 
CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
      150       160       170       180       190 X   X 200 



5. US-08-249-182-1 (1-5) 

R34888 Human TSH residues 61-75. 

ID R34888 standard; peptide; 15 AA. 

AC R34888; 

DT 23-JUL-1993 (first entry) 

DE Human TSH residues 61-75. 

KW Thyroid stimulating hormone; beta subunit; inhibitor; cAMP; 

KW autoantibodies; Graves' disease. 

OS Synthetic. 

PN US5196513-A. 



PF 05-SEP-1989; 403564. 

PR 05-SEP-1989; US-403564. 

PR 25-JUL-1991; US-736030. 

PA (MAYO-) MAYO FOUNDATION. 

PI McCormick DJ, Morris JC, Ryan RJ; 

DR WPI; 93-116855/14. 

PT Synthetic peptide derived from beta-sub-unit residues 101-112 of 

PT TSH - inhibits binding of human thyroid stimulating hormone to 

PT thyroid membrane, useful in immuno-therapy and immuno-diagnosis 

PT e.g. of Graves disease 

PS Disclosure; Page 12; 16pp; English. 

CC The sequence represents amino acid residues 61-75 of the beta subunit 

CC of human thyroid stimulating hormone (hTSH). This synthetic peptide 

CC is capable of inhibiting binding of TSH to human thyroid membranes, 

CC inhibiting TSH-mediated cAMP generation and inhibiting the 

CC stimulatory effect of thyroid stimulating autoantibodies. The 

CC peptide may be used as a diagnostic and therapeutic reagent for 

CC thyroid diseases such as Graves' disease. The peptide may also be 

CC used to produce antibodies which can be used to measure thyroid 

CC stimulating autoantibody levels or to therapeutically neutralise 

CC these autoantibodies. 

CC See also R34882-92. 

SQ Sequence 15 AA; 

SQ 1 A; 0 R; 0 N; 0 D; 0 B; 1 C; 0 Q; 1 E; 0 Z; 1 G; 1 H; 

SQ 1 I; 1 L; 0 K; 0 M; 1 F; 3 P; 0 S; 1 T; 0 W; 1 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:46-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

        X   X 
        WHVAR 
         ||| 
TVEIPGCPLHVAPYF 
        10  X 



6. US-08-249-182-1 (1-5) 

R30061 Human PMP sequence used to raise antibodies. 

ID R30061 standard; peptide; 16 AA. 

AC R30061; 

DT 29-APR-1993 (first entry) 

DE Human PMP sequence used to raise antibodies. 

KW Charcot-Marie-Tooth disease; hypertrophic neuropathy; autosomal; 

KW CMT1A; hybridisation; PMP; peripheral myelin protein; Schwann. 

OS Homo sapiens. 

PN WO9221694-A. 

PD 10-DEC-1992. 

PF 05-JUN-1992; U04833. 

PR 06-JUN-1991; US-711615. 

PR 06-MAY-1992; US-879623. 

PA (BAYU ) BAYLOR COLLEGE MEDICINE. 

PA (STRD ) UNIV LELAND STANFORD JUNIOR. 

PI Cardenas OS, De Oca-Luna RM, Deleon M, Lupski JR, Patel PI; 

PI Shooter EM, Snipes GJ, Suter U, Welcher A; 

DR WPI; 92-433607/52. 

PT Human peripheral myelin protein and nucleic acid sequence - used 

PT for diagnosis and treatment of hypertrophic neuropathies e.g. 

PT autosomal dominant Charcot-Marie-Tooth disease 

PS Disclosure; Fig 20; 93pp; English. 

CC The sequence is that of a fragment of human PMP, a peripheral myelin 

CC protein characterised in that it is expressed predominantly in 

CC peripheral Schwann cells, has a mol. wt. of 20 kD and has 



\#v* suusuaribioi sr^ucnuc nunuiuyy a rnr uutu« ir un nun an prr ijjnrr ai 

CC nervous tissue. The peptide was used to raise antibodies which may 

CC be used in diagnosis of Charcot-Marie-Tooth type 1 disease 

CC (CMT1A), which is a common peripheral neuropathy in humans. The 

CC DNA based test can be performed on DNA isolated from peripheral blood 

CC at less than one tenth of the cost of present diagnostic methods, and 

CC the disease may be detected early at the genetic level without the need 

CC for symptoms. 

CC See also R30056-60. 

SQ Sequence 16 AA; 

SQ 0 A; 2 R; 2 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 2 H; 

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 2 S; 0 T; 1 W; 3 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:32-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

       X   X 
       WHVAR 
       ||| 
YRVRHSEWHVNNDYSY 
        10 X 



7. US-08-249-182-1 (1-5) 

P30110 Sequence of VP1 capisid protein residues 141-160 f 

ID P30110 standard; Protein; 18 AA. 

AC P30110; 

DT 03-APR-1992 (first entry) 

DE Sequence of VP1 capisid protein residues 141-160 from the amino- 

DE terminus, FMDV, Tubingen type O, subtype 1, strain Kaufbeuren. 

KW Antigen; Picornavirus; capsid protein; antibody; detection; 

KW vaccine; diagnosis. 

OS Foot and mouth disease virus. 

PN WO8303547-A. 

PD 27-OCT-1983. 

PF 14-APR-1983; 002644. 

PR 14-APR-1982; US-368308. 

PR 25-MAR-1983; US-478847. 

PR 20-SEP-1984; US-653475. 

PR 18-DEC-1984; US-682819. 

PA (BITT/) BITTLE J L. 

PA (SCRI-) SCRIPPS CLINIC & RE. 

PI Bittle JL, Lerner RA. 

DR WPI; 83-807942/44. 

PT Antigenic peptide(s) corresp. to picornavirus capsid protein - 

PT useful in prodn. of vaccines and in diagnostic tests 

PS Disclosure; Page 14; 90pp; English. 

CC The peptides of the invention corresp. to a region on the antigenic 

CC Picornavirus capsid protein. The capsid protein FMDV VP1 or polio 

CC virus VP1. When linked to carriers the peptides are immunogenic. 

CC Dose is 20 ug-2mg peptide for inoculations. 

SQ Sequence 18 AA; 

SQ 2 A; 3 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 3 G; 0 H; 

SQ 0 I; 2 L; 0 K; 0 M; 1 F; 2 P; 2 S; 0 T; 0 W; 0 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:59-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X   X 
WHVAR 
||| 



auvnuvruaLfirnvnnLr 

10 X X 



8. US-08-249-182-1 (1-5) 

P90011 Synthetic peptide pol-1 corresp. to residues 822-8 

ID P90011 standard; protein; 18 AA. 

AC P90011; 

DT l-NQV-1989 (first entry) 

DE Synthetic peptide pol-1 corresp. to residues 822-838 of 

DE polymerase gene from hepatitis B ayw genome 

KW Hepatitis B virus peptide; determines antibody; synthetic; 

KW polymerase. 

OS Hepatitis virus 

PN WO8904964-A. 

PD 01-JUN-1989. 

PF 15-NOV-1988; U04076. 

PR 16-NOV-1987; US-120979. 

PA (FOXC) Fox Chase Cancer CE. 

PI Feitelson M, Blumberg BS, Millman I; 

DR WPI; 89-178459/24. 

PT Immunoassay for antibody to hepatitis virus DNA polymerase 

PT - representing early marker of hepatitis infections 

PS Claim 3; Page 30; 34pp; English. 

CC Synthetic peptide, pol-1, corresp. to amino acid residues 

CC 822-838 of the polymerase gene from hepatitis B ayw genome. 

CC Used for raising antibodies to determine hepatitis infection 

CC at an early stage, and to reduce the incidence of post-transfusion 

CC hepatitis B. 

SQ Sequence 18 AA; 

SQ 2 A; 2 R; 0 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 0 G; 2 H; 

SQ 0 I; 1 L; 0 K; 0 M; 1 F; 4 P; 1 S; 0 T; 1 W; 0 Y; 3 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:56:48-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 



10 X 



9. US-08-249-182-1 (1-5) 

R36792 Prion protein region A subfragment #1. 

ID R36792 standard; protein; 19 AA. 

AC R36792; 

DT 14-OCT-1993 (first entry) 

DE Prion protein region A subfragment #1. 

KW Antigen; prion; protein; region; frame shift; repeat; mutation; PrPc; 

KW FSa; FSb; subfragment; antibody; treatment; spongiform encephalopathy; 

KW human; sheep; cattle; cellular binding; aggregation; mammal; scrapie; 

KW immune system; PrPsc; retro-inverso peptide; enzymatic degradation; 

KW resistance. 

OS Synthetic. 

FH Key Location/Qualifiers 

FT Misc_difference 1 

FT /note= "One or more residues or may be absent" 

FT Misc_difference 2 

FT /note= "May be absent" 

FT Misc_difference 3 

FT /note= "May be absent" 



FT Misc_difference 4 



FT /note= "May be absent" 

FT Misc_difference 5 

FT /note= "May be absent" 

FT Misc_difference 15 

FT /note= "May be absent" 

FT Misc_difference 16 

FT /note= "May be absent" 

FT Misc_difference 17 

FT /note= "May be absent" 

FT Misc_difference 18 

FT /note= "May be absent" 

FT Misc_difference 19 

FT /note= "One or more residues or may be absent" 

PN WO9311155-A. 

PD 10-JUN-1993. 

PF 03-DEC-1992; G02246. 

PR 03-DEC-1991; GB-025747. 

PR 10-JUL-1992; GB-014663. 

PA (PROT-) PROTEUS MOLECULAR DESIGN LTD. 

PI Fishleigh RV, Mee RP, Robson B; 

DR WPI; 93-196994/24. 

PT New polypeptide(s) contg. antigenic site of prion protein - 

PT useful for treatment and diagnosis of mammalian encephalopathies 

PT e.g. Creutzfeld-Jacob disease and kuru 

PS Claim 7; Page 63; 82pp; English. 

CC The sequences given in R36792-95 represent polypeptide subfragments 

CC derived from an antigenic site, region A, of a prion protein. Prion 

CC proteins comprise six regions of interest (A-F), and two related frame 

CC shift peptides sequences caused by a repeating section in region E 

CC having a nucleic acid coding sequence frame shift mutation of +1 (FSa) 

CC or -1 (FSb). The full length peptide region (see also R36786-89) and 

CC these subfragments, and antibodies raised against these, may be used 

CC to treat or prevent spongiform encephalopathy in humans, sheep or 

CC cattle. They can be used to block cellular binding and aggregation 

CC of prion proteins and to stimulate the mammalian immune system. These 

CC peptides may be used to distinguish between the normal form of prion 

CC protein (PrPc) and the scrapie-associated form (PrPsc). These 

CC peptides may include rare or synthetic amino acids or a retro-inverso 

CC peptide modification to improve resistance to enzymatic degradation. 

SQ Sequence 19 AA; 

SQ 6 A; 0 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 6 G; 1 H; 

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 0 Y; 3 V; 

SQ 2 Others; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:15-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X   X 
WHVAR 
 ||| 
XHVAGAAAAGAVVGGLGGX 
X   X   10 



10. US-08-249-182-1 (1-5) 

R37300 E.coli shiga-like toxin segment. 

ID R37300 standard; Protein; 20 AA. 

AC R37300; 

DT 13-SEP-1993 (first entry) 

DE E.coli shiga-like toxin segment. 

KW Type I ribosome-inactivating protein; ricin; momordin; 

KW immunoconjugate; autoimmune disease; cell killing; toxin; 

KW human engineered antibody; variable region; light chain; 

KW cell targetting; chimeric antibody; SLT. 

OS Escherichia coli. 

FH Key Location/Qualifiers 

FT Disulfide_bond 1..20 

FT /note= "intervening loop includes protease 

FT sensitive amino acid sequence" 

PN WO9309130-A. 

PD 13-MAY-1993. 

PF 04-NOV-1992; U09487. 

PR 04-NOV-1991; US-787567. 

PR 19-JUN-1992; US-901707. 

PA (XOMA ) XOMA CORP. 

PI Berhard SL, Better MD, Carroll SF, Lane JA, Lei SP. 

DR WPI; 93-167617/20. 

PT Analogues of type I ribosome inactivating protein - useful as 

PT cytotoxic agents, immuno toxins for treating auto immune diseases, 

PT cancer, graft versus host disease and selective cell killing in-vivo 

PS Example 10; Page 114; 163pp; English. 

CC The invention covers analogues of the plant type I RIP gelonin 

CC which have a non-naturally occurring Cys residue in a position 

CC which enables the analogue to be conjugated via a disulphide 

CC linkage to a molecule which specifically binds to a target cell. 

CC Pref. target-cell binding molecules are antibodies or their 

CC fragments, esp. human engineered H65 antibody fragments. Fusion 

CC constructs were assembled that included a natural sequence gelonin 

CC gene fused to an H65 truncated heavy chain gene or an H65 light 

CC chain (kappa) gene. A DNA linker encoding a peptide segment of the 

CC E.coli shiga-like toxin was inserted between the gelonin gene and 

CC the Ab gene. The resulting immunoconjugates can be used as cytotoxic 

CC therapeutic agents. 

SQ Sequence 20 AA; 

SQ 3 A; 2 R; 0 N; 1 D; 0 B; 2 C; 0 Q; 1 E; 0 Z; 0 G; 3 H; 

SQ 0 I; 0 L; 0 K; 2 M; 1 F; 1 P; 3 S; 0 T; 0 W; 0 Y; 1 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:09-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

     X   X 
     WHVAR 
       ||| 
CHHHASRVARMASDEFPSMC 
     X  10        20 



11. US-08-249-182-1 (1-5) 

P30108 Sequence of VP1 capisid protein residues 141-160 f 

ID P30108 standard; Peptide; 20 AA. 

AC P30108; 

DT 03-APR-1992 (first entry) 

DE Sequence of VP1 capisid protein residues 141-160 from the amino- 

DE terminus, FMDV, Tubingen type O, subtype 1, strain Kaufbeuren. 

KW Antigen; Picornavirus; capsid protein; antibody; detection; 

KW vaccine; diagnosis. 

OS Foot and mouth disease virus. 

PN WO8303547-A. 

PD 27-OCT-1983. 

PF 14-APR-1983; 002644. 

PR 14-APR-1982; US-368308. 

PR 25-MAR-1983; US-478847. 

PR 20-SEP-1984; US-653475. 

PR 18-DEC-1984; US-682819. 

PA (BITT/) BITTLE J L. 

PA (SCRI-) SCRIPPS CLINIC & RE. 



PI Bittle JL, Lerner RA. 

DR WPI; 83-807942/44. 

PT Antigenic peptide(s) corresp. to picornavirus capsid protein - 

PT useful in prodn. of vaccines and in diagnostic tests 

PS Disclosure; Page 14; 90pp; English. 

CC The peptides of the invention corresp. to a region on the antigenic 

CC Picornavirus capsid protein. The capsid protein FMDV VP1 or polio 

CC virus VP1. When linked to carriers the peptides are immunogenic. 

CC Dose is 20 ug-2mg peptide for inoculations. 

SQ Sequence 20 AA; 

SQ 2 A; 2 R; 1 N; 1 D; 0 B; 0 C; 2 Q; 0 E; 0 Z; 1 G; 0 H; 

SQ 0 I; 4 L; 1 K; 0 M; 0 F; 2 P; 0 S; 1 T; 0 W; 0 Y; 3 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:59-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

            X   X 
            WHVAR 
              ||| 
VPNLRGDLQVLAQKVARTLP 
        10  X   X 20 



12. US-08-249-182-1 (1-5) 

P98462 Sequence of C. trachomatis serovar L3 major outer 

ID P98462 standard; Protein; 22 AA. 

AC P98462; 

DT 06-MAR-1992 (first entry) 

DE Sequence of C. trachomatis serovar L3 major outer membrane protein (MOMP) 

DE variable domain (VD) gene L3-VDIV base pairs 487-552 

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine; 

KW diagnosis; serotyping; non-immunologic assay; ss. 

OS Chlamydia trachomatis. 

FH Key Location/Qualifiers 

FT CDS 1..66 

FT /*tag= a 

PN US7324664-A. 

PD 29-AUG-1989. 

PF 17-MAR-1989; 324664. 

PR 17-MAR-1989; US-324664. 

PA (USSH ) US DEPT HEALTH & HUMAN. 

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG; 

DR WPI; 89-339697/46. 

DR P-PSDB; P98467. 

PT Chlamydia trachomatis genes - used for determn. of nucleotide and 

PT amino sequences of the variable domains of the major outer 

PT membrane proteins 

PS Disclosure; Fig 19; 49pp; English. 

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis 

CC serovars and the amino acid sequences were deduced. The MOMP VDs 

CC with the greatest total hydrophilicity and charge values were found 

CC to be the location of antigenic determinants recognised by MOMP 

CC specific monoclonal antibodies. The nucleotide, amino acid 

CC sequences and hydrophilicity/charge value analyses will assist in 

CC the selection of appropriate MOMP antigenic determinants to be used 

CC in the construction of synthetic peptides, subunits or recombinant 

CC chlamydial vaccines. This will allow the prodn. or reagents and 

CC methodologies applicable in the development of new diagnostic tests 

CC for serotyping. 

SQ Sequence 22 AA; 

SQ 3 A; 1 R; 2 N; 2 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 1 G; 0 H; 

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 3 P; 1 S; 4 T; 0 W; 0 Y; 2 V; 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

                X   X 
                WHVAR 
                  ||| 
AAPTTSDVEGLQNDPTTNVARP 
        10      X 20 



13. US-08-249-182-1 (1-5) 

P98458 Sequence of C. trachomatis serovar J major outer m 

ID P98458 standard; Protein; 22 AA. 

AC P98458; 

DT 06-MAR-1992 (first entry) 

DE Sequence of C. trachomatis serovar J major outer membrane protein (MOMP) 

DE variable domain (VD) J-VDI encoded by base pairs 256-321 

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine; 

KW diagnosis; serotyping; non-immunologic assay. 

OS Chlamydia trachomatis. 

PN US7324664-A. 

PD 29-AUG-1989. 

PF 17-MAR-1989; 324664. 

PR 17-MAR-1989; US-324664. 

PA (USSH ) US DEPT HEALTH & HUMAN. 

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG; 

DR WPI; 89-339697/46. 

DR N-PSDB; N97093. 

PT Chlamydia trachomatis genes - used for determn. of nucleotide and 

PT amino sequences of the variable domains of the major outer 

PT membrane proteins 

PS Disclosure; Fig 17; 49pp; English. 

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis 

CC serovars and the amino acid sequences were deduced. The MOMP VDs 

CC with the greatest total hydrophilicity and charge values were found 

CC to be the location of antigenic determinants recognised by MOMP 

CC specific monoclonal antibodies. The nucleotide, amino acid 

CC sequences and hydrophilicity/charge value analyses will assist in 

CC the selection of appropriate MOMP antigenic determinants to be used 

CC in the construction of synthetic peptides, subunits or recombinant 

CC chlamydial vaccines. This will allow the prodn. or reagents and 

CC methodologies applicable in the development of new diagnostic tests 

CC for serotyping. 

SQ Sequence 22 AA; 

SQ 4 A; 1 R; 2 N; 2 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 1 G; 0 H; 

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 3 P; 1 S; 4 T; 0 W; 0 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:44-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions - 0 

X X 
WHVAR 



10 X 20 



14. US-08-249-182-1 (1-5) 

P98454 Sequence of C. trachomatis serovar I major outer m 



ID P98454 standard; Protein; 22 AA. 

AC P98454; 

DT 06-MAR-1992 (first entry) 

DE Sequence of C. trachomatis serovar I major outer membrane protein (MOMP) 

DE variable domain (VD) I-VDI encoded by base pairs 256-321 

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine; 

KW diagnosis; serotyping; non-immunologic assay. 

OS Chlamydia trachomatis. 

PN US7324664-A. 

PD 29-AUG-1989. 

PF 17-MAR-1989; 324664. 

PR 17-MAR-1989; US-324664. 

PA (USSH ) US DEPT HEALTH & HUMAN. 

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG; 

DR WPI; 89-339697/46. 

DR N-PSDB; N97089. 

PT Chlamydia trachomatis genes - used for determn. of nucleotide and 

PT amino sequences of the variable domains of the major outer 

PT membrane proteins 

PS Disclosure; Fig 16; 49pp; English. 

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis 

CC serovars and the amino acid sequences were deduced. The MOMP VDs 

CC with the greatest total hydrophilicity and charge values were found 

CC to be the location of antigenic determinants recognised by MOMP 

CC specific monoclonal antibodies. The nucleotide, amino acid 

CC sequences and hydrophilicity/charge value analyses will assist in 

CC the selection of appropriate MOMP antigenic determinants to be used 

CC in the construction of synthetic peptides, subunits or recombinant 

CC chlamydial vaccines. This will allow the prodn. or reagents and 

CC methodologies applicable in the development of new diagnostic tests 

CC for serotyping. 

SQ Sequence 22 AA; 

SQ 4 A; 1 R; 2 N; 2 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 1 G; 0 H; 

SQ 0 I; 1 L; 1 K; 0 M; 0 F; 3 P; 0 S; 4 T; 0 W; 0 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:44-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

                X   X 
                WHVAR 
                  ||| 
AAPTTKDVAGLENDPTTNVARP 
        10      X 20 



15. US-08-249-182-1 (1-5) 

P98450 Sequence of C. trachomatis serovar H major outer m 

ID P98450 standard; Protein; 22 AA. 

AC P98450; 

DT 06-MAR-1992 (first entry) 

DE Sequence of C. trachomatis serovar H major outer membrane protein (MOMP) 

DE variable domain (VD) H-VDI encoded by base pairs 256-321 

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine; 

KW diagnosis; serotyping; non-immunologic assay. 

OS Chlamydia trachomatis. 

PN US7324664-A. 

PD 29-AUG-1989. 

PF 17-MAR-1989; 324664. 

PR 17-MAR-1989; US-324664. 

PA (USSH ) US DEPT HEALTH & HUMAN. 

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG; 

DR WPI; 89-339697/46. 

DR N-PSDB; N97085. 



[Score histogram omitted. Horizontal axes: SCORE (0-12) and STDEV (0-8).] 



PARAMETERS 



Similarity matrix Unitary 


K-tuple 2 


Mismatch penalty 1 


Joining penalty 20 


Gap penalty 1.00 


Window size 5 


Gap size penalty 0.05 




Cutoff score 0 




Randomization group 0 




Initial scores to save 40 


Alignments to save 15 


Optimized scores to save 0 


Display context 50 


SEARCH STATISTICS 


Scores:  Mean   Median   Standard Deviation 
         1      3        1.21 

Times:   CPU           Total Elapsed 
         00:00:08.91   00:00:08.00 

Number of residues:            482836 
Number of sequences searched:  5543 
Number of scores above cutoff: 2815 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was found: 

Init. Opt. 

Sequence Name Description Length Score Score Sig. Frame 

1. R37452 Autotaxin peptide ATX 102. 12 12 12 9.06 0 

The list of other best scores is: 

Init. Opt. 

Sequence Name Description Length Score Score Sig. Frame 



PT amino sequences of the variable domains of the major outer 

PT membrane proteins 

PS Disclosure; Fig 15; 49pp; English. 

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis 

CC serovars and the amino acid sequences were deduced. The MOMP VDs 

CC with the greatest total hydrophilicity and charge values were found 

CC to be the location of antigenic determinants recognised by MOMP 

CC specific monoclonal antibodies. The nucleotide, amino acid 

CC sequences and hydrophilicity/charge value analyses will assist in 

CC the selection of appropriate MOMP antigenic determinants to be used 

CC in the construction of synthetic peptides, subunits or recombinant 

CC chlamydial vaccines. This will allow the prodn. or reagents and 

CC methodologies applicable in the development of new diagnostic tests 

CC for serotyping. 

SQ Sequence 22 AA; 

SQ 5 A; 1 R; 3 N; 3 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 0 H; 

SQ 0 I; 1 L; 1 K; 0 M; 0 F; 3 P; 0 S; 3 T; 0 W; 0 Y; 1 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:44-PDT using FindSeq 

Initial Score = 3 Optimized Score = 3 Significance = 3.17 
Residue Identity = 60% Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

                X   X 
                WHVAR 
                  ||| 
AAPTTNDAADLQNDPKTNVARP 
        10      X 20 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 

Results file us-08-249-182-10.res made by on Wed 21 Sep 94 12:05:15-PDT.



Query sequence being compared:US-08-249-182-10 (1-12) 
Number of sequences searched: 5543 
Number of scores above cutoff: 2815

Results of the initial comparison of US-08-249-182-10 (1-12) with:
1. US-08-249-182-10 (1-12)

R37452 Autotaxin peptide ATX 102.

ID R37452 standard; peptide; 12 AA.
AC R37452;
DT 22-JUL-1993 (first entry)
DE Autotaxin peptide ATX 102.
KW Cell motility stimulating; cancer metastasis; antibody; detection;
KW immunostains; disease outcome prediction; therapy choice;
KW cancer therapy; crosslinked toxins.
OS Synthetic.
PN US7822043-A.
PD 01-JAN-1993.
PF 17-JAN-1992; 822043.
PR 17-JAN-1992; US-822043.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.
PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;
DR WPI; 93-085861/10.
PT Motility stimulating protein named autotaxin - useful in cancer
PT diagnosis and therapy
PS Example; Page 33; 36pp; English.
CC The sequence is that of autotaxin peptide ATX 102. It may be used to

CC metastasis and in immunostains of patient samples to detect the
CC presence of autotaxin. The level of autotaxin in tissue or body
CC fluids can be used to predict disease outcomes and/or choice of
CC therapy which may also include autotaxin inhibitors. Autotaxin
CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer
CC therapy.
SQ Sequence 12 AA;
SQ 0 A; 1 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 1 H;
SQ 1 I; 2 L; 0 K; 0 M; 2 F; 0 P; 1 S; 1 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 12 Optimized Score = 12 Significance = 9.06
Residue Identity = 100% Matches = 12 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 

||||||||||||
DIEHLTSLDFFR 
X 10 X 
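
The "Residue Identity" figures in the comparison summaries are not defined in the printout. The values shown are consistent with the number of matches divided by the number of aligned positions (matches, mismatches, and gaps), truncated to a whole percent. The sketch below works under that assumption; both function names are hypothetical and are not part of FastDB.

```python
def residue_identity(matches: int, mismatches: int, gaps: int = 0) -> int:
    """Percent identity over all aligned positions, truncated to an integer.

    Assumption: gaps count as aligned positions in the denominator.
    """
    aligned = matches + mismatches + gaps
    if aligned == 0:
        raise ValueError("alignment has no positions")
    return (100 * matches) // aligned


def count_matches(query: str, subject: str) -> int:
    """Count identical residues in two equal-length aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Query ATX 102 aligned against the identical database entry R37452.
print(residue_identity(count_matches("DIEHLTSLDFFR", "DIEHLTSLDFFR"), 0))
# A hit with 5 matches and 7 mismatches, as in several summaries in this report.
print(residue_identity(5, 7))
```

Truncation rather than rounding is what makes 5 matches out of 12 positions come out as 41 and 4 out of 6 as 66.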



2. US-08-249-182-10 (1-12) 

R08086 Feline T-cell lymphotrophic lentivirus of clone R5

ID R08086 standard; protein; 57 AA.
AC R08086;
DT 26-FEB-1991 (first entry)
DE Feline T-cell lymphotrophic lentivirus of clone R5XCXL1.
KW Feline T-cell lymphotropic lentivirus; FIV; R5XCX11; antibodies;
KW vaccines.
OS Feline T-cell lymphotropic lentivirus 2428 (Petaluma).
PN WO9013573-A.
PD 15-NOV-1990.
PF 30-APR-1990; U02338.
PR 08-MAY-1989; US-348784.
PR 08-DEC-1989; US-447810.
PA (IDEX-) IDEXX CORP.
PI Anderson PR, Oconnor TP, Tonelli QJ;
DR WPI; 90-361429/48.
DR N-PSDB; Q06655.
PT Feline T-cell lympho-tropic lentivirus poly-peptide(s) - used for
PT specific detection of FIV antibodies, prodn. of antibodies and in
PT vaccines
PS Disclosure; Fig 5(c); 37pp; English.
CC The amino acid sequence shows homology with the envelope gene of
CC equine infectious anemia virus, a lentivirus, immunologically
CC closely related to FIV.
CC Strain R5X has been deposited ATCC 67939.
CC See also Q06653-55 and R08094-96.
SQ Sequence 57 AA;
SQ 1 A; 3 R; 5 N; 2 D; 0 B; 2 C; 2 Q; 2 E; 0 Z; 1 G; 1 H;
SQ 2 I; 6 L; 0 K; 0 M; 7 F; 3 P; 6 S; 3 T; 2 W; 0 Y; 7 V;
SQ 2 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:57:10-PDT using FindSeq

Initial Score = 6 Optimized Score = 6 Significance = 4.12
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



                                        X       10 X
                                        DIEHLTSLDFFR
                                            ||||  ||
ATNSOSCENVFSSXLPNGHNTVHEQVVFPLSIFWDLFXRVRNCVLTSLFPFRDLVIF
        10        20        30        40        50 X



3. US-08-249-182-10 (1-12) 

P82053 Outer membrane protein F of Pseudomonas aeruginosa

ID P82053 standard; protein; 350 AA.
AC P82053;
DT 22-OCT-1990 (first entry)
DE Outer membrane protein F of Pseudomonas aeruginosa.
KW Outer membrane protein F; OMPF; vaccination; antibodies.
OS Pseudomonas aeruginosa.
PN DE3718591-A.
PD 15-DEC-1987.
PF 03-JUN-1987; 718591.
PR 03-JUN-1987; DE-718591.
PA (BEHW) Behringwerke AG.
PI Domdey H, Lottspeich F, von Specht B-U, Duchene M;
DR WPI; 88-361619/51.
DR N-PSDB; N82023.
PT New outer membrane protein F of Pseudomonas aeruginosa -
PT DNA sequences encoding it and derived antibodies, useful for
PT vaccination and diagnosis.
PS Disclosure; p; German.
CC The protein is isolated from the OMP of P. aeruginosa serotype 6
CC (ATCC 33354) and purified by HPLC. The amino-terminal and trypsin
CC fragments are sequenced and a series of oligonucleotide probes
CC constructed corresponding to the established sequences. These probes are
CC used to screen a gene bank of 15-20 kb fragments of genomic DNA in
CC lambda EMBL 3. One positive clone includes a 15 kb insert contg. the
CC protein gene, which can be isolated as a 2.5 kb PstI fragment. This
CC fragment cannot be cloned into a high copy no. vector because of the
CC toxicity of the gene prod., so is subcloned as two fragments with an
CC overlapping region of about 500bp. Ab's are raised by usual immunisation
CC or cell-fusion procedures. The Ab's are useful in diagnosis.
SQ Sequence 350 AA;
SQ 37 A; 15 R; 26 N; 26 D; 0 B; 4 C; 11 Q; 21 E; 0 Z; 37 G; 6 H;
SQ 9 I; 19 L; 19 K; 7 M; 13 F; 10 P; 21 S; 18 T; 1 W; 16 Y; 34 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:00-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                      |||||

RYFTDSVRNMKNADLYGGSIGYFLTDDVELALSYGEYHDVRGTYETGNKKVHGNLTSLDAIYHFGTPGVGLR 
40 50 60 70 80 90 100 

PYVSAGLAHSNITNINSDSQGRSSMTHANIGAGLKYYFTE 
110 120 130 140 



4. US-08-249-182-10 (1-12) 
R41746 MN protein.

ID R41746 standard; Protein; 429 AA.
AC R41746;
DT 25-MAR-1994 (first entry)
DE MN protein.
KW MN; endogenous; MaTu; quasi-viral agent; human; mammary tumour; prion;
KW classical virus; slow virus; exogenous MX; p58X; cytoplasmic antigen;
KW conservative; HeLa cell; twin protein; p54/58N; cell surface; nucleus;
KW monoclonal antibody; MAb M75; neoplasm; pre-neoplastic disease;
KW vaccine.
OS Homo sapiens.
PN WO9318152-A.
PD 16-SEP-1993.
PF 08-MAR-1993; U02024.
PR 11-MAR-1992; CS-000709.
PR 21-OCT-1992; US-964589.
PA (CIBA ) CIBA CORNING DIAGNOSTICS CORP.
PA (VIRO-) INST VIROLOGY.
PI Pastorek J, Pastorekova S, Zavada J;
DR WPI; 93-303466/38.
DR N-PSDB; Q48456.
PT New MN gene and polypeptide(s) - used in diagnosis, prognosis and
PT therapy of neoplastic and/or pre-neoplastic disease
PS Claim 7; Fig i; 72pp; English.
CC This sequence is encoded by the intronless MN gene which is a cellular
CC gene which is the endogenous component of the MaTu agent. MaTu is a
CC novel quasi-viral agent with rather unusual properties. It is
CC presumably derived from a human mammary tumour. In some aspects it
CC resembles classical viruses, whereas in other respects it resembles
CC "slow" viruses (prions), and in still other aspects it is different
CC from both classes of viruses. MaTu is a two component system. One
CC part of the complex, exogenous MX, is transmissible, and is manifest
CC by a protein, p58X, which is a cytoplasmic antigen which reacts with
CC some natural sera, of humans and of various animals. The other
CC component, MN, is endogenous to human cells. MN is a cellular gene
CC showing very little homology with known DNA sequences. It is rather
CC conservative and present as a single copy in the chromosomal DNA of
CC various vertebrates. MN is manifest in HeLa cells by a twin protein
CC p54/58N, that is localised on the cell surface and in the nucleus.
CC Immunoblots using a monoclonal antibody reactive with p54/58N (MAb M75)
CC reveal two bands at 54 kD and 58 kD. These two bands may correspond to
CC one type of protein that differs by glycosylation pattern or by how it
CC is processed. The expression of the MN gene is strongly correlated
CC with tumourigenicity. MN products can be used in
CC diagnostic and/or prognostic assays for neoplastic and/or pre-
CC neoplastic disease. MN polypeptides, produced recombinantly by
CC unicellular hosts, can also be used for antibody production and in
CC vaccines for inducing protective immunity against neoplastic disease
CC and a dampening effect upon tumourigenic activity.
SQ Sequence 429 AA;
SQ 41 A; 43 R; 8 N; 17 D; 0 B; 8 C; 21 Q; 28 E; 0 Z; 33 G; 13 H;
SQ 13 I; 40 L; 10 K; 6 M; 11 F; 39 P; 40 S; 16 T; 13 W; 7 Y; 22 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:31-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.30
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                  ||  |   || |

VDEALGRPGGLAVLAPFHRRARKKTVSYEQLLSRLEEIAEEGSET8VPGLDISALLPSDFSRYF0YEGSLTT 
230 240 250 260 270 X 280 X 290 

PPCAQGVIWTVFNQTVMLSAK8LHTLSDTLHGPGDSRLQL 
300 310 320 330 



5. US-08-249-182-10 (1-12) 

R40856 43kd regression associated antigen.

ID R40856 standard; Protein; 453 AA.
AC R40856;
DT 07-MAR-1994 (first entry)
DE 43kd regression associated antigen.
KW Regression associated antigen; tumour; immunotherapy;



r\w drtLi -iwiuuyyiL dntiuuuiebr draiuuuiesr tunuur regression. 

OS Mycoplasma hyorhinis.
FH Key Location/Qualifiers
FT Misc_difference 80
FT /note= "Tryptophan encoded by TGA, normal in
FT Mycoplasma hyorhinis."
FT Misc_difference 124
FT /note= "Tryptophan encoded by TGA, normal in
FT Mycoplasma hyorhinis."
FT Misc_difference 165
FT /note= "Tryptophan encoded by TGA, normal in
FT Mycoplasma hyorhinis."
FT Misc_difference 344
FT /note= "Tryptophan encoded by TGA, normal in
FT Mycoplasma hyorhinis."
PN US5242823-A.
PD 07-SEP-1993.
PF 07-MAR-1986; 837494.
PR 07-MAR-1986; US-837494.
PR 16-SEP-1987; US-097910.
PR 11-DEC-1987; US-131815.
PR 04-JAN-1988; US-135923.
PR 16-MAR-1990; US-474730.
PR 02-OCT-1992; US-956546.
PA (ITGE-) INT GENETIC ENG INC.
PI Fareed GC, Ghosh-dastidar P, Jar-hou L, Sen A;
DR WPI; 93-295229/37.
DR N-PSDB; Q47816.
PT DNA encoding a regression-associated antigen from M. hyorhinis -
PT is used to obtain prods. for diagnosis, localisation and therapy
PT of tumours
PS Disclosure; Figure 3; 40pp; English.
CC Regression associated antigens (RAA's) are identified in material
CC from neoplastic cells by their immunological reactivity with
CC regression associated antibodies from the serum of patients
CC diagnosed as undergoing regression of a tumour. RAA's can be used
CC for tumour immunotherapy and for producing and purifying antibodies
CC which can be used for tumour diagnosis, localisation and therapy.
CC The antibodies can also be used for the production of
CC anti-idiotypic antibodies which can also be used in immunotherapy.
SQ Sequence 453 AA;
SQ 34 A; 6 R; 35 N; 33 D; 0 B; 1 C; 14 Q; 22 E; 0 Z; 30 G; 4 H;
SQ 35 I; 33 L; 49 K; 2 M; 30 F; 10 P; 32 S; 39 T; 4 W; 11 Y; 29 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:26-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                   ||  | ||

FDNSFVKDRQAEIEKAKNFDFNTVLLTAGGTVSDKSFN9SIHEAVLEHYDGIEKTTNLDRVS8ETNNSSELI 
40 50 60 70 80 90 100 110 

GKYKNFLNGNKNVWILTGFQQG9EFPKFLKQTDSNGKKYS 
120 130 140 150 



6. US-08-249-182-10 (1-12) 

R40916 Sequence of a CD26 fragment lacking a portion of t

ID R40916 standard; Protein; 593 AA.
AC R40916;
DT 05-FEB-1994 (first entry)
DE Sequence of a CD26 fragment lacking a portion of the carboxy



ul. v/ci mind i f cuiuii 

KW Human T cell activation antigen; monoclonal antibody Ta1; CD26.
OS Synthetic.
PN WO9316102-A.
PD 19-AUG-1993.
PF 09-APR-1992; U02892.
PR 06-FEB-1992; US-832211.
PA (DAND ) DANA FARBER CANCER INST INC.
PI Morimoto C, Schlossman SF, Tanaka T;
DR WPI; 93-272827/34.
PT Polypeptide fragments of CD26 - are capable of disrupting binding
PT of CD45 and CD26 and thus interfering with T-cell activation
PS Example; Pages 46-48; 73pp; English.
CC CD26 is a human T cell activation antigen originally identified by
CC its reactivity with the MAb Ta1. CD26 cDNA library was constructed
CC from human PHA-activated T cells using the CDM7 vector.
CC Fragments of CD26 can be prepd in the following manner.
CC CD26 XbaI-SphI cDNA fragment is ligated to the vector
CC RcSR-alpha-26 XbaI-HindIII DNA fragment and the linker Q46092.
CC The linker introduces an in-frame stop codon that results in the
CC deletion of the segment of CD26 from AA 594 to the carboxy
CC terminus of the wild-type protein. This deletion mutant, shown
CC in R40916, lacks the putative catalytic site of CD26 and has a new
CC carboxy terminus given in R40917.
SQ Sequence 593 AA;
SQ 28 A; 22 R; 34 N; 35 D; 0 B; 10 C; 21 Q; 31 E; 0 Z; 30 G; 11 H;
SQ 39 I; 54 L; 34 K; 9 M; 22 F; 25 P; 49 S; 41 T; 17 W; 46 Y; 35 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:24-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                   || |||

RFRPSEPHFTLDGNSFYKIISNEEGVRHICYF8IDKKDCTFITKGTWEVIGIEALTSDYLYYISNEYKGMPG 
360 370 380 390 400 X 410 420 

GRNLYKIQLSDYTKVTCLSCELNPERC8YYSVSFSKEAKY 
430 440 450 460 



7. US-08-249-182-10 (1-12) 

R40909 Sequence encoded by human CD26 cDNA.

ID R40909 standard; Protein; 766 AA.
AC R40909;
DT 05-FEB-1994 (first entry)
DE Sequence encoded by human CD26 cDNA.
KW Human T cell activation antigen; monoclonal antibody Ta1.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Region 7..28
FT /label= hydrophobic
FT Region 29..323
FT /label= N-terminal glycosylated region of
FT extracellular domain
FT /note= "8 sites for N-linked glycans"
FT Region 324..551
FT /label= Cysteine rich region of extracellular
FT domain
FT /note= "N-linked glycosylation site"
FT Region 552..766
FT /label= C-terminal region of extracellular
FT domain



n miuic- i n iirmeu y tyuu&y t dt lun Site « 1 

FT catalytic site 0 

FT Active_site 627..631
FT /label= active site of serine protease/esterase
FT /note= "fits the consensus sequence GXSXG"
PN WO9316102-A.
PD 19-AUG-1993.
PF 09-APR-1992; U02892.
PR 06-FEB-1992; US-832211.
PA (DAND ) DANA FARBER CANCER INST INC.
PI Morimoto C, Schlossman SF, Tanaka T;
DR WPI; 93-272827/34.
DR N-PSDB; Q46089.
PT Polypeptide fragments of CD26 - are capable of disrupting binding
PT of CD45 and CD26 and thus interfering with T-cell activation
PS Disclosure; pages 39-43; 73pp; English.
CC CD26 is a human T cell activation antigen originally identified by
CC its reactivity with the MAb Ta1. CD26 cDNA library was constructed
CC from human PHA-activated T cells using the CDM7 vector. The hydrophobic
CC N-terminal of the predicted CD26 polypeptide has the characteristics
CC of a signal sequence of the type II membrane protein, which is
CC reinforced by the observation that potential N-glycosylation sites
CC are located in the carboxy side of the hydrophobic core. Therefore
CC the N-terminal 6 AAs are predicted to be cytoplasmic, the next 22
CC AAs are predicted to transverse the cytoplasmic membrane, and the
CC 738 C-terminal AAs constitute the predicted extracellular domain.
SQ Sequence 766 AA;
SQ 40 A; 30 R; 40 N; 46 D; 0 B; 12 C; 30 Q; 40 E; 0 Z; 43 G; 19 H;
SQ 49 I; 62 L; 40 K; 15 M; 31 F; 29 P; 64 S; 50 T; 21 W; 56 Y; 49 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:24-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                   || |||

RFRPSEPHFTLDGNSFYKIISNEEGYRHICYFQIDKKDCTFITKGTWEVIGIEALTSDYLYYISNEYKGMPG 
360 370 380 390 400 X 410 420 

GRNLYKIQLSDYTKVTCLSCELNPERCQYYSVSFSKEAKY 
430 440 450 460 



8. US-08-249-182-10 (1-12) 

P50287 Sequence encoded by hepatitis A virus (HAV) cDNA f

ID P50287 standard; Protein; 854 AA.
AC P50287;
DT 30-NOV-1991 (first entry)
DE Sequence encoded by hepatitis A virus (HAV) cDNA from near the
DE genome 5' terminus to the end of the area corresponding to the
DE capsid protein region of poliovirus RNA.
KW Hepatitis A virus assay; antigen; antibody.
OS Hepatitis A virus.
PN WO8501517-A.
PD 11-APR-1985.
PF 27-SEP-1984; U01552.
PR 30-SEP-1983; US-537911.
PA (MASI ) MASSACHUSETTS INST TECH.
PI Ticehurst JR, Baltimore D, Feinstone SM, Purcell RH,
PI Racaniello VR;
DR WPI; 85-098846/16.
DR N-PSDB; N50330.
PT New hepatitis A virus CDNA - useful in assays for the virus and

PS Example; Fig 7; 60pp; English.
CC The inventors claim HAV cDNA and a method for producing it, whereby
CC large amts. can be obtd. economically. The cDNA is useful in the
CC assay for detection of HAV quickly and easily and with high
CC sensitivity and specificity. The HAV cDNA is also used in the prodn.
CC of HAV antigen or antibodies to it. The antibodies may be monoclonal.
SQ Sequence 854 AA;
SQ 50 A; 37 R; 36 N; 43 D; 0 B; 12 C; 38 Q; 45 E; 0 Z; 53 G; 22 H;
SQ 46 I; 70 L; 39 K; 21 M; 47 F; 45 P; 73 S; 70 T; 12 W; 34 Y; 61 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:41-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                  ||    |   ||

HRGDLVFDFQVFPTKYHSGRLLFCFVPGNELIDVSGITLK8ATTAPCAVHDITGV9STLRFRVPWISDTPYR 
360 370 380 390 400 X 410 420 

VNRYTKSAHQKGEYTAIGKLIVYCYNRLTSPSNVASHVRV 
430 440 450 460 



9. US-08-249-182-10 (1-12) 

P50231 Sequence encoded by partial sequence of hepatitis

ID P50231 standard; Protein; 993 AA.
AC P50231;
DT 28-NOV-1991 (first entry)
DE Sequence encoded by partial sequence of hepatitis A virus (HAV),
DE including surface protein (VP-1).
KW Hepatitis A virus vaccine; immunisation; monoclonal antibody;
KW diagnostic assay.
OS Hepatitis A virus.
FH Key Location/Qualifiers
FT Protein 628..993
FT /note= "claimed; X denotes translated stop codons
FT and unspecified triplets"
PN EP-138704-A.
PD 24-APR-1985.
PF 09-OCT-1984; 402025.
PR 14-OCT-1983; US-541836.
PR 02-MAR-1984; US-585942.
PA (MERI ) MERCK & CO INC.
PI Hughes JV, Scolnick EM, Tomassini JE;
DR WPI; 85-100818/17.
DR N-PSDB; N50274.
PT New hepatitis A virus surface protein - useful for binding to
PT neutralising antibodies to the virus
PS Disclosure; Page 17-23; 49pp; English.
CC VP1 is isolated by solubilisation of the intact virus in an aq.
CC anionic surfactant and a reducing agent. The viral proteins are sepd.
CC and the protein of molecular wt. 33000 daltons is sepd.
SQ Sequence 993 AA;
SQ 58 A; 48 R; 38 N; 48 D; 0 B; 16 C; 43 Q; 47 E; 0 Z; 63 G; 26 H;
SQ 53 I; 94 L; 43 K; 24 M; 50 F; 51 P; 81 S; 77 T; 14 W; 35 Y; 68 V;
SQ 16 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:57:41-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0



                                                  DIEHLTSLDFFR
                                                  ||    |   ||

MRGDLVFDFQVFPTKYHSGRLLFCFVPGNELIDVTGITLK9ATTAPCAVHDITGV9STLRFRVPHISDTPYR 
500 510 520 530 540 X 550 X 560 

VNRYTKSAHQKGEYTA IGKL I VYCYNRLTSPSNVASHVRV 
570 580 590 600 



10. US-08-249-182-10 (1-12) 

P98500 Partial sequence encoded by murine muscular dystro

ID P98500 standard; Protein; 1091 AA.
AC P98500;
DT 10-MAR-1993 (revised entry)
DE Partial sequence encoded by murine muscular dystrophy (MD) cDNA.
KW Dystrophin; muscular dystrophy; probe; antibody; diagnosis;
KW prenatal; heterozygote; gene therapy; genetic screening;
KW foetal screening.
OS Mus musculus.
PN WO8906286-A.
PD 13-JUL-1989.
PF 16-DEC-1988; U04504.
PR 22-DEC-1987; US-136618.
PA (CHIL-) CHILDRENS MED CENT.
PI Kunkel LM, Monaco A, Hoffman EP, Koenig M;
DR WPI; 89-220587/30.
DR N-PSDB; N97129.
PT Muscular dystrophy gene - used for prepn. of probes, dystrophin
PT polypeptide and antibodies for diagnosis and therapy of muscular
PT dystrophy
PS Example; Fig 7B; 68pp; English.
CC The inventors claim an MD probe comprising a purified ss NA SQ which
CC hybridises to at least a part of the MD gene; pure dystrophin (DS)
CC polypeptide, purified NA encoding DS and antibodies (Ab) to DS. The
CC probes are equal to or greater than 10b of one of 12 cDNA sequences
CC deposited as ATCC 58666-57677. The MD gene is human, or a murine Dmd
CC gene.
SQ Sequence 1091 AA;
SQ 63 A; 42 R; 43 N; 44 D; 0 B; 13 C; 93 Q; 138E; 0 Z; 30 G; 25 H;
SQ 44 I; 130L; 89 K; 32 M; 27 F; 27 P; 79 S; 72 T; 23 W; 18 Y; 59 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:10-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                    ||   |  ||
8NGFNYLSDTVKEMAKKAPSEIC9KYLSEFEEIEGHWKKLSSSLVESCSKLEEHHNKLRKFRIHIKTL9KHH
700 710 720 730 740 X 750 760 

AEVDVFLKEEHPALGDAEILKKQLKflCRLLVGDIOTIQPS 
770 780 790 800 



11. US-08-249-182-10 (1-12) 

R12141 Enteroviral polypeptide.

ID R12141 standard; Protein; 2185 AA.
AC R12141;
DT 05-AUG-1991 (first entry)
DE Enteroviral polypeptide.
KW Enteroviruses; monoclonal antibodies; myocarditis; myositis;
KW meningitis; encephalitis; pancreatitis; post viral fatigue.
OS Enterovirus sp.
PN DE3939200-A.
PD 29-MAY-1991.
PF 27-NOV-1989; 939200.
PR 27-NOV-1989; DE-939200.
PA (PLAC ) MAX PLANCK GES WISSENSCH.
PI Kandolf R;
DR WPI; 91-165150/23.
DR N-PSDB; Q11816.
PT New enteroviral polypeptide for raising group specific antibodies
PT - for detecting any type of enterovirus in blood or serum, and
PT new DNA encoding it
PS Claim 1; pages 14-15; 26pp; German.
CC This enteroviral polypeptide is used to raise poly- or monoclonal
CC antibodies (Abs). These are useful in assays for detecting entero-
CC virus specific antigens, as an indication of enteroviral disease.
CC All 70 serotypes of the enteroviral family can be detected.
CC Diseases such as myocarditis, myositis, meningitis, encephalitis
CC and pancreatitis can be diagnosed using the Abs.
SQ Sequence 2185 AA;
SQ 148A; 89 R; 107N; 100D; 0 B; 52 C; 92 Q; 117E; 0 Z; 155G; 49 H;
SQ 115I; 174L; 118K; 61 M; 96 F; 120P; 154S; 147T; 32 W; 93 Y; 166V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:20-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 41% Matches = 5 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
III II 

LttV I PFVPLDYCPGSTT Y VPI TVT I APMCAE YNGLRLAGHfiGLPTNNTPGSCflFLTSDDFQSPSAHPSYDVT 
300 310 320 330 340 X 350 X 360 

PEMR I PGEVKNLHE I AE VDSVVPVfiNVGEKVNSME AY6I P 
370 380 390 400 



12. US-08-249-182-10 (1-12) 

P30304 Sequence corresp. to a protein antigen or allergen

ID P30304 standard; Protein; 12 AA.
AC P30304;
DT 20-APR-1992 (first entry)
DE Sequence corresp. to a protein antigen or allergen with high local
DE average hydrophilicity
KW Synthetic vaccine; antigen; allergen; immunological response;
KW antibody.
PN EP-93851-A.
PD 16-NOV-1983.
PF 11-MAR-1983; 102392.
PR 09-JAN-1981; US-223558.
PR 12-JUN-1981; US-272855.
PR 15-MAR-1982; US-358150.
PR 28-JAN-1983; US-461802.
PR 16-DEC-1986; US-942562.
PA (NYBL-) NEW YORK BLOOD CENT.
PI Hopp TP;
DR WPI; 83-822049/47.
PT Synthetic vaccine - contains peptide residue coupled to higher
PT alkyl or alkenyl gps. and with 6 amino acids in residue
PS Claim 59; Page 48; 54pp; English.
CC The inventors claim a synthetic vaccine which comprises a peptide
CC residue coupled to an alkyl or alkenyl gp. having at least 12C, or
CC other lipophilic substance. The residue contains a sequence of 6 AAs
CC corresp. to the aa of such AAs in a protein antigen or allergen where
CC the greatest local average hydrophilicity is found. Pref. the AAs in
CC the peptide do not exceed 50 residues, and they esp. contain 12-18
CC residues. The alkyl or alkenyl gp. pref. contains 12-24C and it is
CC pref. coupled to the terminal amino gp. of the residue opt. via a CO
CC gp. If a lipophilic substance is used, it is pref. palmitic, stearic,
CC behenic, oleic or mycolic acid.
SQ Sequence 12 AA;
SQ 0 A; 1 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 0 G; 2 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 0 P; 1 S; 0 T; 0 W; 0 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:01-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.47
Residue Identity = 66% Matches = 4 Mismatches = 2
Gaps = 0 Conservative Substitutions = 0

      X    X  10
      DIEHLTSLDFFR
       ||||
HDFRSDEIEHLV
      X 10 X

13. US-08-249-182-10 (1-12) 

R41129 HCV peptide XXc-1 (aa 383-404; E2/NS1 N-terminal).

ID R41129 standard; peptide; 24 AA.
AC R41129;
DT 22-MAR-1994 (first entry)
DE HCV peptide XXc-1 (aa 383-404; E2/NS1 N-terminal).
KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;
KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;
KW epitope; antibody; biotin; diagnosis; detection; vaccine.
OS Synthetic.
FH Key Location/Qualifiers
FT Modified_site 1
FT /note= "the N-terminal comprises (A)-(B)-(X)-Y, where
FT B= biotin;
FT X= biotinylation cpd. incorporated
FT during synthesis;
FT Y= bond or linking gp(s). which
FT minimises steric hindrance;
FT where Y is not a bond it is pref. 1-10
FT residues of (same or different) glycine,
FT beta-alanine, 4-aminobutyric acid,
FT 5-aminovaleric acid or 6-aminohexanoic acid;
FT parenthesis around B and X indicate opt. presence
FT at the specified positions but B or X must be
FT present in at least one of the positions shown;
FT B interacts with the peptide to give a cpd.
FT with greater diagnostic sensitivity;
FT A (optional)= one or more amino acids, NH2 or
FT gp. which modifies the N-terminus;
FT Z= one or more amino acids, OH, NH2, or a
FT linkage involving either of these 2 gps."
FT Modified_site 24
FT /note= "the C-terminal comprises Y-(X)-Z"
PN WO9318054-A.
PD 16-SEP-1993.
PF 08-MAR-1993; E00517.
PR 06-MAR-1992; EP-400598.
PA (INNO-) INNOGENETICS NV SA.
PI De Leys R;
DR WPI; 93-303397/38.
PT New biotinylated peptide(s) corresp. to immuno-dominant
PT epitope(s) - with increased antigenicity, useful in antibodies



       X       10 X
       DIEHLTSLDFFR
       ||    ||
SQFGGILDILVSRSLKHRGQAFVI
        10        20



15. US-08-249-182-10 (1-12) 

R25061 Thyrotrope reacting peptide 1.

ID R25061 standard; peptide; 24 AA.
AC R25061;
DT 11-DEC-1992 (first entry)
DE Thyrotrope reacting peptide 1.
KW autoantibody; thyroid functional aberration; hypothyroidism.
OS Synthetic.
PN CA2050046-A.
PD 13-APR-1992.
PF 27-AUG-1991; 050046.
PR 12-OCT-1990; JP-274668.
PA (TANA ) TANABE SEIYAKU CO.
PI Mori H, Murakami M;
DR WPI; 92-217472/27.
PT New peptide(s) which react with thyrotrope antibody - used for
PT the diagnosis of diseases of thyroid functional aberration such
PT as hyperthyroidism
PS Claim 1; Page 15; 19pp; English.
CC The sequences given in R25061-2 are novel peptides which react
CC immunochemically with an antibody of thyrotrope. The antibodies
CC raised against these peptides and other sequences containing these
CC peptides can be used to isolate autoantibodies which lead to
CC diseases of thyroid functional aberration, such as neonatal
CC hypothyroidism or hyperthyroidism.
SQ Sequence 24 AA;
SQ 0 A; 2 R; 0 N; 2 D; 0 B; 1 C; 3 Q; 2 E; 0 Z; 0 G; 1 H;
SQ 2 I; 1 L; 1 K; 0 M; 1 F; 3 P; 2 S; 2 T; 0 W; 0 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:14-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.47
Residue Identity = 33% Matches = 4 Mismatches = 8
Gaps = 0 Conservative Substitutions = 0

           X       10 X
           DIEHLTSLDFFR
           ||    ||
HQEEDFRVTCKDIQRIPSLPPSTQ
        10 X      20  X

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-08-249-182-11.res made by on Wed 21 Sep 94 12:07:16-PDT.

Query sequence being compared:US-08-249-182-11 (1-23)
Number of sequences searched: 5543
Number of scores above cutoff: 3892

Results of the initial comparison of US-08-249-182-11 (1-23) with:
File : /home/shears/loring/lorin*.pep



PS Claim 4; Page 90-98; 133pp; English.
CC Peptide compsns. comprise at least one and pref. a combination of
CC two, three, four or more biotinylated peptides chosen from the
CC sequences given in R41058-R41166. The peptides represent
CC immunologically important regions of viral proteins and are
CC prepd. by solid phase peptide synthesis. The compsns. are
CC useful for the detection of antibodies to HCV, and/or HIV,
CC and/or HTLV-I or II.
SQ Sequence 24 AA;
SQ 0 A; 2 R; 0 N; 0 D; 0 B; 1 C; 1 Q; 0 E; 0 Z; 4 G; 2 H;
SQ 0 I; 2 L; 0 K; 0 M; 1 F; 0 P; 1 S; 5 T; 0 W; 0 Y; 3 V;
SQ 2 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:59:31-PDT using FindSeq

Initial Score = 4 Optimized Score = 5 Significance = 2.47
Residue Identity = 50% Matches = 6 Mismatches = 4
Gaps = 2 Conservative Substitutions = 0

             X       10 X
             DIEHLTSLDFFR
                 ||||  ||
XGHTRVTGGVQGHVTCTLTSL--FRX
        10   X    20    X



14. US-08-249-182-10 (1-12) 

R39589 Ribonucleoprotein snRNP-U1 protein A peptide.

ID R39589 standard; peptide; 24 AA.
AC R39589;
DT 15-NOV-1993 (first entry)
DE Ribonucleoprotein snRNP-U1 protein A peptide.
KW In vitro diagnosis; mixed connective tissue disease; MCTD;
KW autoantibodies kit; complex formation; vaccine.
OS Synthetic.
FH Key Location/Qualifiers
FT Peptide 4..16
FT /note= "preferred peptide"
PN FR2682113-A.
PD 09-APR-1993.
PF 25-JUN-1991; 007818.
PR 25-JUN-1991; FR-007818.
PA (CNRS ) CNRS CENT NAT RECH SCI.
PI Barakat S, Briand JP, Muller S, van Regenmortel MHV;
DR WPI; 93-215860/27.
PT Peptide(s) derived from ribonucleoprotein snRNP-U1 - for
PT diagnosing autoimmune diseases and treating e.g. connective
PT tissue disorders
PS Claim 1; Page 30; 37pp; French.
CC The peptide is derived from ribonucleoprotein snRNP-U1 and is useful
CC for in vitro diagnosis of mixed connective tissue disease (MCTD) by
CC formation of a complex with autoantibodies (AAb). It is specific for
CC AAb present in patients with MCTD and is not significantly homologous
CC with other antigens associated with autoimmune disease. It may also
CC be used in vaccines (usual dose 0.01-100mg/kg) while antibodies are
CC useful as specific immunological probes for MCTD and as agents for
CC raising anti-idiotypic antibodies which are also useful in diagnosis
CC and vaccines.
SQ Sequence 24 AA;
SQ 1 A; 2 R; 0 N; 1 D; 0 B; 0 C; 3 Q; 0 E; 0 Z; 2 G; 0 H;
SQ 3 I; 3 L; 1 K; 1 M; 2 F; 0 P; 3 S; 0 T; 0 W; 0 Y; 2 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:15-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.47
Residue Identity = 33% Matches = 4 Mismatches = 8



Number of sequences searched: 5543
Number of scores above cutoff: 3892

The scores below are sorted by initial score.
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found. 



The list of best scores is: 



                                                    Init.  Opt.
Sequence Name  Description                   Length Score Score   Sig. Frame

**** 14 standard deviations above mean ****
 1. R37453     Autotaxin peptide ATX 103.        23    22    22  14.17     0

**** 4 standard deviations above mean ****
 2. R04574     Derived amino acid sequence o    810     8     9   4.72     0
 3. P71672     Human serine protease.           262     7     7   4.05     0
 4. R33279     43 kD endoflagellum sheath pr    320     7     9   4.05     0
 5. R07130     H20B receptor.                   392     7     7   4.05     0
 6. R07131     H20A receptor.                   416     7     7   4.05     0
 7. R31041     srciR polypeptide.               655     7     7   4.05     0
 8. FRZE_MYXXA GLIDING MOTILITY REGULATORY P    777     7     7   4.05     0

**** 3 standard deviations above mean ****
 9. R27328     Peptide corresp. to an epitop     19     6     7   3.37     0
10. R37537     Methicillin-resistant S. aure     32     6     7   3.37     0
11. R37524     Methicillin-resistant S. aure     32     6     7   3.37     0
12. R27562     Insert B to prevent steric hi     34     6     7   3.37     0
13. R40101     Hib OMP P1-P2 hybrid peptide      54     6     7   3.37     0
14. R43686     Human kappa constant domain a    106     6     6   3.37     0
15. R41687     Undefined ORF1 encoded by pla    106     6     6   3.37     0
16. P81028     C region of L chain (chi) of     106     6     6   3.37     0
17. R11013     Feline immunoglobulin kappa c    109     6     6   3.37     0
18. R39464     MAb BU 2128 heavy chain V-reg    119     6     6   3.37     0
19. R38583     Antibody light chain.            129     6     6   3.37     0
20. R31531     HI223 MAb light chain.           130     6     6   3.37     0
21. P70624     Sequence encoded by anti-hepa    136     6     6   3.37     0
22. R37717     Mouse 4C10 anti-idiotype Ab h    138     6     6   3.37     0
23. R33953     gH1 variable domain.             139     6     6   3.37     0
24. R27049     VH425 antibody cloned into pU    140     6     7   3.37     0
25. R22564     Antibody specific for PB1.       178     6     6   3.37     0
26. R05274     Segment of human B cell diffe    185     6     7   3.37     0
27. R05275     Segment of human B cell diffe    185     6     7   3.37     0
28. P90436     Interferon-beta-2.               212     6     7   3.37     0
29. R43338     Completely humanised C4G1 Ig     214     6     6   3.37     0
30. R30776     H52L6-158 murine anti-CD18 an    214     6     6   3.37     0
31. R33312     Humanised HaEll Version 1 (in    218     6     6   3.37     0
32. R04494     HIV fusion protein PB1rf         229     6     7   3.37     0


33. 


R04496 


HIV fusion protein PBlsc. 


231 6 7 


3.37 


0 


34. 


R30777 


pH52-9.0 hunanised nurine ant 


233 6 6 


3.37 


0 


35. 


R22755 


Reshaped CD4 antibody light c 


233 6 6 


3.37 


0 


36. 


R22754 


Reshaped CAHPATH-1 antibody I 


233 6 6 


3.37 


0 


37. 


R20058 


Light chain of 3D6 anti-HIV a 


234 6 6 


3.37 


0 


38. 


R 13050 


CD4-specific CDR-grafted Ugh 


234 6 6 


3.37 


0 


39. 


R42065 


Hunan anti-HBs light chain. 


236 6 6 


3.37 


0 


40. 


R22565 


Vllys-HuCkappa region of Fab 


236 6 6 


3.37 


0 



1. US-08-249-182-11 (1-23) 

R37453 Autotaxin peptide ATX 103. 

ID R37453 standard; peptide; 23 AA. 
AC R37453; 



[Histogram: NUMBER OF SEQUENCES (log scale, 10 to 5000) versus SCORE (0 to 22) and STDEV (0 to 9).]



PARAMETERS

Similarity matrix          Unitary     K-tuple                 2
Mismatch penalty           1           Joining penalty         20
Gap penalty                1.00        Window size             5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40          Alignments to save      15
Optimized scores to save   0           Display context         50



SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           1       3         1.48

Times:     CPU             Total Elapsed
           00:00:03.93     00:00:09.00
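
The Sig. column of the list of best scores can be reproduced from the search statistics. The sketch below assumes, as the "standard deviations above mean" group headings suggest, that significance is the initial score minus the mean score, divided by the standard deviation. The function name is illustrative and is not part of FastDB, and the mean and standard deviation are used as printed, so the results agree with the listing only to within rounding.

```python
def significance(score, mean, stdev):
    """Return how many standard deviations `score` lies above `mean`."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


# Mean and standard deviation as printed under SEARCH STATISTICS.
MEAN = 1.0
STDEV = 1.48

if __name__ == "__main__":
    # Initial scores that occur in the list of best scores.
    for initial_score in (22, 8, 7, 6):
        print(initial_score, round(significance(initial_score, MEAN, STDEV), 2))
```

For initial scores of 22, 8, 7 and 6 this gives values within about 0.02 of the printed 14.17, 4.72, 4.05 and 3.37; the small differences are consistent with the mean being printed as a rounded whole number.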



DE Autotaxin peptide ATX 103. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M.

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 103. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 23 AA;

SQ 0 A; 1 R; 2 N; 2 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 2 G; 0 H;

SQ 1 I; 4 L; 0 K; 0 M; 1 F; 1 P; 1 S; 4 T; 0 W; 1 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 22 Optimized Score = 22 Significance = 14.17
Residue Identity = 95% Matches = 22 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0



X       10        20  X
TEFLSNYLTNVDDITLVPETLGR
|||||||||||||||||| ||||
TEFLSNYLTNVDDITLVPGTLGR
X       10        20  X
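
The per-alignment statistics can be checked by hand. The sketch below counts matches, mismatches and gap positions in a pair of aligned strings and derives a residue identity. It assumes that identity is matches divided by all aligned positions (matches + mismatches + gaps), truncated to a whole percent. That rule is inferred from the figures printed in this listing, not taken from FastDB documentation, and the function names are hypothetical.

```python
def alignment_counts(query, target, gap="-"):
    """Count matches, mismatches and gap positions in two aligned strings."""
    if len(query) != len(target):
        raise ValueError("aligned strings must have equal length")
    matches = mismatches = gaps = 0
    for q, t in zip(query, target):
        if q == gap or t == gap:
            gaps += 1
        elif q == t:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, gaps


def identity_percent(matches, mismatches, gaps=0):
    """Whole-percent identity over all aligned positions (truncated)."""
    aligned = matches + mismatches + gaps
    if aligned == 0:
        raise ValueError("no aligned positions")
    return 100 * matches // aligned


QUERY = "TEFLSNYLTNVDDITLVPETLGR"   # US-08-249-182-11 (1-23)
TARGET = "TEFLSNYLTNVDDITLVPGTLGR"  # R37453, autotaxin peptide ATX 103

if __name__ == "__main__":
    counts = alignment_counts(QUERY, TARGET)
    print(counts, identity_percent(*counts))
```

For alignment 1 this yields 22 matches, 1 mismatch, no gaps and an identity of 95, as printed. Under the same assumption, the figures printed for alignment 2 (10 matches, 12 mismatches, 1 gap) give 43, and those for alignment 9 (8, 9, 1) give 44.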



2. US-08-249-182-11 (1-23) 

R04574 Derived amino acid sequence of coding region of mu 

ID R04574 standard; protein; 810 AA. 

AC R04574; 

DT 17-SEP-1990 (first entry) 

DE Derived amino acid sequence of coding region of murine IL-4 receptor 

KW mammalian interleukin-4 receptor; cytokine; antibody production;

OS synthetic. 

FH Key Location/Qualifiers 

FT misc_feature 209..232

FT /label= putative transmembrane region

PN EP-367566-A.

PD 09-MAY-1990.

PF 31-OCT-1989; 311244.

PR 23-JUN-1989; US-370924.

PA (IMMU-) Immunex Corp.

PI Cosman DJ, Park L, Mosley B, Beckmann P, March CJ, Idzerda R,

DR WPI; 90-141470/19.

DR N-PSDB; Q04305.

PT Recombinant mammalian interleukin-4 receptor used in diagnosis,

PT assays and therapy and for prodn. of antibodies for diagnosis, therapy

PT and for prodn. of antibodies 

PS Disclosure; p; English. 

CC The interleukin-4 receptor can be used to regulate immune responses or to 

CC treat IgE-induced hypersensitivity. 



SQ Sequence 810 AA;

SQ 49 A; 23 R; 28 N; 34 D; 0 B; 32 C; 39 Q; 52 E; 0 Z; 63 G; 18 H;

SQ 29 I; 80 L; 30 K; 15 M; 25 F; 83 P; 89 S; 39 T; 16 W; 18 Y; 48 V;

CC Retrieved by shears on Wed 21 Sep 94 11:56:58-PDT using FindSeq

Initial Score = 8 Optimized Score = 9 Significance = 4.72
Residue Identity = 43% Matches = 10 Mismatches = 12

Gaps = 1 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II II II II II 

AFSSLLSSNAIRGDTAAAGTDDGHGGYKPFQNPVPNSSPSSVPLFTFGLDTE-LSPSPLNSDPPKSPPECLG 
610 620 630 640 650 X 660 670 

X 
R 

LELGLKGGDWVKAPPPADEVPKPFGDDLGFGIVYSSLTCHLCGHLKQHHSQ
X 680 690 700 710 720 



3. US-08-249-182-11 (1-23) 

P71672 Human serine protease.

ID P71672 standard; Protein; 262 AA.

AC P71672;

DT 10-JUN-1991 (first entry)

DE Human serine protease.

KW Serine protease; assay; antibodies; immunisation; HSP; diagnosis.

OS Homo sapiens.

FH Key Location/Qualifiers

FT Peptide 1..28

FT /label= sig_peptide

FT Protein 28

FT /label= nat_protein

PN EP-245051-A.

PD 11-NOV-1987.

PF 01-MAY-1987; 303945.

PR 06-MAY-1986; US-860085.

PR 08-MAY-1986; US-861221.

PR 31-DEC-1986; US-948248.

PA (STRD) Leland Stanford JR Univ.

PI Weissman IL, Gershenfeld HK.

DR WPI; 87-315213/45.

DR N-PSDB; N71407. 

PT New pure human serine protease and fragments - used as labels in

PT assays and for prodn. of antibodies for passive immunisation

PT against immune disorders.

PS Disclosure; Fig 1; 7pp; English.

CC Amino acid homology within the active enzyme to the mouse protease is

CC 71% with 77% at the DNA level. The overall homology is 72% when the

CC complete coding region and the 3' UTR are included. The amino acids

CC of the charge-relay system are His41, Asp86 and Ser184. The acidic

CC residue Asp178 determines substrate specificity for Lys or Arg.

CC Asn142 is a Asn-linked carbohydrate site.

CC The HSP is produced by activated killer cells. The enzyme acts in 

CC conjunction with other components of a killer cell to provide 

CC cytolytic capability. 

SQ Sequence 262 AA;

SQ 13 A; 15 R; 15 N; 13 D; 0 B; 11 C; 4 Q; 11 E; 0 Z; 22 G; 7 H;

SQ 18 I; 27 L; 19 K; 7 M; 5 F; 13 P; 18 S; 14 T; 3 W; 7 Y; 20 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:14-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 4.05

Gaps = 0 Conservative Substitutions = 0

X 10 20 X

TEFLSNYLTNVDDITLVPETLGR
II I I I II

MRNSYRFLASSLSVVVSLLLIPEDVCEKIIGGNEVTPHSRPYMVLLSLDRKTICAGALIAKDWVLTAAHCNL
X 10 20 X 30 40 50 60 70

NKRSQ



4. US-08-249-182-11 (1-23) 

R33279 43 kD endoflagellum sheath protein.



ID R33279 standard; Protein; 320 AA.

AC R33279;

DT 16-JUL-1993 (first entry)

DE 43 kD endoflagellum sheath protein.

KW Endoflagellum; sheath protein; T. hyodysenteriae; core; antibody;

KW bacteriacide; 43 kD; vaccine; infection; swine dysentery.

OS Treponema hyodysenteriae.

FH Key Location/Qualifiers

FT Peptide 1..19

FT /note= "Signal peptide"

FT Protein 20..320

FT /note= "Mature protein" 

PN EP-534526-A. 

PD 31-MAR-1993.

PF 14-SEP-1992; 202796.

PR 25-SEP-1991; EP-202478.

PR 24-JUL-1992; EP-202273.

PA (DUIN ) DUPHAR INT RES BV.

PI Koopman MBH, Kusters JG;

DR WPI; 93-102665/13.

DR N-PSDB; Q38583.

PT Vaccine to protect pigs against swine dysentery - comprises

PT Treponema hyodysenteriae endo-flagellum sheath protein, applied

PT orally or intranasally

PS Claim 2; Page 21-22; 34pp; English.

CC This sequence represents the endoflagellum sheath protein of T.

CC hyodysenteriae. The endoflagellum consists of at least four

CC proteins; this protein forms the sheath of the flagellum and three

CC proteins, of molecular weights 37, 34 and 32 kD, make up its core.

CC Antibodies raised against the sheath protein have been shown to be 

CC bacteriacidal for T. hyodysenteriae. The 43 kD sheath protein can 

CC be used in the production of a vaccine against infections such as 

CC swine dysentery.

SQ Sequence 320 AA;

SQ 28 A; 18 R; 23 N; 22 D; 0 B; 0 C; 11 Q; 25 E; 0 Z; 26 G; 1 H;

SQ 13 I; 26 L; 15 K; 4 M; 10 F; 7 P; 16 S; 20 T; 8 W; 17 Y; 30 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:52-PDT using FindSeq

Initial Score = 7 Optimized Score = 9 Significance = 4.05
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II II I II I I 

IDNVGEIKSISSWVYGRNYLISYFVNLQNEFGELKSYPHGTVYFNGMRQVRWENREYLPNVRDSVLVREPLY 
140 150 160 170 180 190 200 



X 
R 



PRKIPSVKLDSLGFYRTKDTKGGDFITYVKDVTLEYDVVVVDFEEDIDDEA 
210 220 230 240 250 
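
Each database entry in this listing is a flat-file record in which a two-letter line code (ID, AC, DT, DE, KW, OS, PN, and so on) is followed by the field text. The sketch below shows one way such a record could be grouped by line code for checking. The helper names are hypothetical, the sample lines are copied from entry R33279 above, and lines that do not start with a two-letter upper-case code (alignment rows, score lines) are simply skipped.

```python
def parse_record(lines):
    """Group flat-file lines by their two-letter line code."""
    fields = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # the printout is double-spaced
        code, _, value = line.partition(" ")
        if len(code) != 2 or not (code.isalpha() and code.isupper()):
            continue  # not a coded line
        fields.setdefault(code, []).append(value.strip())
    return fields


def sequence_length(fields):
    """Read the residue count from the end of the ID line ('... 320 AA.')."""
    tokens = fields["ID"][0].rstrip(".").split()
    if len(tokens) < 2 or tokens[-1] != "AA":
        raise ValueError("ID line does not end in '<n> AA.'")
    return int(tokens[-2])


SAMPLE = [
    "ID R33279 standard; Protein; 320 AA.",
    "",
    "AC R33279;",
    "",
    "DE 43 kD endoflagellum sheath protein.",
    "",
    "OS Treponema hyodysenteriae.",
    "",
    "PN EP-534526-A.",
    "",
    "Initial Score = 7 Optimized Score = 9 Significance = 4.05",
]

if __name__ == "__main__":
    record = parse_record(SAMPLE)
    print(record["AC"], sequence_length(record))
```

Repeated codes such as KW, PT or CC accumulate in order, so continuation lines can be joined with a space when the full text of a field is needed.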



5. US-08-249-182-11 (1-23) 
R07130 H20B receptor. 

ID R07130 standard; protein; 392 AA. 

AC R07130; 

DT 23-JAN-1991 (first entry) 

DE H20B receptor.

KW Picornavirus proteins; poliovirus; transgenic animals; vaccines;

KW antibodies; imaging.

FH Key Location/Qualifiers 

FT Peptide 1..20 

FT /label=signal peptide 

FT Domain 345..368

FT /label=transmembrane domain

PN WO9010699-A.

PD 20-SEP-1990.

PF 09-MAR-1990; U01320.

PR 10-MAR-1989; US-321957.

PA (UYCO-) COLUMBIA UNIV NY.

PI Racaniello V, Mendelsohn C, Costantini F;

DR WPI; 90-305023/40. 

DR N-PSDB; Q06069. 

PT DNA encoding picornavirus partic. poliovirus receptors proteins - 

PT for treating picornavirus infections or for expression in 

PT transgenic animals used to test vaccines 

PS Disclosure; fig 4; 88pp; English. 

CC This poliovirus receptor, H20B, has a sequence differing from the

CC receptor, H20A at the cytoplasmic tail only. Antibodies (Abs)

CC raised against it are useful for targetted delivery of the human

CC poliovirus, conjugated to a drug. Transgenic animals contg. the

CC corresp. DNA (genomic- or cDNA) can be used to test the efficiency

CC and virulence of picornavirus vaccines.

CC See also Q06070.

SQ Sequence 392 AA;

SQ 26 A; 17 R; 17 N; 9 D; 0 B; 10 C; 20 Q; 20 E; 0 Z; 30 G; 10 H;

SQ 13 I; 42 L; 9 K; 9 M; 12 F; 31 P; 31 S; 28 T; 10 W; 10 Y; 38 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:08-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 4.05
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20

TEFLSNYLTNVDDITLVPETLG
III I III

AKPQNTAEVQKVQLTGEPVPMARCVSTGGRPPAQITWHSDLGGMPNTSQVPGFLSGTVTVTSLWILVPSSQV
150 160 170 180 190 X 200 210

X
R

DGKNVTCKVEHESFEKPQLLTVNLTVYYPPEVSISGYDNNWYLGQNEATLT
X 220 230 240 250 260



6. US-08-249-182-11 (1-23) 
R07131 H20A receptor. 

ID R07131 standard; protein; 416 AA. 
AC R07131; 

DT 23-JAN-1991 (first entry) 



KW Picornavirus proteins; poliovirus; transgenic animals; vaccines; 

KW antibodies; imaging.

FH Key Location/Qualifiers 

FT Peptide 1..20 

FT /label=signal peptide 

FT Domain 345..368

FT /label=transmembrane domain

PN WO9010699-A.

PD 20-SEP-1990.

PF 09-MAR-1990; U01320.

PR 10-MAR-1989; US-321957.

PA (UYCO-) COLUMBIA UNIV NY.

PI Racaniello V, Mendelsohn C, Costantini F;

DR WPI; 90-305023/40.

DR N-PSDB; Q06070.

PT DNA encoding picornavirus partic. poliovirus receptors proteins - 

PT for treating picornavirus infections or for expression in 

PT transgenic animals used to test vaccines 

PS Disclosure; fig 4; 88pp; English. 

CC This poliovirus receptor, H20A, has a sequence differing from the

CC receptor, H20B at the cytoplasmic tail only. Antibodies (Abs)

CC raised against it are useful for targetted delivery of the human

CC poliovirus, conjugated to a drug. Transgenic animals contg. the

CC corresp. DNA (genomic- or cDNA) can be used to test the efficiency

CC and virulence of picornavirus vaccines.

CC See also Q06069.

SQ Sequence 416 AA;

SQ 30 A; 18 R; 18 N; 10 D; 0 B; 9 C; 21 Q; 22 E; 0 Z; 32 G; 10 H;

SQ 13 I; 42 L; 9 K; 9 M; 12 F; 32 P; 37 S; 31 T; 10 W; 11 Y; 40 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:08-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 4.05
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20

TEFLSNYLTNVDDITLVPETLG
III I III

AKPQNTAEVQKVQLTGEPVPMARCVSTGGRPPAQITWHSDLGGMPNTSQVPGFLSGTVTVTSLWILVPSSQV
150 160 170 180 190 X 200 210

X
R

DGKNVTCKVEHESFEKPQLLTVNLTVYYPPEVSISGYDNNWYLGQNEATLT
X 220 230 240 250 260



7. US-08-249-182-11 (1-23) 

R31041 srmR polypeptide. 

ID R31041 standard; Protein; 655 AA. 

AC R31041; 

DT 24-MAY-1993 (first entry) 

DE srmR polypeptide.

KW Regulatory; activator; protein; srmR; complementation experiment;

KW macrolide; biosynthetic; gene; transcription; S. ambofaciens;

KW biosynthesis; antibody; polyclonal; monoclonal; pathway.

OS Streptomyces ambofaciens. 

PN EP-524832-A. 

PD 27-JAN-1993. 

PF 24-JUL-1992; 306792. 

PR 26-JUL-1991; US-736178.

PA (ELIL ) LILLY & CO ELI.

PI Rao RN, Turner JR;



DR WPI; [illegible].

DR N-PSDB; Q35141.

PT DNA encoding regulatory activator protein srmR - for macrolide

PT biosynthesis to increase efficiency of antibiotic prodn.

PS Claim 5; Page 19-20; 22pp; English.

CC The sequence given represents regulatory (activator) protein srmR.

CC The DNA encoding this peptide has the ability in complementation

CC experiments to restore macrolide biosynthetic gene transcription

CC in mutants having defective srmR genes due to insertional

CC inactivation of that region of the S. ambofaciens genome. The srmR

CC gene product activates macrolide biosynthetic gene transcription and

CC may be used to increase the efficiency of macrolide biosynthesis.

CC The translation product of this gene is useful for the generation of

CC antibodies (polyclonal or monoclonal) which are useful in the

CC detection of other macrolide biosynthetic pathways.

SQ Sequence 655 AA;

SQ 84 A; 69 R; 8 N; 48 D; 0 B; 8 C; 14 Q; 45 E; 0 Z; 56 G; 16 H;

SQ 13 I; 86 L; 6 K; 10 M; 13 F; 37 P; 38 S; 37 T; 3 W; 12 Y; 52 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:38-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 4.05
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20

TEFLSNYLTNVDDITLVPETLG
II I I I II

PLGRPLAEYGTLRPVAPTELRAACRRAAETGRPTSVAPGVWTVPLLPGGNAGFLLTDLGPDADHTAVPLLPH
310 320 330 340 350 X 360 370

X
R

VARTLALHLRVQHDDSPKAQSHQEFFDDLIGAPRSPTLLRERALMFSHSFR
380 390 400 410 420 



8. US-08-249-182-11 (1-23) 

FRZE_MYXXA GLIDING MOTILITY REGULATORY PROTEIN (EC 2.7.1.-).

ID FRZE_MYXXA STANDARD; PRT; 777 AA.

AC P18769;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)

DT 01-NOV-1991 (REL. 20, LAST ANNOTATION UPDATE)

DE GLIDING MOTILITY REGULATORY PROTEIN (EC 2.7.1.-).

GN FRZE.

OS MYXOCOCCUS XANTHUS.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; MYXOBACTERALES;

OC MYXOCOCCACEAE.

RN [1] 

RP SEQUENCE FROM N.A. 

RM 90332690 

RA MCCLEARY W.R., ZUSMAN D.R.; 

RL PROC. NATL. ACAD. SCI. U.S.A. 87:5898-5902(1990). 

RN [2] 

RP PHOSPHORYLATION AT ASN-49. 

RM 91072208 

RA MCCLEARY W.R., ZUSMAN D.R.; 

RL J. BACTERIOL. 172:6661-6668(1990). 

CC -!- FUNCTION: FRZE IS INVOLVED IN A SENSORY TRANSDUCTION PATHWAY THAT

CC CONTROLS THE FREQUENCY AT WHICH CELLS REVERSE THEIR GLIDING

CC DIRECTION. FRZE SEEMS TO BE CAPABLE OF AUTOPHOSPHORYLATING ITSELF

CC ON AN HISTIDINE RESIDUE AND THEN TO TRANSFER THAT GROUP TO AN

CC ASPARTATE RESIDUE IN THE C-TERMINAL PART OF THE PROTEIN. 

CC -!- SIMILARITY: TO OTHER PROKARYOTIC REGULATORY PROTEINS WHICH BELONG 



CC TO THE TWO-COMPONENT REGULATORY SYSTEMS AND TRANSDUCE ENVIRONMENTAL

CC SIGNALS TO TRANSCRIPTIONAL APPARATUS. 

CC -!- SIMILARITY: FRZE IS SIMILAR TO BOTH CHEA AND CHEY.

DR EMBL; M35192; MXFRZE.

DR PIR; A35966; A35966.

KW SENSORY TRANSDUCTION; TRANSFERASE; KINASE; PHOSPHORYLATION.

FT MOD_RES 49 49 PHOSPHORYLATION (AUTO-) (BY SIMILARITY).

FT DOMAIN 130 197 ALA/PRO-RICH (POSSIBLE HINGE REGION).

SQ SEQUENCE 777 AA; 83189 MW; 2825541 CN;

CC -!- Retrieved by shears on Wed 21 Sep 94 11:59:19-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 4.05
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20

TEFLSNYLTNVDDITLVPETLG
II I IIII

VEDDGRGIDPERLRQAAISKRLINAVQAAALSEREAIELIFRPGFSTRDQVSELSGRGVGMDVVKRKVETLG
420 430 440 450 460 X 470 480

X
R

GSVGVSSRIGRGSTITLRLPSSLALMKVLLVRLGDDVYGMPAADVEAVHRV
490 500 510 520 530 



9. US-08-249-182-11 (1-23)

R27328 Peptide corresp. to an epitope of HIV-1 gp120 prot

ID R27328 standard; peptide; 19 AA.

AC R27328;

DT 27-OCT-1992 (first entry)

DE Peptide corresp. to an epitope of HIV-1 gp120 protein.

KW Human immunodeficiency virus; vaccine; env gene; HTLV-IIIB;

KW immunisation; AIDS; antibodies.

OS Human immunodeficiency virus.

FH Key Location/Qualifiers

FT Modified_site 1

FT /note= "if present is an amino acid selected to

FT facilitate coupling of the peptide to a

FT carrier."

FT Misc_difference 19

FT /note= "may be absent"

FT Modified_site 19

FT /note= "may be amidated"

PN WO9205800-A.

PD 16-APR-1992. 

PF 25-SEP-1991; SE0641. 

PR 27-SEP-1990; US-589422. 

PA (SYNT-) SYNTELLO VACCINE DE. 

PI Horal P, Jeansson S, Rymo L, Svennerholm B, Vahlne A;

DR WPI; 92-150590/18.

PT New peptide(s) corresp. to epitope(s) of HIV-1 gp120 protein -

PT used in vaccination and induction of neutralising antibodies 

PT against HIV 

PS Disclosure; Page 16; 48pp; English. 

CC The peptide gp120-32 corresponds to an epitope of the gp120

CC protein from amino acids 400-417 encoded by the env gene of HIV-1.

CC The peptide may be synthesised by standard solid phase techniques

CC and is useful in a vaccine to immunise against HIV infection.

CC Polyclonal and monoclonal antibodies are elicited by the peptide,

CC which can thus heighten immune response in subjects already infected

CC by HIV. The peptide may be covalently attached to similar peptides

CC or to a carrier for use in vaccines. 



SQ Sequence 19 AA;

SQ 0 A; 0 R; 2 N; 1 D; 0 B; 1 C; 0 Q; 2 E; 0 Z; 2 G; 0 H;

SQ 1 I; 1 L; 0 K; 0 M; 0 F; 1 P; 3 S; 4 T; 0 W; 0 Y; 0 V;

SQ 1 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:58:04-PDT using FindSeq

Initial Score = 6 Optimized Score = 7 Significance = 3.37

Residue Identity = 44% Matches = 8 Mismatches = 9

Gaps = 1 Conservative Substitutions = 0



X 10 20

TEFLSNYLTNVDDITLVPETLGR

II II I III
XSTE-GSNNTEGSDTITLPC
X 10 X 



10. US-08-249-182-11 (1-23) 

R37537 Methicillin-resistant S. aureus detection peptide.

ID R37537 standard; peptide; 32 AA. 

AC R37537; 

DT 08-SEP-1993 (first entry) 

DE Methicillin-resistant S. aureus detection peptide. 

KW MRSA; anti-MRSA antibody; competitive reaction; dermatitis;

KW pneumonia; organ failure; treatment; prophylaxis.

OS Synthetic.

PN DE4238806-A.

PD 27-MAY-1993.

PF 17-NOV-1992; 238806.

PR 18-NOV-1991; JP-354087.

PR 10-APR-1992; JP-134097. 

PA (DAIN-) DAINABOT CO LTD. 

PI Nagata M, Saito M, Sekiguchi K, Yajima R.

DR WPI; 93-176824/22.

PT Methicillin-resistant Staphylococcus aureus (MRSA) detection

PT method - by detection of MRSA penicillin binding protein (PBP)2

PT antigen-antibody complex, and synthetic MRSA PBP2 peptides

PS Claim 1; Page 24; 36pp; German.

CC The sequence is that of a peptide which may be used in a detection

CC method for methicillin resistant Staphylococcus aureus (MRSA) or its

CC fragments in a sample. The method comprises the competitive reaction

CC of the peptide with an anti-MRSA antibody present in the sample,

CC forming an immunological complex, and detection of the complex.

CC The synthetically produced peptide provides a simple and accurate

CC MRSA-test. S. aureus causes dermatitis, pneumonia and organ failure;

CC detection of MRSA (and therefore MRSA infections) is essential for 

CC appropriate prophylaxis and treatment of the condition. 

SQ Sequence 32 AA;

SQ 1 A; 0 R; 2 N; 1 D; 0 B; 1 C; 2 Q; 2 E; 0 Z; 1 G; 0 H;

SQ 2 I; 4 L; 5 K; 0 M; 1 F; 2 P; 2 S; 5 T; 0 W; 1 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:09-PDT using FindSeq

Initial Score = 6 Optimized Score = 7 Significance = 3.37
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0



X 10 20 X

TEFLSNYLTNVDDITLVPETLGR

II II II I
YNKLTEDKKEPLLNKFQITTSPGSTQKILTAC
X 10 20 X 30 
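
The two composition lines of each SQ block give residue counts in the fixed column order A R N D B C Q E Z G H, then I L K M F P S T W Y V. Where the whole database sequence appears in the alignment, those counts can be recomputed as a check on the transcription. The sketch below does this for R37537; the function name is illustrative, and the "Others" bucket for residues outside the 22 listed codes is an assumption based on the "1 Others" line printed for R27328.

```python
from collections import Counter

# Column order used by the two composition lines of an SQ block.
COLUMN_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence):
    """Count residues in SQ column order, plus a bucket for anything else."""
    counts = Counter(sequence.upper())
    table = {aa: counts.get(aa, 0) for aa in COLUMN_ORDER}
    table["Others"] = sum(n for aa, n in counts.items() if aa not in COLUMN_ORDER)
    return table


# Database sequence of R37537 as it appears in alignment 10.
R37537 = "YNKLTEDKKEPLLNKFQITTSPGSTQKILTAC"

if __name__ == "__main__":
    comp = composition(R37537)
    print("; ".join(f"{n} {aa}" for aa, n in comp.items()))
```

For R37537 the recomputed counts (for example 5 K, 5 T, 4 L, 2 Q, 32 residues in total) agree with the composition lines of the entry.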



11. US-08-249-182-11 (1-23) 



ID R37524 standard; peptide; 32 AA. 

AC R37524; 

DT 08-SEP-1993 (first entry) 

DE Methicillin-resistant S. aureus detection peptide.

KW MRSA; anti-MRSA antibody; competitive reaction; dermatitis;

KW pneumonia; organ failure; treatment; prophylaxis.

OS Synthetic. 

PN DE4238806-A. 

PD 27-MAY-1993. 

PF 17-NOV-1992; 238806.

PR 18-NOV-1991; JP-354087.

PR 10-APR-1992; JP-134097.

PA (DAIN-) DAINABOT CO LTD.

PI Nagata M, Saito M, Sekiguchi K, Yajima R.

DR WPI; 93-176824/22.

PT Methicillin-resistant Staphylococcus aureus (MRSA) detection

PT method - by detection of MRSA penicillin binding protein (PBP)2

PT antigen-antibody complex, and synthetic MRSA PBP2 peptides

PS Disclosure; Page 20; 36pp; German. 

CC The sequence is that of a peptide which may be used in a detection

CC method for methicillin resistant Staphylococcus aureus (MRSA) or its

CC fragments in a sample. The method comprises the competitive reaction

CC of the peptide with an anti-MRSA antibody present in the sample,

CC forming an immunological complex, and detection of the complex.

CC The synthetically produced peptide provides a simple and accurate

CC MRSA-test. S. aureus causes dermatitis, pneumonia and organ failure;

CC detection of MRSA (and therefore MRSA infections) is essential for 

CC appropriate prophylaxis and treatment of the condition. 

SQ Sequence 32 AA;

SQ 1 A; 0 R; 2 N; 1 D; 0 B; 0 C; 2 Q; 2 E; 0 Z; 1 G; 0 H;

SQ 2 I; 4 L; 5 K; 1 M; 1 F; 2 P; 2 S; 5 T; 0 W; 1 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:09-PDT using FindSeq

Initial Score = 6 Optimized Score = 7 Significance = 3.37
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20 X

TEFLSNYLTNVDDITLVPETLGR

II I I II I
YNKLTEDKKEPLLNKFQITTSPGSTQKILTAM
X 10 20 X 30 



12. US-08-249-182-11 (1-23) 

R27562 Insert B to prevent steric hindrance & competition 

ID R27562 standard; Protein; 34 AA. 

AC R27562;

DT 26-FEB-1993 (first entry)

DE Insert B to prevent steric hindrance & competition with E. coli coat.

KW Dicistronic expression vector; fusion PCR; antibody; cDNA library;

KW ss.

OS Synthetic.

PN WO9215678-A.

PD 17-SEP-1992.

PF 27-FEB-1992; U01475.

PR 01-MAR-1991; US-663442.

PA (STRA-) STRATAGENE.

PI Sorge JA;

DR WPI; 92-331724/40.

PT Prodn. of dicistronic DNA library used to make antibodies, etc. -

PT includes forming 1st and 2nd PCR admixtures, subjecting them to

PT PCR thermo-cycles, sepg. double stranded DNA, hybridising, etc.



PS Disclosure; Page [illegible]; [illegible]pp; English.

CC This peptide linker sequence is used to increase the distance of an 

CC expressed IgG polypeptide from the surface membrane of E. coli, which

CC results in decreased steric hindrance and competition of the preselected

CC polypeptide with the lipopolysaccharide coat of E. coli. The IgG is

CC expressed by a vector produced by a novel form of fusion PCR which

CC enables fusion of heavy and light chains prior to vector ligation,

CC avoiding the cumbersome separate cloning of fragments. This linker

CC sequence moves the SpeI site, retains original IgG1 upper hinge region,

CC retains original lamB sequence.

SQ Sequence 34 AA;

SQ 2 A; 0 R; 0 N; 2 D; 0 B; 1 C; 0 Q; 2 E; 0 Z; 0 G; 2 H;

SQ 0 I; 2 L; 5 K; 0 M; 1 F; 6 P; 5 S; 5 T; 0 W; 1 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:26-PDT using FindSeq

Initial Score = 6 Optimized Score = 7 Significance = 3.37
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0 

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

II I I II I 

PKSCDKTHTEPKSTDKTHTSPPAPAPELLKSSFY 
10 20 30 



13. US-08-249-182-11 (1-23) 

R40101 Hib OMP P1-P2 hybrid peptide CP2-1P13.

ID R40101 standard; peptide; 54 AA.

AC R40101;

DT 04-FEB-1994 (first entry)

DE Hib OMP P1-P2 hybrid peptide CP2-1P13.

KW Haemophilus influenzae; type b; Hib; outer membrane protein; P1; P2;

KW P6; vaccine; antibody; detection; lipoglycopeptide conjugate;

KW immunogen.

OS Synthetic.

FH Key Location/Qualifiers

FT Region 1..16

FT /label= C-P1

FT Region 17..53

FT /label= CHIBP2

PN WO9315205-A.

PD 05-AUG-1993.

PF 03-FEB-1993; CA0041.

PR 03-FEB-1992; GB-002219.

PA (CONN-) CONNAUGHT LAB LTD.

PI Chong P, Kandil A, Klein MH, Sia C;

DR WPI; 93-258681/32. 

PT Synthetic Haemophilus influenzae conjugate vaccine - comprising 

PT T-helper cell determinants and B-cell epitope(s) linked to

PT synthetic oligosaccharide(s)

PS Table 11; Page 59; 99pp; English.

CC The sequences given in R40053-101 are peptide fragments derived from

CC the Haemophilus influenzae type b (Hib) outer membrane proteins P1,

CC P2 and P6. These peptides may be used in a vaccine against Hib

CC infection and antibodies against these peptides may be used in test

CC kits to detect H. influenzae in a sample. The vaccine may further

CC comprise a immunogenic or immunostimulatory molecule or the peptides

CC may be modified with lipids, or linked to synthetic PRP as synthetic

CC lipoglycopeptide conjugates to produce alternative vaccines.

SQ Sequence 54 AA;

SQ 4 A; 3 R; 5 N; 1 D; 0 B; 0 C; 1 Q; 3 E; 0 Z; 4 G; 1 H;

SQ 1 I; 5 L; 6 K; 0 M; 2 F; 0 P; 2 S; 11 T; 0 W; 3 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:23-PDT using FindSeq



Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0

X 10 20 X

TEFLSNYLTNVDDITLVPETLGR

II I I II I

ARTRTTETGKGVKTEKFKEVKTIGDKRTLTLNTTANYTSQAHANLYGLNLNYSF
10 X 20 30 X 40 50 



14. US-08-249-182-11 (1-23) 

R43686 Human kappa constant domain as encoded by pHCMV-KR

ID R43686 standard; Protein; 106 AA.

AC R43686;

DT 25-MAY-1994 (first entry)

DE Human kappa constant domain as encoded by pHCMV-KR.

KW Human; immunoglobulin; constant; region; humanised; P-selectin; light;

KW blocking; antibody; heavy; chain; variable; murine; thrombotic disease;

KW monoclonal; PB1.3; CDR; complementarity determining region; leukocyte;

KW expression vector; coexpression; pHCMV-1748RHA-gammalCi-dhfr; epitope;

KW pHCMV-1748RLA-KR-neo; PB1.3/Humanised version A; vascular endothelium;

KW pHCMV-1747CH-gammaCi-neo; pHCMV-1747-CL-KR-neo; PB1.3 chimera;

KW acute lung injury; ischaemia reperfusion injury; inflammation.

OS Homo sapiens.

PN WO9321956-A.

PD 11-NOV-1993.

PF 04-MAY-1993; U04274.

PR 05-MAY-1992; US-880196.

PA (CYTE-) CYTEL CORP.

PI Chestnut RW, Paulson JC, Polley MJ;

DR WPI; 93-368423/46.

DR N-PSDB; Q51548.

PT Anti-P-selectin antibody for ischaemia acute lung injury treatment -

PT useful to treat inflammation and pathological conditions of

PT intercellular adhesion by competitive inhibition assays

PS Example 10; Fig 10; 82pp; English.

CC The sequences given in R43685-86 represent human immunoglobulin 

CC constant regions which were used in the production of the humanised 

CC P-selectin blocking antibody, along with the heavy and light chain

CC variable region coding sequences of the murine monoclonal antibody

CC PB1.3, given in R43687-88. The CDRs from PB1.3 heavy and light

CC chains were substituted for the CDRs of human heavy and light chains.

CC The humanised variable regions were inserted into expression vectors.

CC By coexpression of appropriate combinations of heavy and light

CC chains, several humanised antibodies can be expressed. Coexpression

CC of pHCMV-1748RHA-gammalCi-dhfr and pHCMV-1748RLA-KR-neo gives rise

CC to the PB1.3/Humanised version A. Coexpression of pHCMV-1747CH-

CC gammaCi-neo and pHCMV-1747-CL-KR-neo gives rise to the PB1.3 chimera.

CC These humanised antibodies selectively bind epitopes on P-selectin and

CC block adhesion of leukocytes to the vascular endothelium. They may be

CC used to treat inflammatory and thrombotic diseases and other

CC pathological conditions involving P-selectin and antibodies to it, esp.

CC acute lung injury and ischaemia reperfusion injury.

SQ Sequence 106 AA;

SQ 7 A; 2 R; 5 N; 5 D; 0 B; 3 C; 6 Q; 7 E; 0 Z; 4 G; 2 H;

SQ 1 I; 8 L; 8 K; 0 M; 4 F; 5 P; 16 S; 8 T; 1 W; 4 Y; 10 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:42-PDT using FindSeq

Initial Score = 6 Optimized Score = 6 Significance = 3.37
Residue Identity = 26% Matches = 6 Mismatches = 17

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG 



SVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSKAD
10 20 30 40 50 X 60 70

X
R

YEKHKVYACEVTHQGLSSPVTKSFNRGEC
80 90 100 



15. US-08-249-182-11 (1-23) 

R41687 Undefined ORF1 encoded by plasmid pAH4611.

ID R41687 standard; Protein; 106 AA.

AC R41687;

DT 20-OCT-1993 (first entry)

DE Undefined ORF1 encoded by plasmid pAH4611.

KW Polymerase chain reaction; primer; PCR; amplify; murine; heavy; light;

KW chain; variable; constant; region; anti-human; transferrin; receptor;

KW antibody; brain; capillary; endothelial cell; conjugate; epilepsy;

KW neuropharmaceutical; diagnostic; agent; tumour; AIDS; stroke;

KW Parkinsons disease; Alzheimers disease.

OS Synthetic.

PN WO9310819-A.

PD 10-JUN-1993.

PF 24-NOV-1992; U10206.

PR 26-NOV-1991; US-800458.

PA (ALKE-) ALKERMES INC.

PI Friden PM;

DR WPI; 93-196742/24.

DR N-PSDB; Q43845.

PT Antibody conjugates specific for transferrin receptor - used

PT for diagnosis and treatment of cancer, AIDS and neurological

PT disorders 

PS Disclosure; Fig 13H; 151pp; English. 

CC The sequences given in R41686-87 represent proteins encoded by the 

CC expression vector pAH4611. This vector uas produced fron the plasnid 

CC pAG4270. pAG4270 is the expression vector for the light chain 

CC variable region (VL) of the antibody 128.1 which uas obtained by PCR 

CC uith leader/J region prining (see also 843842). The vector also 

CC contains an anpicillin resistance genei a gpt (nycophenolic acid 

CC resistance) selected narker, an immunoglobulin H enhancer and an 

CC intron for V-constant region splicing. Transcription of the CH gene 

CC is fron the VH promoter of the nurine 27.44 gene. The cloning of 

CC the 128.1 VL region uas accomplished in two stages uith the production 

CC of plasnid pSV4271 as an intermediate vector uhich lacks the promoter 

CC region. This plasnid uas used in conjunction uith pAH4602 in the 

CC production of the chimeric antibody 128.1. 128.1 is an anti-hunan 

CC transferrin receptor antibody uhich binds to the transferrin receptor 

CC on brain capillary endothelial cells. This antibody nay be used in a 

CC conjugate in uhich it is linked to a neuropharnaceutical or diagnostic 

CC agent. The conjugate nay be used to treat or prevent neurological 

CC disorders eg. brain tumours* AIDS* strokei epilepsy* Parkinsons and 

CC Alzheiners disease. It nay also be used for diagnostic methods. 

S8 Sequence 106 AA; 

S8 7 A; 2 R; 5 N; 5 D» 0 B; 3 C; 6 8; 7 E? 0 Z; 4 G; 2 H; 

58 i I; 8 L; 8 K; o M; 4 F; 5 P? 16 S; 8 T? l H; 4 Y; 10 v; 

CC Retrieved by shears on Med 21 Sep 94 Il:59:l4-PDT using FindSeq 
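
The SQ composition lines give a convenient consistency check: the per-residue counts should add up to the length stated on the "SQ Sequence" line. The sketch below parses the two composition lines of R41687 (read with OCR damage repaired) and sums them; `parse_composition` is a hypothetical helper written for this illustration, not part of FindSeq or FastDB.

```python
import re


def parse_composition(sq_lines):
    """Turn 'SQ 7 A; 2 R; ...' lines into a {residue: count} dictionary."""
    counts = {}
    for line in sq_lines:
        # Each field is a number followed by a one-letter residue code and ';'.
        for number, residue in re.findall(r"(\d+)\s*([A-Z])\s*;", line):
            counts[residue] = counts.get(residue, 0) + int(number)
    return counts


sq = [
    "SQ 7 A; 2 R; 5 N; 5 D; 0 B; 3 C; 6 Q; 7 E; 0 Z; 4 G; 2 H;",
    "SQ 1 I; 8 L; 8 K; 0 M; 4 F; 5 P; 16 S; 8 T; 1 W; 4 Y; 10 V;",
]
composition = parse_composition(sq)
print(sum(composition.values()))  # compare with the stated length, 106 AA
```

The same check is useful for deciding how an ambiguous scanned character in a composition line should be read, since only one reading makes the total agree with the stated length.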

Initial Score = 6 Optimized Score = 6 Significance = 3.37 
Residue Identity = 26% Matches = 6 Mismatches = 17

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG 



II I I II 

SVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESVTEQDSKDSTYSLSSTLTLSKAD
10 20 30 40 50 X 60 70 

X 
R 

YEKHKVYACEVTHQGLSSPVTKSFNRGEC 
80 90 100 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-08-249-182-2.res made by on Wed 21 Sep 94 12:03:23-PDT.

Query sequence being compared: US-08-249-182-2 (1-6)
Number of sequences searched: 5543
Number of scores above cutoff: 2034

Results of the initial comparison of US-08-249-182-2 (1-6) with:
File: /home/shears/loring/lorin*.pep

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, ticks at 10, 50, 100, 500, 1000, 5000 and 10000) plotted against score]



DR WPI; 93-272183/34.
PT New purified enterotoxin receptor protein - used to develop
PT prods. for treating abnormal conditions caused by bacterially
PT released enterotoxin, partic. diarrhoea
PS Disclosure; Fig 3; 26pp; English.
CC The sequences given in R38862-63 represent the guanylyl cyclases,
CC GC-A and GC-B, which binds heat stable enterotoxin. These proteins
CC are enterotoxin receptors which may be used as a therapeutic to control
CC intestinal fluid permeation as well as abnormal conditions caused
CC by bacterially released enterotoxin. The binding domain of the
CC proteins, or antibodies to the proteins, can be used to eliminate
CC diarrhoea. The proteins may be used to isolate ligands and to screen
CC for antagonists of toxin binding. This sequence is given as it is
CC represented in the specification.
SQ Sequence 1025 AA;
SQ 82 A; 69 R; 39 N; 54 D; 0 B; 15 C; 40 Q; 64 E; 0 Z; 78 G; 28 H;
SQ 49 I; 122L; 39 K; 21 M; 46 F; 53 P; 56 S; 53 T; 16 W; 37 Y; 64 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88 
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
  ||||

GETVQAEAFDSVTIYFSDIVGFTALSAESTPMQVVTLLNDLYTCFDAIIDNFDVYKVETIGDAYMVVSGLPG
830 840 850 860 870 880 X 890 

RNGQRHAPEIARMALALLDAVSSFRIRHRPHDQL 
900 910 920 930 



9. US-08-249-182-2 (1-6) 
R38862 GC-A. 

ID R38862 standard; Protein; 1029 AA. 

AC R38862; 

DT 08-FEB-1994 (first entry) 

DE GC-A. 

KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine;
KW mucosa; polyA+ RNA; PCR; enterotoxin receptor; bacterial enterotoxin;
KW binding domain; antibody; diarrhoea; ligand; antagonist.
OS Rattus rattus.
PN US5237051-A.
PD 17-AUG-1993.
PF 06-DEC-1990; 623033.
PR 06-DEC-1990; US-623033.
PA (UYVA-) UNIV VANDERBILT.
PI Garbers DL, Schulz S;
DR WPI; 93-272183/34.
PT New purified enterotoxin receptor protein - used to develop
PT prods. for treating abnormal conditions caused by bacterially
PT released enterotoxin, partic. diarrhoea
PS Disclosure; Fig 3; 26pp; English.
CC The sequences given in R38862-63 represent the guanylyl cyclases,
CC GC-A and GC-B, which binds heat stable enterotoxin. These proteins
CC are enterotoxin receptors which may be used as a therapeutic to control
CC intestinal fluid permeation as well as abnormal conditions caused
CC by bacterially released enterotoxin. The binding domain of the
CC proteins, or antibodies to the proteins, can be used to eliminate
CC diarrhoea. The proteins may be used to isolate ligands and to screen
CC for antagonists of toxin binding.
SQ Sequence 1029 AA;
SQ 74 A; 73 R; 37 N; 56 D; 0 B; 16 C; 40 Q; 72 E; 0 Z; 75 G; 23 H;



FT /note= "1 N-linked glycosylation site"
FT Region 552..766
FT /label= C-terminal region of extracellular
FT domain
FT /note= "1 N-linked glycosylation site & 1
FT catalytic site"
FT Active_site 627..631
FT /label= active site of serine protease/esterase
FT /note= "fits the consensus sequence GXSXG"

PN WO9316102-A.
PD 19-AUG-1993.
PF 09-APR-1992; U02892.
PR 06-FEB-1992; US-832211.
PA (DAND ) DANA FARBER CANCER INST INC.
PI Morimoto C, Schlossman SF, Tanaka T;
DR WPI; 93-272827/34.
DR N-PSDB; Q46089.
PT Polypeptide fragments of CD26 - are capable of disrupting binding
PT of CD45 and CD26 and thus interfering with T-cell activation
PS Disclosure; pages 39-43; 73pp; English.
CC CD26 is a human T cell activation antigen originally identified by
CC its reactivity with the MAb Tal. CD26 cDNA library was constructed
CC from human PHA-activated T cells using the CDM7 vector. The hydrophobic
CC N-terminal of the predicted CD26 polypeptide has the characteristics
CC of a signal sequence of the type II membrane protein, which is
CC reinforced by the observation that potential N-glycosylation sites
CC are located in the carboxy side of the hydrophobic core. Therefore
CC the N-terminal 6 AAs are predicted to be cytoplasmic, the next 22
CC AAs are predicted to transverse the cytoplasmic membrane, and the
CC 738 C-terminal AAs constitute the predicted extracellular domain.
SQ Sequence 766 AA;
SQ 40 A; 30 R; 40 N; 46 D; 0 B; 12 C; 30 Q; 40 E; 0 Z; 43 G; 19 H;
SQ 49 I; 62 L; 40 K; 15 M; 31 F; 29 P; 64 S; 50 T; 21 W; 56 Y; 49 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:24-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
 ||||

VLEDNSALDKMLQNVQMPSKKLDFIILNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWA
500 510 520 530 540 X 550 560

TYLASTENIIVASFDGRGSGYQGDKIMHAINRRL
570 580 590 



8. US-08-249-182-2 (1-6) 
R38863 GC-B.

ID R38863 standard; Protein; 1025 AA.
AC R38863;
DT 08-FEB-1994 (first entry)
DE GC-B.
KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine;
KW mucosa; polyA+ RNA; PCR; enterotoxin receptor; bacterial enterotoxin;
KW binding domain; antibody; diarrhoea; ligand; antagonist.
OS Rattus rattus.
PN US5237051-A.
PD 17-AUG-1993.
PF 06-DEC-1990; 623033.
PR 06-DEC-1990; US-623033.
PA (UYVA-) UNIV VANDERBILT.



CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq 



Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
  ||||

GETVQAEAFDSVTIYFSDIVGFTALSAESTPMQVVTLLNDLYTCFDAVIDNFDVYKVETIGDAYMVVSGLPV
840 850 860 870 880 X 890 900

RNGQLHAREVARMALALLDAVRSFRIRHRPQEQL
910 920 930 



10. US-08-249-182-2 (1-6) 

R24102 Marek's disease virus MD20 polypeptide.

ID R24102 standard; Protein; 1074 AA.
AC R24102;
DT 14-NOV-1992 (first entry)
DE Marek's disease virus MD20 polypeptide.
KW Antibodies; vaccine; recombinant; poultry; passive immunotherapy;
KW diagnostic immunoassay; anti-idiotype; antigen.
OS Marek's disease virus.
PN EP-486106-A.
PD 20-MAY-1992.
PF 13-NOV-1991; 202947.
PR 16-NOV-1990; US-615211.
PA (ALKU ) AKZO NV.
PI Morgan RW;
DR WPI; 92-168713/21.
DR N-PSDB; Q24789.
PT DNA encoding Marek's disease virus polypeptides MD18 and MD20 -
PT and antibodies and vaccine useful for the protection of poultry
PT against MDV infection
PS Claim 8; Page 18; 31pp; English.
CC The protein sequence of MDV MD20 was deduced from the DNA sequence
CC obtd. by screening a lambda EMBL 3 library made by infecting chicken
CC embryo fibroblasts with a tissue-culture adapted passage of Marek's
CC disease virus (MDV) strain GA, and incubating until a 90 percent
CC cytopathic effect had developed. Vectors and host cells contg. the
CC MDV MD20 gene and MDV polypeptides can be used in a vaccine to protect
CC poultry against Marek's disease. Antibodies or antiserum raised by
CC the polypeptides may be used in passive immunotherapy, diagnostic
CC immunoassays and in the generation of anti-idiotypic antibodies for
CC use in a test kit for Marek's disease. The vaccine may also contain
CC immunogens related to other poultry pathogens, e.g. infectious
CC bronchitis-virus, Newcastle disease-virus or infectious bursal
CC disease-virus to produce a multivalent vaccine.
CC See also R24102.
SQ Sequence 1074 AA;
SQ 72 A; 67 R; 49 N; 66 D; 0 B; 30 C; 25 Q; 56 E; 0 Z; 68 G; 27 H;
SQ 80 I; 109L; 49 K; 24 M; 49 F; 43 P; 92 S; 52 T; 8 W; 41 Y; 67 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:06-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
|| ||

SGMILECSNTHSRGGPHIVKRLLALVSADSRAGGIGPANMLHGIDSAIDGPLPVYRVGHSKGRQAFTVLHTE
510 520 530 540 550 X 560 570

CHERTIPSPGSAKAHLIKLNNSYGTSTEDLISRD
580 590 600 610 



11. US-08-249-182-2 (1-6)
R38861 GC-C.

ID R38861 standard; Protein; 1075 AA.
AC R38861;
DT 08-FEB-1994 (first entry)
DE GC-C.
KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine;
KW mucosa; polyA+ RNA; PCR; enterotoxin receptor; bacterial enterotoxin;
KW binding domain; antibody; diarrhoea; ligand; antagonist.
OS Rattus rattus.
FH Key Location/Qualifiers
FT Peptide 1..22
FT /note= "Signal peptide"
FT Protein 23..1075
FT /note= "Mature GC-C"
FT Modified_site 31..33
FT /note= "N-linked glycosylation site"
FT Modified_site 74..76
FT /note= "N-linked glycosylation site"
FT Modified_site 78..80
FT /note= "N-linked glycosylation site"
FT Modified_site 187..189
FT /note= "N-linked glycosylation site"
FT Modified_site 194..196
FT /note= "N-linked glycosylation site"
FT Modified_site 306..308
FT /note= "N-linked glycosylation site"
FT Modified_site 356..358
FT /note= "N-linked glycosylation site"
FT Modified_site 401..403
FT /note= "N-linked glycosylation site"
FT Domain 433..453
FT /note= "Transmembrane domain"
PN US5237051-A.
PD 17-AUG-1993.
PF 06-DEC-1990; 623033.
PR 06-DEC-1990; US-623033.
PA (UYVA-) UNIV VANDERBILT.
PI Garbers DL, Schulz S;
DR WPI; 93-272183/34.
DR P-PSDB; R38861.
PT New purified enterotoxin receptor protein - used to develop
PT prods. for treating abnormal conditions caused by bacterially
PT released enterotoxin, partic. diarrhoea
PS Claim 2; Fig 1; 26pp; English.
CC This sequence represents guanylyl cyclase, GC-C, which binds heat
CC stable enterotoxin. The DNA encoding this protein was isolated from
CC rat small intestinal mucosa polyA+ RNA by PCR. This protein is an
CC enterotoxin receptor which may be used as a therapeutic to control
CC intestinal fluid permeation as well as abnormal conditions caused
CC by bacterially released enterotoxin. The binding domain of the
CC protein, or antibodies to the protein, can be used to eliminate
CC diarrhoea. The protein may be used to isolate ligands and to screen
CC for antagonists of toxin binding.
SQ Sequence 1075 AA;
SQ 42 A; 64 R; 48 N; 68 D; 0 B; 20 C; 32 Q; 73 E; 0 Z; 55 G; 23 H;
SQ 57 I; 118L; 68 K; 32 M; 52 F; 43 P; 75 S; 71 T; 11 W; 48 Y; 75 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq



ID R40916 standard; Protein; 593 AA. 

AC R40916;
DT 05-FEB-1994 (first entry)
DE Sequence of a CD26 fragment lacking a portion of the carboxy
DE terminal region
KW Human T cell activation antigen; monoclonal antibody Tal; CD26.
OS Synthetic.
PN WO9316102-A.
PD 19-AUG-1993.
PF 09-APR-1992; U02892.
PR 06-FEB-1992; US-832211.
PA (DAND ) DANA FARBER CANCER INST INC.
PI Morimoto C, Schlossman SF, Tanaka T;
DR WPI; 93-272827/34.
PT Polypeptide fragments of CD26 - are capable of disrupting binding
PT of CD45 and CD26 and thus interfering with T-cell activation
PS Example; Pages 46-48; 73pp; English.
CC CD26 is a human T cell activation antigen originally identified by
CC its reactivity with the MAb Tal. CD26 cDNA library was constructed
CC from human PHA-activated T cells using the CDM7 vector.
CC Fragments of CD26 can be prepd in the following manner.
CC CD26 XbaI-SphI cDNA fragment is ligated to the vector
CC RcSR-alpha-26 XbaI-HindIII DNA fragment and the linker Q46092.
CC The linker introduces an in-frame stop codon that results in the
CC deletion of the segment of CD26 from AA 594 to the carboxy
CC terminus of the wild-type protein. This deletion mutant, shown
CC in R40916, lacks the putative catalytic site of CD26 and has a new
CC carboxy terminus given in R40917.
SQ Sequence 593 AA;
SQ 28 A; 22 R; 34 N; 35 D; 0 B; 10 C; 21 Q; 31 E; 0 Z; 30 G; 11 H;
SQ 39 I; 54 L; 34 K; 9 M; 22 F; 25 P; 49 S; 41 T; 17 W; 46 Y; 35 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:24-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
 ||||

VLEDNSALDKMLQNVQMPSKKLDFIILNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWA
500 510 520 530 540 X 550 560

TYLASTENIIVASFDGRGSGYQGDKIMHA
570 580 590 



7. US-08-249-182-2 (1-6)

R40909 Sequence encoded by human CD26 cDNA.

ID R40909 standard; Protein; 766 AA.
AC R40909;
DT 05-FEB-1994 (first entry)
DE Sequence encoded by human CD26 cDNA.
KW Human T cell activation antigen; monoclonal antibody Tal.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Region 7..28
FT /label= hydrophobic
FT Region 29..323
FT /label= N-terminal glycosylated region of
FT extracellular domain
FT /note= "8 sites for N-linked glycans"
FT Region 324..551
FT /label= Cysteine rich region of extracellular



X X
PLDVYK
  ||||

DSGEYKCQHQQVNESEPVYLEVFSDWLLLSASAEVVMEGQPLFLRCHGMRNWDVYKVIYYKDGEALKYHYEN
90 100 110 120 130 140 X 150

HNISITNATVEDSGTYYCTGKVHQLDYESEPLNI
160 170 180 190 



5. US-08-249-182-2 (1-6) 

R42635 Human interferon receptor.

ID R42635 standard; Protein; 557 AA.
AC R42635;
DT 20-APR-1994 (first entry)
DE Human interferon receptor.
KW IFN-R; extracellular domain; monoclonal antibody; viral infection;
KW cell proliferation; allograft rejection; systemic lupus erythematosus;
KW psoriasis; multiple sclerosis; Behcet's Disease; aplastic anaemia;
KW immunodeficiency; measles virus; interferon-alpha-beta.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Domain 1..436
FT /label= extracellular_domain
FT /note= "soluble, immunogenic form of IFN-R"
PN EP-563487-A.
PD 06-OCT-1993.
PF 31-MAR-1992; 400902.
PR 31-MAR-1992; EP-400902.
PA (EUBI-) LAB EURO BIOTECHNOLOGIE SA.
PI Benoit P, Maguire D, Meyer F, Plavec I, Tovey KG;
DR WPI; 93-312951/40.
DR P-PSDB; R42635.
PT Monoclonal antibody to human interferon type-I receptor - having
PT neutralising activity against human type I interferon, used for
PT therapy and diagnosis
PS Disclosure; Fig 3; 21pp; English.
CC Monoclonal antibodies produced against soluble forms of the human
CC interferon alpha-beta receptor based on the full-length human IFN-R
CC sequence are claimed. The antibodies are useful for treatment and
CC prophylaxis of disorders involving cell proliferation and/or viral
CC infection.
SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 44 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 42 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:33-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
|| ||

IYIGAPKQSGNTPVIQDYPLIYEIIFMENTSNAERKIIEKKTDVTVPNLKPLTVYCVKARAHTMDEKLNKSS 
350 360 370 380 390 400 X 410 

VFSDAVCEKTKPGNTSKIWLIVGICIALFALPFV 
420 430 440 450 



6. US-08-249-182-2 (1-6) 

R40916 Sequence of a CD26 fragment lacking a portion of t



PS Disclosure; Page 5; 33pp; English. 

CC Antibodies were raised against two synthetic peptides that 

CC correspond to the N-terminal (R36681) and C-terminal (R36680) 

CC of guinea pig VPF (designated N-IgG and C-IgG, respectively).
SQ Sequence 25 AA;
SQ 2 A; 2 R; 0 N; 1 D; 0 B; 1 C; 1 Q; 3 E; 0 Z; 1 G; 0 H;
SQ 0 I; 0 L; 3 K; 2 M; 1 F; 2 P; 1 S; 0 T; 0 W; 2 Y; 3 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:00-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
  ||||

APMAEGEQKPREVVKFMDVYKRSYC
10 X 20 



4. US-08-249-182-2 (1-6)

R26064 Human FcERI alpha-subunit and IL-2 hybrid protein.

ID R26064 standard; Protein; 235 AA.
AC R26064;
DT 02-FEB-1993 (first entry)
DE Human FcERI alpha-subunit and IL-2 hybrid protein.
KW High affinity Fc immunoglobulin E receptor; IgE;
KW antibody; interleukin-2; histamine release; allergy.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Region 26..201
FT /label= human_FcERI_alpha-subunit
FT /note= "epitope recognised by new MAb"
PN EP-499112-A.
PD 19-AUG-1992.
PF 03-FEB-1992; 101732.
PR 11-FEB-1991; US-653936.
PA (HOFF ) HOFFMANN LA ROCHE & CO AG F.
PI Chizzonite RA, Hakimi J, Kochan JP;
DR WPI; 92-277871/34.
PT Monoclonal antibodies bind to alpha sub-unit of Fc IgE receptor -
PT for treatment and prevention of IgE induced allergic diseases,
PT also for measuring alpha sub-unit and IgE levels in biological
PT fluids
PS Disclosure; Page 8; 30pp; English.
CC This is a preferred protein for use in generating the
CC monoclonal antibodies of the invention. The protein comprises
CC an epitope of the human FcERI alpha-subunit to which the cytoplasmic
CC and transmembrane regions of the IL-2 receptor have been fused.
CC (Cytoplasmic and transmembrane regions from receptors other than the
CC IL-2 receptor can also be used). The specification includes a
CC nucleotide coding sequence which is a preferred fusion gene (see
CC Q27267); the polypeptide which is decoded from that fusion gene
CC differs from the amino acid sequence R26064 as follows: amino acids
CC 5-7 are Arg-Ile-Leu (not Met-Glu-Ser), amino acid 209 is Cys (not
CC Lys), amino acid 229 is Ser (not Arg), Arg233 is absent and an
CC additional C-terminal amino acid (Phe) is present.
SQ Sequence 235 AA;
SQ 13 A; 9 R; 16 N; 7 D; 0 B; 5 C; 10 Q; 18 E; 0 Z; 12 G; 4 H;
SQ 9 I; 24 L; 14 K; 3 M; 10 F; 11 P; 18 S; 12 T; 8 W; 11 Y; 21 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:22-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2



r aluv i t\ 



X X 



2. US-08-249-182-2 (1-6) 

R13381 Vascular permeability Factor (1-21). 

ID R13381 standard; peptide; 21 AA.
AC R13381;
DT 22-OCT-1991 (first entry)
DE Vascular permeability factor (1-21).
KW VPF; antibody.
OS Synthetic.
PN US5036003-A.
PD 30-JUL-1991.
PF 02-SEP-1988; 240780.
PR 21-AUG-1987; US-087739.
PR 02-SEP-1988; US-240780.
PA (MONS ) MONSANTO CO.
PI Olander JV, Connolly DT, Adams SP, Feder J;
DR WPI; 91-245462/33.
PT Antibodies to vascular permeability factor peptides - which block
PT permeability-enhancing activity and growth-promoting activity of
PT vascular permeability factor.
PS Claim 2; Page 5; 6pp; English.
CC The amino acid sequence encodes a synthetic peptide fragment of VPF
CC prepared by solid phase synthesis which is used to stimulate an
CC antibody response. This response blocks the permeability-enhancing
CC and growth promoting activity of VPF used by tumours to increase
CC their nutrient supply. The Abs are also useful as immunoadsorbents
CC for VPF isolation and in assays for VPF. See also R13380.
SQ Sequence 21 AA;
SQ 2 A; 1 R; 0 N; 1 D; 0 B; 0 C; 1 Q; 3 E; 0 Z; 1 G; 0 H;
SQ 0 I; 0 L; 3 K; 2 M; 1 F; 2 P; 0 S; 0 T; 0 W; 1 Y; 3 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:33-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
  ||||

APMAEGEQKPREVVKFMDVYK
10 X 20 
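
The displayed subject sequence can be cross-checked against the SQ composition lines of the same record. The sketch below counts residues in the 21-residue R13381 peptide; the third and seventeenth residues are read as M, which is what the record's composition (2 M, 0 H) implies. This is only an illustrative check and not part of the search software.

```python
from collections import Counter

# R13381 subject sequence as displayed in the alignment (21 AA).
subject = "APMAEGEQKPREVVKFMDVYK"

counts = Counter(subject)
print(len(subject))
# Residues listed with non-zero counts in the SQ lines of the record.
for residue in "ARDQEGKMFPYV":
    print(residue, counts[residue])
```

A residue that appears in the sequence but has a zero count in the SQ lines (or the reverse) points to a misread character in one of the two places.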



3. US-08-249-182-2 (1-6) 

R36681 Guinea pig VPF N-terminal. 

ID R36681 standard; peptide; 25 AA. 

AC R36681; 

DT 23-AUG-1993 (first entry) 

DE Guinea pig VPF N-terminal. 

KW Vascular permeability factor; effusion; malignancy; antibody. 

OS Cavia porcellus. 

PN WO9308473-A.

PD 29-APR-1993.

PF 21-OCT-1992; U09068.

PR 24-OCT-1991; US-782350.

PA (BETH-) BETH ISRAEL HOSPITAL ASSOC. 

PI Dvorak HF, Yeo K, Yeo T; 

DR WPI; 93-152625/18. 

PT Immunoassay for detecting vascular permeability (VPF) factor in 

PT effusions - comprises detecting VPF in sample using immobilised 

PT antibody to C-terminus of VPF having specified sequence and 2nd 



23. P98432  Sequence of C. trachomatis se     14     3     3  2.91   0
24. P98464  Sequence of C. trachomatis se     14     3     3  2.91   0
25. P98428  Sequence of C. trachomatis se     14     3     3  2.91   0
26. P98456  Sequence of C. trachomatis se     14     3     3  2.91   0
27. P98420  Sequence of C. trachomatis se     14     3     3  2.91   0
28. P98416  Sequence of C. trachomatis se     14     3     3  2.91   0
29. P98412  Sequence of C. trachomatis se     14     3     3  2.91   0
30. P98452  Sequence of C. trachomatis se     14     3     3  2.91   0
31. P98448  Sequence of C. trachomatis se     14     3     3  2.91   0
32. P98468  Sequence of C. trachomatis se     14     3     3  2.91   0
33. P98460  Sequence of C. trachomatis se     14     3     3  2.91   0
34. P93297  Chlamydia trachomatis serovar     14     3     3  2.91   0
35. P93291  Sequence of Chlamydia trachom     14     3     3  2.91   0
36. P91447  Sequence of Chlamydia trachom     14     3     3  2.91   0
37. R43907  HIV-1 RF gp120 monoclonal ant     15     3     3  2.91   0
38. R34502  Immunogenic AHAS peptide #4.      15     3     3  2.91   0
39. R44830  Human fibrin beta-chain inter     16     3     3  2.91   0
40. P93015  38-57 region of equine follic     20     3     3  2.91   0




1. US-08-249-182-2 (1-6)

R37444 Autotaxin peptide ATX 19.

ID R37444 standard; peptide; 7 AA.
AC R37444;
DT 22-JUL-1993 (first entry)
DE Autotaxin peptide ATX 19.
KW Cell motility stimulating; cancer metastasis; antibody; detection;
KW immunostains; disease outcome prediction; therapy choice;
KW cancer therapy; crosslinked toxins.
OS Synthetic.
FH Key Location/Qualifiers
FT Modified_site 2
FT /note= "potentially glycosylated residue"
PN US7822043-A.
PD 01-JAN-1993.
PF 17-JAN-1992; 822043.
PR 17-JAN-1992; US-822043.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.
PI Krutzsch H, Liotta LA, Schiffmann E, Stracke H;
DR WPI; 93-085861/10.
PT Motility stimulating protein named autotaxin - useful in cancer
PT diagnosis and therapy
PS Example; Page 33; 36pp; English.
CC The sequence is that of autotaxin peptide ATX 19. It may be used to
CC raise anti-autotaxin antibodies which can be used to diagnose cancer
CC metastasis and in immunostains of patient samples to detect the
CC presence of autotaxin. The level of autotaxin in tissue or body
CC fluids can be used to predict disease outcomes and/or choice of
CC therapy which may also include autotaxin inhibitors. Autotaxin
CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer
CC therapy.
SQ Sequence 7 AA;
SQ 0 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;
SQ 0 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 0 T; 0 W; 1 Y; 1 V;
SQ 1 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.85
Residue Identity = 83% Matches = 5 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
PLDVYK 
Mill 



[Histogram x-axis: SCORE 0 to 5; STDEV 1 to 4]



PARAMETERS

Similarity matrix          Unitary
Mismatch penalty           1
Joining penalty            20
Gap penalty                1.00
Window size                5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0
Initial scores to save     40
Alignments to save         15
Optimized scores to save   0
Display context            50
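
With a unitary similarity matrix, each identical residue pair contributes 1 to the score. The sketch below is not the FastDB algorithm (which also applies the window, joining and gap settings listed above); it is a simplified, ungapped illustration that slides the query along a subject and counts identities in each window. The function name is hypothetical.

```python
def best_ungapped_match(query: str, subject: str):
    """Slide query along subject; return (identities, 0-based offset) of the best window."""
    best = (0, -1)
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        # Unitary scoring: 1 for each identical position, 0 otherwise.
        score = sum(q == s for q, s in zip(query, window))
        if score > best[0]:
            best = (score, offset)
    return best


# Query US-08-249-182-2 (1-6) against the R13381 peptide from this printout.
print(best_ungapped_match("PLDVYK", "APMAEGEQKPREVVKFMDVYK"))
```

For this pair the best window is the C-terminal FMDVYK, where four of six positions (DVYK) are identical, in line with the 4 matches and 2 mismatches reported for R13381.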




SEARCH STATISTICS

Scores:    Mean           Median           Standard Deviation
           0              1                1.03

Times:     CPU            Total Elapsed
           00:00:09.94    00:00:16.00

Number of residues:               482836
Number of sequences searched:     5543
Number of scores above cutoff:    2034



The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
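
The significance values in this results file are consistent with a plain z-score of the initial score, using the mean (printed as 0) and standard deviation (1.03) from the search statistics. The sketch below assumes that relationship; it is inferred from the reported numbers and is not taken from FastDB documentation.

```python
def significance(score: float, mean: float, sd: float) -> float:
    """Number of standard deviations the score lies above the mean, to 2 decimals."""
    return round((score - mean) / sd, 2)


# Mean and standard deviation as printed under SEARCH STATISTICS.
for initial_score in (5, 4, 3):
    print(initial_score, significance(initial_score, mean=0, sd=1.03))
```

Under this assumption, initial scores of 5, 4 and 3 correspond to the 4.85, 3.88 and 2.91 values listed for the hits of this query.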



The list of best scores is:

                                                 Init.  Opt.
Sequence Name Description                 Length Score Score  Sig. Frame

**** 4 standard deviations above mean ****
 1. R37444  Autotaxin peptide ATX 19.          7     5     5  4.85   0
**** 3 standard deviations above mean ****
 2. R13381  Vascular permeability factor      21     4     4  3.88   0
 3. R36681  Guinea pig VPF N-terminal.        25     4     4  3.88   0
 4. R26064  Human FcERI alpha-subunit and    235     4     4  3.88   0
 5. R42635  Human interferon receptor.       557     4     4  3.88   0
 6. R40916  Sequence of a CD26 fragment l    593     4     4  3.88   0
 7. R40909  Sequence encoded by human CD2    766     4     4  3.88   0
 8. R38863  GC-B.                           1025     4     4  3.88   0
 9. R38862  GC-A.                           1029     4     4  3.88   0
10. R24102  Marek's disease virus MD20 po   1074     4     4  3.88   0
11. R38861  GC-C.                           1075     4     4  3.88   0
12. P60243  Sequence encoding the entire    2179     4     4  3.88   0
**** 2 standard deviations above mean ****
13. P30125  Sequence of immunoregulatory       6     3     3  2.91   0
14. P71314  Sequence of fibrin immunogen       7     3     3  2.91   0
15. R44837  Human fibrin beta-chain N-ter      8     3     3  2.91   0
16. P82686  Human fibrin beta chain N-ter      8     3     3  2.91   0
17. R44838  Human fibrin beta-chain pepti     12     3     3  2.91   0
18. R44829  Human fibrin beta-chain N-ter     12     3     3  2.91   0
19. R28&29  N-terminal human fibrin pepti     12     3     3  2.91   0
20. R42343  C-erbB-2 C-terminal peptide.      13     3     3  2.91   0
21. P90379  Sequence of antigenic epitope     14     3     3  2.91   0




Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
  ||||

KGIVEPELYEEVTIYFSDIVGFTTICKYSTPMEVVDMLNDIYKBFDQIVDHHDVYKVETIGDAYVVASGLPM
820 830 840 850 860 X 870 880 



RNGNRHAVDISKHALDILSFMGTFELEHLPGLPV 
890 900 910 



12. US-08-249-182-2 (1-6) 

P60243 Sequence encoding the entire genomic RNA of human

ID P60243 standard; Protein; 2179 AA.
AC P60243;
DT 07-AUG-1991 (first entry)
DE Sequence encoding the entire genomic RNA of human rhinovirus
KW Monoclonal antibody; MAB; HRV; vaccine; ss.
OS Human rhinovirus.
FH Key Location/Qualifiers
FT Region 1..69
FT /label= VP4 structural protein
FT Region 70..331
FT /label= VP2 structural protein
FT Region 332..567
FT /label= VP3 structural protein
FT Region 568..856
FT /label= VP1 structural protein
FT Region 857..1002
FT /label= 3B protein
FT Region 1003..1099
FT /label= 5B protein
FT Region 1100..1429
FT /label= X protein
FT Region 1430..1514
FT /label= protein IB
FT Region 1515..1537
FT /label= protein VPg
FT Region 1538..1719
FT /label= Protease
FT Region 1720..2179
FT /label= Replicase
PN EP-169146-A.
PD 22-JAN-1986.
PF 17-JUL-1985; 401465.
PR 20-JUL-1984; US-632785.
PR 10-APR-1985; US-721735.
PA (MERI ) MERCK & CO INC.
PI Colonno RJ, Mitzutani S;
DR WPI; 86-022809/04.
DR N-PSDB; N60194.
PT New DNA encoding the entire genomic RNA of human rhinovirus 14 -
PT and monoclonal antibodies which block attachment or neutralise
PT infectivity of rhinovirus.
PS Example 11; Page 22-39; 80pp; English.
CC Sequence may be used for the manufacture of hybridoma cells
CC expressing the HRV or fragments thereof. The fusion products may be
CC used in immunisation, or to raise MAbs for passive treatment of HRV
CC infection.
SQ Sequence 2179 AA;
SQ 110A; 82 R; 114N; 117D; 0 B; 43 C; 81 Q; 103E; 0 Z; 146G; 53 H;
SQ 144I; 196L; 137K; 53 M; 85 F; 121P; 158S; 172T; 24 W; 85 Y; 155V;



Initial Score = 4 Optimized Score = 4 Significance = 3.88
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
 || ||

FAGSSLIQSICNTHHIFRDEIYVVEGGWPSGCSGTSIFNSfllNNIIIRTLILDAYKGIDLDKLKILAYGDDL 
1980 1990 2000 2010 2020 2030 X 2040 

IVSYPYELDPGVIATLGKNYGLTITPPDKSETFT 
2050 2060 2070 2080 



13. US-08-249-182-2 (1-6) 

P30125 Sequence of immunoregulatory peptides I, II, XIII,

ID P30125 standard; peptide; 6 AA.
AC P30125;
DT 03-AUG-1992 (first entry)
DE Sequence of immunoregulatory peptides I, II, XIII, XIV, and XV.
KW Pharmaceutical; immunoregulation; peptide; lymphocyte activity;
KW antibody.
FH Key Location/Qualifiers
FT Misc_difference 1
FT /label= pyro-E
FT Misc_difference 2..4
FT /label= I
FT Misc_difference 2..5
FT /label= II
FT Misc_difference 1..4
FT /label= XIII
FT Misc_difference 1..5
FT /label= XIV
FT Misc_difference 1..6
FT /label= XV
PN EP--67425-A.
PD 22-DEC-1982.
PF 11-JUN-1982; 387655.
PR 12-JUN-1981; HU-001755.
PA (RICT ) RICHTER GEDEON VEGY.
PI Kisfaludy L, Nyeki O, Schoen I, Denes L, Ember J, Hajos G,
PI Szporny L, Szende B;
DR WPI; 83-00264K/01.
PT Protected tri:, tetra: and penta:peptide(s) - for influencing
PT immuno-regulation e.g. by affecting lymphocyte activity, antibody
PT prodn., defence mechanisms
PS Claim 1; Page 45; 51pp; German.
CC The inventors claim 15 (peptides I-XV) opt. protected peptides and
CC their salts, complexes, amides and 1-5C alkyl esters (see P30125-35).
CC The peptides (I)-(XV) are pharmaceuticals, esp. for influencing
CC immunoregulation, with an effect on e.g. lymphocyte activity,
CC antibody prodn. and cells which produce specific defences.
SQ Sequence 6 AA;
SQ 0 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;
SQ 0 I; 0 L; 1 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:02-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 2.91
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK



PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribo:nucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic 

PT lupus erythematosus 

PS Claim 1; Page 30; 43pp; English. 

CC The sequences given in R43391-562 are linear epitopes which are 

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen, 

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B' 

CC polypeptide. These antigens are common in systemic lupus

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family 

CC of proteins has been shown to have several molecular forms which are 

CC defined by the molecular weight of the antigen identified. The major 

CC form has a molecular weight of 60 kD and two additional forms have 

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this 

CC group of autoantibodies and binds small RNAs with a polyuridine 

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin 

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric 

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human 

CC autoantibody and may therefore be used as vaccines. 

SQ Sequence 10 AA;

SQ 3 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 2 K; 0 M; 1 F; 0 P; 1 S; 1 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:35-PDT using FindSeq
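
The two SQ composition lines in each record appear to list residue counts in a fixed order (A R N D B C Q E Z G H, then I L K M F P S T W Y V). The sketch below recomputes those lines for the 10-residue subject sequence shown in the alignment that follows. The function names and the exact spacing of the output are assumptions made for illustration; they are not part of FindSeq or FastDB.

```python
from collections import Counter

# Column order as it appears on the SQ composition lines of these records.
SQ_ORDER = "ARNDBCQEZGH" + "ILKMFPSTWYV"


def composition(seq):
    """Return residue counts for seq, keyed in SQ column order."""
    counts = Counter(seq.upper())
    return {aa: counts.get(aa, 0) for aa in SQ_ORDER}


def format_sq_lines(seq):
    """Rebuild the three SQ lines (length line plus two composition lines)."""
    comp = composition(seq)
    first = " ".join(f"{comp[aa]} {aa};" for aa in SQ_ORDER[:11])
    second = " ".join(f"{comp[aa]} {aa};" for aa in SQ_ORDER[11:])
    return [f"SQ Sequence {len(seq)} AA;", f"SQ {first}", f"SQ {second}"]


if __name__ == "__main__":
    for line in format_sq_lines("STKQAAFKAV"):
        print(line)
```

Comparing a recomputed composition with the printed one is a cheap consistency check: the counts must sum to the stated length, which also helps spot residues that were misread in the sequence line.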

Initial Score = 3 Optimized Score = 3 Significance = 3.07 
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
   YPAFK
     |||
STKQAAFKAV
X 10 
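
The Matches, Mismatches and Residue Identity figures reported with each alignment can be checked by hand. A minimal sketch, assuming identity is the number of matches divided by the number of aligned (overlapping) positions and truncated to a whole percent (the report prints 66% for 4 matches out of 6, which suggests truncation rather than rounding). The function name is hypothetical.

```python
def residue_identity(query_segment, subject_segment):
    """Compare two equal-length, ungapped aligned segments.

    Returns (matches, mismatches, identity_percent), with the percentage
    truncated to an integer.
    """
    if len(query_segment) != len(subject_segment):
        raise ValueError("aligned segments must have the same length")
    matches = sum(q == s for q, s in zip(query_segment, subject_segment))
    mismatches = len(query_segment) - matches
    identity = 100 * matches // len(query_segment)
    return matches, mismatches, identity


if __name__ == "__main__":
    # Query YPAFK against the overlapping part of STKQAAFKAV (residues 4-8).
    print(residue_identity("YPAFK", "QAAFK"))
    # Query PLDVYK against the overlapping subject residues ILDAYK.
    print(residue_identity("PLDVYK", "ILDAYK"))
```

When the query overhangs the end of a short subject, only the overlapping positions are counted, which is how 3 matches and 1 mismatch give 75%.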



7. US-08-249-182-3 (1-5) 

R07105 Hepatitis B Virus PreS1 epitope (I).

ID R07105 standard; peptide; 11 AA.

AC R07105;

DT 24-JAN-1991 (first entry)

DE Hepatitis B Virus PreS1 epitope (I).

KW Monoclonal antibodies; Hepatitis B Virus PreS1; PreS2; epitope;

KW immunoassay; vaccine.

OS Synthetic.

PN EP-389983-A.

PD 03-OCT-1990.

PF 23-MAR-1990; 105550.

PR 31-MAR-1989; US-332014.

PR 25-MAY-1989; US-357708.

PA (ABBO ) ABBOTT LABORATORIES.

PI Mimms LT, Floreani MF, Eble KS, Rosenlof RV, Tyner JD;

PI Hitters E; 

DR WPI; 90-298968/40. 

PT Monoclonal antibodies to hepatitis B virus preS1 and preS2

PT epitope(s) - use in immunoassays, antibody assays, sub-typing HBV,

PT vaccines, etc. 

PS Claim 1; Page 20; 22pp; English. 



CC immunogens, followed by fusion. The MAb were then gpd. by binding and

CC inhibition studies using peptides corresp. to epitopes specific for

CC PreS1, as represented here.

CC The antibody binds to the PreS1 protein and not to the H protein.

CC The MAbs can be used to develop specific and sensitive immunoassays

CC to detect HBV and HB surface antigen in samples. They can also be

CC used in immunoassays and to detect, characterise and isolate

CC epitopal sites in PreS1 that may be useful components of

CC subunit HBV vaccines. They have a potential as a prophylactic and

CC curative agent. The Ab may further be used to map the hepatocyte

CC binding region of the L protein and for intracellular and cell 

CC surface staining of HBV infected hepatocytes and experimentally 

CC transfected hepatoma cell lines. 

CC See also R07106-07. 

SQ Sequence 11 AA; 

SQ 2 A; 0 R; 1 N; 1 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 0 I; 1 L; 0 K; 0 M; 1 F; 1 P; 1 S; 0 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:09-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
   YPAFK
    |||
HQLDPAFGANS
X 10 



8. US-08-249-182-3 (1-5) 

R43438 Ro/SSA epitope 106. 

ID R43438 standard; peptide; 12 AA. 

AC R43438;

DT 12-MAY-1994 (first entry)

DE Ro/SSA epitope 106.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribo:nucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic 

PT lupus erythematosus 

PS Claim 1; Page 30; 43pp; English. 

CC The sequences given in R43391-562 are linear epitopes which are 

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B' 

CC polypeptide. These antigens are common in systemic lupus 

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family 

CC of proteins has been shown to have several molecular forms which are 

CC defined by the molecular weight of the antigen identified. The major 

CC form has a molecular weight of 60 kD and two additional forms have 

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this 

CC group of autoantibodies and binds small RNAs with a polyuridine 

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin 



CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 12 AA;

SQ 3 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 1 I; 0 L; 2 K; 0 M; 1 F; 0 P; 2 S; 1 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:35-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
    YPAFK
      |||
ISTKQAAFKAVS
X 10 



9. US-08-249-182-3 (1-5) 

R43392 La/SSb epitope 139. 

ID R43392 standard; peptide; 13 AA. 

AC R43392; 

DT 12-MAY-1994 (first entry)

DE La/SSb epitope 139.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribo:nucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic

PT lupus erythematosus 

PS Claim 1; Page 30; 43pp; English. 

CC The sequences given in R43391-562 are linear epitopes which are 

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B' 

CC polypeptide. These antigens are common in systemic lupus 

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family 

CC of proteins has been shown to have several molecular forms which are

CC defined by the molecular weight of the antigen identified. The major 

CC form has a molecular weight of 60 kD and two additional forms have 

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this 

CC group of autoantibodies and binds small RNAs with a polyuridine 

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin 

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 13 AA;

SQ 1 A; 2 R; 1 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 1 H;

SQ 1 I; 1 L; 2 K; 1 M; 1 F; 0 P; 0 S; 1 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:34-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
        YPAFK
          |||
NIQMRRTLHKAFK
10 X 



10. US-08-249-182-3 (1-5) 

P91772 Mas oncongene angiotensin receptor peptide.

ID P91772 standard; protein; 16 AA.

AC P91772;

DT 14-MAY-1990 (first entry)

DE Mas oncongene angiotensin receptor peptide.

KW Antibody; angiotensin; mas; hypertension;

PN WO8911657-A.

PD 30-NOV-1989.

PF 25-MAY-1989; G00583.

PR 25-MAY-1988; GB-012354.

PR 25-MAY-1988; GB-012353.

PA (CAMB) Cambridge Res Bioch.

PI Sheppard PW, Varndell IM;

DR WPI; 89-370824/50.

PT Detection of (pre-)hypertensive and/or (pre-)cancerous states -

PT and new peptide(s) used in raising antibodies at angiotensin

PT receptor useful in immunoassay, and treatment with angiotensin

PT antagonist.

PS Claim 3; Page 22; 32pp; English.

CC Antibodies raised against the peptide can be used for detection and 

CC treatment of hypertension, tumours and other neoplasms, and prevention of

CC ectopic hormone prodn.

CC See also P91707 and P91771. 

SQ Sequence 16 AA; 

SQ 1 A; 3 R; 1 N; 2 D; 0 B; 1 C; 2 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 2 K; 1 M; 1 F; 1 P; 0 S; 0 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:56:50-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 75% Matches = 3 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK
  |||
 RAFKDEMQPRRQKDNC
X X 10 



11. US-08-249-182-3 (1-5) 

R43391 La/SSb epitope 136. 

ID R43391 standard; peptide; 18 AA. 
AC R43391;

DT 12-MAY-1994 (first entry)

DE La/SSb epitope 136.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribo:nucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic

PT lupus erythematosus

PS Claim 1; Page 30; 43pp; English.

CC The sequences given in R43391-562 are linear epitopes which are 

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B'

CC polypeptide. These antigens are common in systemic lupus 

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family 

CC of proteins has been shown to have several molecular forms which are 

CC defined by the molecular weight of the antigen identified. The major 

CC form has a molecular weight of 60 kD and two additional forms have 

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this 

CC group of autoantibodies and binds small RNAs with a polyuridine 

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin 

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 18 AA;

SQ 1 A; 2 R; 1 N; 0 D; 0 B; 0 C; 2 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 1 I; 2 L; 2 K; 1 M; 1 F; 0 P; 1 S; 1 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:34-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0



X X 
           YPAFK
             |||
QVLNIQMRRTLHKAFKGS
10 X X 



12. US-08-249-182-3 (1-5)

R39830 E1 peptide RV-EP6, residues 106-125.

ID R39830 standard; peptide; 20 AA.
AC R39830;

DT 19-JAN-1994 (first entry)

DE E1 peptide RV-EP6, residues 106-125.

KW Rubella virus; RV; E1 protein; antibody; mammal; vaccine; rubella;
KW neutralising; cell mediated; immune response; mumps; measles.
OS Synthetic.
PN WO9314206-A.
PD 22-JUL-1993.



rr tv unn i77i)) unuuiii 

PR 20-JAN-1992; GB-001139. 

PA (CONN-) CONNAUGHT LAB LTD. 

PI Chong P, Gillam S, Tingle A;

DR WPI; 93-243221/30.

PT Synthetic peptide(s) having at least one antigenic determinant of

PT a rubella virus protein - useful for producing vaccine, and also

PT to detect associated antibodies to treat associated auto-immune

PT disorders, etc.

PS Table 1; Page 41; 68pp; English.

CC The sequences given in R39825-52 represent rubella virus (RV) E1

CC protein peptide fragments. These peptides are capable of eliciting

CC high titres of antibodies against RV in mammals. They may be used

CC in vaccines to elicit neutralising antibodies and a cell mediated

CC immune response against RV. They may be used as one component of a

CC multivalent vaccine, pref. one providing protection against rubella,

CC mumps and measles.

SQ Sequence 20 AA; 

SQ 2 A; 0 R; 0 N; 0 D; 0 B; 1 C; 1 Q; 2 E; 0 Z; 2 G; 2 H;

SQ 0 I; 0 L; 1 K; 0 M; 1 F; 2 P; 1 S; 1 T; 0 W; 3 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:19-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
              YPAFK
               |||
GSYYKSYHPTACEVEPAFGH
10 X 20 



13. US-08-249-182-3 (1-5) 

R26160 Tuberculosis antibody production peptide #1. 

ID R26160 standard; peptide; 20 AA. 

AC R26160; 

DT 02-FEB-1993 (first entry) 

DE Tuberculosis antibody production peptide #1. 

KW 85-C; Mycobacterium tuberculosis; immunise; antibody; vaccine;

KW carrier molecule; antisera.

OS Synthetic.

PN EP-499003-A.

PD 19-AUG-1992.

PF 14-FEB-1991; 400388.

PR 14-FEB-1991; EP-400388.

PA (INNO-) INNOGENETICS NV SA.

PI Content J, De Bruyn J, De Wit L;

DR WPI; 92-277793/34.

PT Recombinant peptide(s) and their nucleic acids - for diagnosing 

PT tuberculosis and as a vaccine against tuberculosis 

PS Disclosure; Page 11; 48pp; English. 

CC The sequences given in R26160-66 are peptides which were used in the

CC scope of the invention to raise antibodies against tuberculosis. 

CC They correspond to regions of the 85-C antigen containing region of 

CC Mycobacterium tuberculosis and can be used in the production of 

CC vaccines for immunisation against tuberculosis. The peptides may

CC be used to raise antisera and in this case would be synthesised with an

CC additional cysteine residue, pref. attached to the amino terminal.

CC This facilitates coupling of the peptide to a carrier molecule which

CC is necessary to render the peptide immunogenic. 

SQ Sequence 20 AA; 

SQ 2 A; 1 R; 2 N; 4 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 2 G; 0 H;

SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 1 T; 1 W; 1 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:21-PDT using FindSeq



PT Hybrid monoclonal antibody - used for prepn. of thrombolytic drug 

PT having increased thrombolytic activity and specificity and 

PT reduced reactivity to fibrinogen 

PS Example 1; Page 14; 38pp; Japanese.

CC Human fibrin beta-chain peptides A and B were synthesised and coupled

CC to BSA for injection into mice. The peptides were used to raise

CC antibodies to human fibrin. Monoclonal antibodies specific for fibrin 

CC are used in the production of bispecific monoclonal antibodies 

CC which also recognise truncated tPA muteins lacking the finger, EGF and 

CC Kringle 1 domains.

SQ Sequence 8 AA;

SQ 0 A; 1 R; 0 N; 1 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 0 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 0 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:45-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 2.91
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
   PLDVYK
   |||
GHRPLDKC
X X 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-08-249-182-3.res made by on Wed 21 Sep 94 12:06:14-PDT.



Query sequence being compared: US-08-249-182-3 (1-5)
Number of sequences searched: 5543
Number of scores above cutoff: 1885

Results of the initial comparison of US-08-249-182-3 (1-5) with:
File : /home/shears/loring/lorin*.pep

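[Histogram: NUMBER OF SEQUENCES (vertical axis, logarithmic, ticks at 10000, 5000, 1000, 500, 100 and 50) plotted against score; the plotted points are not recoverable from the scan]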

   |||
ERKDVY 
X X 



14. US-08-249-182-2 (1-6)

P71314 Sequence of fibrin immunogen for the prepn. of mon

ID P71314 standard; peptide; 7 AA.

AC P71314;

DT 19-JUN-1991 (first entry)

DE Sequence of fibrin immunogen for the prepn. of monoclonal antibodies

DE (MAbs).

KW Fibrin-specific monoclonal antibody; screening.

FH Key Location/Qualifiers 

FT Misc_difference 7 

FT /label= Lys-OH

PN WO8706263-A.

PD 22-OCT-1987.

PF 14-APR-1987; U00862.

PR 14-APR-1986; US-851514.

PA (GEHO-) GEN HOSPITAL CORP.

PA (GEHO-) GEN HOSPITAL CORP.

PI Matsueda GR, Haber E;

DR WPI; 87-306855/43.

PT Screening of fibrin-specific monoclonal antibodies - by contact

PT with immobilised crosslinked fibrin clot and screening with

PT detectable labelling step

PS Disclosure; Page 7; 41pp; English.

CC The MAbs are specific to fibrin without fibrinogen cross-reactivity.

CC They have increased binding to in vitro and in vivo thrombi. The

CC MAbs can be used in immunoassays for fibrin in the presence of

CC fibrinogen or other proteins. They can be used as immunoaffinity

CC ligands for the purification of fibrin. 

SQ Sequence 7 AA; 

SQ 0 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 0 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 0 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:13-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 2.91
Residue Identity = 75% Matches = 3 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X 
   PLDVYK
   |||
GHRPLDK
X X 



15. US-08-249-182-2 (1-6) 

R44837 Human fibrin beta-chain N-terminal peptide A.

ID R44837 standard; peptide; 8 AA.

AC R44837;

DT 20-JUN-1994 (first entry)

DE Human fibrin beta-chain N-terminal peptide A.

KW Tissue plasminogen activator; t-PA; mutein; fibrin; antigen;

KW anti-fibrin; monoclonal antibody; hybridoma; thrombolysis;

KW antithrombotic agent; bispecific antibody.

OS Synthetic.

PN J05304992-A.

PD 19-NOV-1993.

PF 17-JUN-1992; 158301.

PR 20-JUN-1991; JP-148936.

PA (TAKE ) TAKEDA CHEM IND LTD. 



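[Histogram, continued: lowest vertical-axis tick 10; horizontal axis labelled SCORE, with a STDEV scale (1 to 4) beneath it; tick positions are not recoverable from the scan]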


PARAMETERS

Similarity matrix         Unitary     K-tuple                 2
Mismatch penalty          1           Joining penalty        20
Gap penalty               1.00        Window size             5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40          Alignments to save     15
Optimized scores to save  0           Display context        50
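
The K-tuple setting of 2 in the parameter list suggests that the initial comparison starts from exact two-residue words shared by the query and each database sequence. The sketch below shows the generic word-lookup idea only: it indexes the k-tuples of the query and reports where they recur in a subject, together with the diagonal (subject position minus query position) on which each hit lies. It is an illustration under that assumption, not FastDB's actual implementation, and the function name is hypothetical.

```python
from collections import defaultdict


def ktuple_hits(query, subject, k=2):
    """Return (query_pos, subject_pos, diagonal) for every shared k-tuple.

    Positions are 0-based; diagonal = subject_pos - query_pos, so hits on the
    same diagonal belong to the same ungapped alignment.
    """
    index = defaultdict(list)
    for i in range(len(query) - k + 1):
        index[query[i:i + k]].append(i)

    hits = []
    for j in range(len(subject) - k + 1):
        for i in index.get(subject[j:j + k], ()):
            hits.append((i, j, j - i))
    return hits


if __name__ == "__main__":
    # Query 3 of this search against an 8-residue Ro/SSA epitope.
    for hit in ktuple_hits("YPAFK", "KQAAFKAV"):
        print(hit)
```

Here the words AF and FK both fall on diagonal 1, which corresponds to the three-residue AFK match reported for the short epitope hits in this results file.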




SEARCH STATISTICS

Scores:     Mean            Median          Standard Deviation
            0               1               0.98

Times:      CPU             Total Elapsed
            00:00:08.90     00:00:08.00



Number of residues: 482836 
Number of sequences searched: 5543 
Number of scores above cutoff: 1885 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
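
The significance values in this report are consistent with a standard score: the number of standard deviations by which an initial score lies above the mean of all scores (the hit list is grouped under headings such as "4 standard deviations above mean"). A minimal sketch under that assumption, using the mean (0) and standard deviation (0.98) printed in the search statistics. Because those printed values are rounded, the results agree with the report only to about two decimal places; the function name is hypothetical.

```python
def significance(score, mean, stdev):
    """Standard score: how many standard deviations score lies above mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


if __name__ == "__main__":
    mean, stdev = 0.0, 0.98  # rounded values from SEARCH STATISTICS
    for score in (3, 4, 5):
        print(score, round(significance(score, mean, stdev), 2))
```

With the rounded inputs this gives roughly 3.06, 4.08 and 5.10, against the printed 3.07, 4.09 and 5.12; the small gap is what one would expect if the program used unrounded statistics.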

A 100% identical sequence to the query sequence was found:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score  Sig. Frame

  1. R37445  Autotaxin peptide ATX 20.          5     5     5  5.12     0

The list of other best scores is: 

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score  Sig. Frame

**** 4 standard deviations above mean ****
  2. R25466  Endoglucanase #2.                376     4     4  4.09     0
**** 3 standard deviations above mean ****
  3. R43439  Ro/SSA epitope 109.                8     3     3  3.07     0
  4. R43393  La/SSb epitope 144.                8     3     3  3.07     0
  5. R43440  Ro/SSA epitope 111.                9     3     3  3.07     0
  6. R43437  Ro/SSA epitope 105.               10     3     3  3.07     0
  7. R07105  Hepatitis B Virus PreS1 epito     11     3     3  3.07     0
  8. R43438  Ro/SSA epitope 106.               12     3     3  3.07     0
  9. R43392  La/SSb epitope 139.               13     3     3  3.07     0
 10. P91772  Mas oncongene angiotensin rec     16     3     3  3.07     0
 11. R43391  La/SSb epitope 136.               18     3     3  3.07     0
 12. R39830  E1 peptide RV-EP6, residues 1     20     3     3  3.07     0
 13. R26160  Tuberculosis antibody product     20     3     3  3.07     0
 14. R41297  Peptide fragment F10-2.           21     3     3  3.07     0
 15. R34509  Peptide Kl recognised by MAbi     21     3     3  3.07     0
 16. R39880  Lipopeptide TPRV-EP6.             23     3     3  3.07     0
 17. P20021  Sequence of a foot and mouth      67     3     3  3.07     0
 18. R24710  Sequence of a chimeric anti-h    112     3     3  3.07     0
 19. P83210  Sequence of the CH2 domain (r    116     3     3  3.07     0
 20. R25412  Heavy chain variable domain o    118     3     3  3.07     0
 21. R31022  Grass pollen allergen KBG 7.2    131     3     3  3.07     0
 22. R24712  Sequence encoded by the genom    132     3     3  3.07     0
 23. R27565  Part of lamB lambda receptor     134     3     3  3.07     0
 24. A48491  twitching motility protein P     135     3     3  3.07     0
 25. A49047  IgM monoclonal striational a     136     3     3  3.07     0
 26. R13304  CFTR 556 del A.                  151     3     3  3.07     0
 27. A30938  myosin regulatory light chai     161     3     3  3.07     0
 28. R23867  Pre-S gene region translation    174     3     3  3.07     0
 29. R23871  Pre-S gene region translation    174     3     3  3.07     0
 30. R23870  Pre-S gene region translation    174     3     3  3.07     0
 31. R23869  Pre-S gene region translation    174     3     3  3.07     0
 32. R23868  Pre-S gene region translation    174     3     3  3.07     0
 33. P90501  ND28 deriv. of G-CSF.            174     3     3  3.07     0
 34. R39392  Truncated tissue factor.         218     3     3  3.07     0
 35. R43675  Single chain polypeptide with    225     3     3  3.07     0
 36. R06478  TRY40.                           225     3     3  3.07     0
 37. R05710  TRY40.                           225     3     3  3.07     0
 38. R24811  Sequence encoded by the chim     239     3     3  3.07     0
 39. R44510  Type I iodothyronine 5' deiod    249     3     3  3.07     0
 40. R14258  gp75 peptide and fragments.      249     3     3  3.07     0



1. US-08-249-182-3 (1-5) 

R37445 Autotaxin peptide ATX 20. 

ID R37445 standard; peptide; 5 AA. 

AC R37445; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 20. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic.

PN US7822043-A.

PD 01-JAN-1993.

PF 17-JAN-1992; 822043.

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 20. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (ricin A) for cancer

CC therapy.

SQ Sequence 5 AA;

SQ 1 A; 0 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 1 K; 0 M; 1 F; 1 P; 0 S; 0 T; 0 W; 1 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq
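
Each database record in this printout is a flat file in which every line begins with a two-letter code (ID, AC, DT, DE, KW, OS, PN, PD, PF, PR, PA, PI, DR, PT, PS, CC, SQ) followed by its text. The sketch below groups the lines of one record by code so that multi-line fields such as KW or CC can be read back together. It assumes only that layout; the function name and the sample lines are illustrative and are not part of FindSeq or FastDB.

```python
def parse_record(lines):
    """Group the text of a flat-file record by its two-letter line code.

    Lines that do not look like "XX text" (blank lines, score lines,
    alignment rows) are skipped.
    """
    record = {}
    for raw in lines:
        line = raw.strip()
        code = line[:2]
        if len(line) < 3 or not (code.isalpha() and code.isupper()) or line[2] != " ":
            continue
        record.setdefault(code, []).append(line[3:].strip())
    return record


sample = [
    "ID R37445 standard; peptide; 5 AA.",
    "",
    "AC R37445;",
    "",
    "DE Autotaxin peptide ATX 20.",
    "",
    "KW Cell motility stimulating; cancer metastasis; antibody; detection;",
    "",
    "KW cancer therapy; crosslinked toxins.",
    "",
    "OS Synthetic.",
]

if __name__ == "__main__":
    record = parse_record(sample)
    print(record["ID"][0])
    print(" ".join(record["KW"]))
```

Grouping by line code also makes scan damage easier to spot, since a record with no DT or DE entry, or with an unknown code, stands out immediately.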

Initial Score = 5 Optimized Score = 5 Significance = 5.12 
Residue Identity = 100% Matches = 5 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK
|||||
YPAFK
X X 



2. US-08-249-182-3 (1-5) 

R25466 Endoglucanase #2. 

ID R25466 standard; Protein; 376 AA.

AC R25466;

DT 07-JAN-1993 (first entry)

DE Endoglucanase #2.

KW CMC-endoase; 43 kD cellulase; monoclonal antibody.

OS Humicola insolens.

PN EP-495257-A.

PD 22-JUL-1992.

PF 06-NOV-1991; 202879.

PR 16-JAN-1991; EP-870006.

PR 06-NOV-1991; EP-202880.

PR 06-NOV-1991; EP-202879.

PA (PROC ) PROCTER & GAMBLE CO.

PI Baeck AC, Busch A, Ceulemans RAA;

DR WPI; 92-243163/30.

DR N-PSDB; 926382.

PT Compact, granular detergent compsns. - contain high activity

PT cellulase and softening clay to provide synergistic effect in

PT softening performance

PS Disclosure; Page 23-25; 29pp; English.

CC The sequences given in R25464 and R25466 are endoglucanases which 

CC are immunoreactive with a monoclonal antibody raised against a

CC partially purified 43 kD cellulase derived from Humicola insolens.

CC These endoglucanases exhibit a CMC-endoase activity of at least

CC about 50, pref. at least about 60, in particular at least about 90 CMC-

CC endoase units per mg of total protein. These endoglucanases have

CC molecular weight of approx. 43 kD.

SQ Sequence 376 AA;

SQ 46 A; 7 R; 17 N; 22 D; 0 B; 19 C; 15 Q; 10 E; 0 Z; 34 G; 3 H;

SQ 6 I; 15 L; 33 K; 4 M; 11 F; 29 P; 38 S; 29 T; 6 W; 13 Y; 19 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:18-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 4.09
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

XX 
YPAFK 

||||

DSYPELLKDGCHWRFDWFENADKPDFTFEQVQCPKALLDISGCKRDDDSSFPAFKVDTSASKP9PSSSAKKT 
180 190 200 210 220 230 240



TSAAAAAQPQKTKDSAPVVQKSSTKPAAQPEPT 
250 260 270 280 



3. US-08-249-182-3 (1-5) 

R43439 Ro/SSA epitope 109. 



ID R43439 standard; peptide; 8 AA.

AC R43439;

DT 12-MAY-1994 (first entry)

DE Ro/SSA epitope 109.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribonucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic

PT lupus erythematosus

PS Claim 1; Page 30; 43pp; English.

CC The sequences given in R43391-562 are linear epitopes which are

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B'

CC polypeptide. These antigens are common in systemic lupus

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family

CC of proteins has been shown to have several molecular forms which are

CC defined by the molecular weight of the antigen identified. The major

CC form has a molecular weight of 60 kD and two additional forms have

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this

CC group of autoantibodies and binds small RNAs with a polyuridine

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 8 AA;

SQ 3 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 2 K; 0 M; 1 F; 0 P; 0 S; 0 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:35-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
  YPAFK
    |||
KQAAFKAV
X X 



4. US-08-249-182-3 (1-5) 

R43393 La/SSb epitope 144. 



ID R43393 standard; peptide; 8 AA. 
AC R43393;

DT 12-MAY-1994 (first entry)

DE La/SSb epitope 144.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribo:nucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic

PT lupus erythematosus

PS Claim 1; Page 30; 43pp; English.

CC The sequences given in R43391-562 are linear epitopes which are

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B'

CC polypeptide. These antigens are common in systemic lupus

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family

CC of proteins has been shown to have several molecular forms which are

CC defined by the molecular weight of the antigen identified. The major

CC form has a molecular weight of 60 kD and two additional forms have

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this

CC group of autoantibodies and binds small RNAs with a polyuridine

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 8 AA;

SQ 1 A; 1 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 1 H;

SQ 0 I; 1 L; 2 K; 0 M; 1 F; 0 P; 0 S; 1 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:34-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 60% Matches = 3 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
HI 

RTLHKAFK 
X X 



5. US-08-249-182-3 (1-5)

R43440 Ro/SSA epitope 111. 

ID R43440 standard; peptide; 9 AA. 

AC R43440;

DT 12-MAY-1994 (first entry)

DE Ro/SSA epitope 111.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819.

PA (OKLA ) UNIV OKLAHOMA STATE.

PI Harley JB;

DR WPI; 93-351658/44.

PT New linear epitope(s) for human auto-antibodies - from the

PT Ro/SSA, La/SSB and Sm B/B' antigens and ribonucleoprotein, used

PT for diagnosing and treating auto-immune disorders e.g. systemic

PT lupus erythematosus

PS Claim 1; Page 30; 43pp; English.

CC The sequences given in R43391-562 are linear epitopes which are

CC derived from the 60 kD Ro/SSA peptide, the La/SSB autoantigen,

CC the 70 kD nuclear ribonucleoprotein (nRNP) and the Sm B/B'

CC polypeptide. These antigens are common in systemic lupus

CC erythematosus (SLE) and closely related disorders. The Ro/SSA family

CC of proteins has been shown to have several molecular forms which are

CC defined by the molecular weight of the antigen identified. The major

CC form has a molecular weight of 60 kD and two additional forms have

CC molecular weights of 52 and 54 kD. La/SSB is also a member of this

CC group of autoantibodies and binds small RNAs with a polyuridine

CC terminus. La/SSB is bound by a third of the anti-Ro/SSA precipitin

CC positive sera. La/SSB has been shown to be a 46-50 kD monomeric

CC phosphoprotein which associates with RNA polymerase III transcripts.

CC Anti-Sm antibodies precipitate snRNPs containing the U1, U2, U4/U6 and

CC U5 RNA. Anti-Sm antibodies may be directed against one or a

CC combination of the polypeptides: B (26 kD), B' (27 kD), D (13 kD),

CC E/F (11 kD doublet) and G (less than 10 kD). These epitopes may be

CC used for preventing, treating or screening autoimmune disorders,

CC especially SLE or Sjogrens syndrome (SS). They bind to a human

CC autoantibody and may therefore be used as vaccines.

SQ Sequence 9 AA;

SQ 2 A; 0 R; 0 N; 0 D; 0 B; 1 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 1 K; 0 M; 1 F; 0 P; 1 S; 0 T; 0 W; 0 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:35-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 100% Matches = 3 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||

AFKAVSEVC 
X X 



6. US-08-249-182-3 (1-5) 

R43437 Ro/SSA epitope 105. 

ID R43437 standard; peptide; 10 AA.

AC R43437;

DT 12-MAY-1994 (first entry)

DE Ro/SSA epitope 105.

KW Linear; epitope; 60 kD; Ro/SSA; La/SSB; autoantigen; E/F; G; 70 kD;

KW nuclear ribonucleoprotein; nRNP; Sm B/B'; polypeptide; antigen; D;

KW systemic lupus erythematosus; SLE; autoantibody; U4/U6; U5; B; B';

KW RNA polymerase III; U1; U2; Sjogrens syndrome; SS; human; vaccine; ss.

OS Homo sapiens.

PN WO9321223-A.

PD 28-OCT-1993.

PF 13-APR-1993; U03484.

PR 13-APR-1992; US-867819. 

PA (OKLA ) UNIV OKLAHOMA STATE. 



Initial Score = 3 Optimized Score = 3 Significance = 3.07 
Residue Identity = 60X Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 



X X 
YPAFK 
|||

DGLRAQDDYNGWDINTPAFE 
10 X 20 



14. US-08-249-182-3 (1-5) 

R41297 Peptide fragment F10-2.

ID R41297 standard; Protein; 21 AA.

AC R41297;

DT 20-APR-1994 (first entry)

DE Peptide fragment F10-2.

KW Pyruvate dehydrogenase complex; PDC;

KW oxoglutarate dehydrogenase complex; OGDC;

KW monoclonal antibody; E2 enzyme; antigen.

PN J05227984-A.

PD 07-SEP-1993.

PF 25-FEB-1992; 037749.

PR 25-FEB-1992; JP-037749.

PA (SUMO ) SUMITOMO CHEM CO LTD.

PA (SUMU ) SUMITOMO SEIYAKU KK.

DR WPI; 93-31 W17/40.

PT Monoclonal antibody prepd. by culturing new hybridoma - used for

PT determn. of pyruvate dehydrogenase complex and oxo-glutarate

PT dehydrogenase complex in disease research

PS Disclosure; Page 9; 11pp; English.

CC Pyruvate dehydrogenase complex (PDC) and oxoglutarate dehydrogenase

CC complex (OGDC) may be detected using a monoclonal antibody specifically

CC recognising E2 enzyme contg. in PDC and OGDC.

SQ Sequence 21 AA;

SQ 2 A; 1 R; 1 N; 2 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 1 G; 0 H;

SQ 1 I; 0 L; 0 K; 0 M; 1 F; 1 P; 1 S; 0 T; 0 W; 0 Y; 2 V;

SQ 6 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:59:31-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07 
Residue Identity = 60X Matches = 3 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 



10 X 



15. US-08-249-182-3 (1-5) 

R34509 Peptide K1 recognised by MAb, for assaying LACI.

ID R34509 standard; peptide; 21 AA.
AC R34509;

DT 10-AUG-1993 (first entry)

DE Peptide K1 recognised by MAb, for assaying LACI.

KW LACI; lipoprotein associated coagulation inhibitor; EPI;

KW extrinsic pathway inhibitor; blood clotting; TFPI;

KW tissue factor pathway inhibitor; blood clotting; blood coagulation;

KW immunological assay; ELISA; testing; detection; diagnosis;

KW coronary thrombosis; pulmonary thrombosis; thrombotic disease;

KW microthrombi; monoclonal antibody; MAb.

PN EP-539975-A.

PD 05-MAY-1993.

PF 29-OCT-1992; 118497.

PR 31-OCT-1991; JP-311442.

PR 29-NOV-1991; JP-339560.

PA (TEIJ ) TEIJIN LTD.

PI Ichikawa Y, Koike Y, Suzuki K;

DR WPI; 93-145355/18.

PT Assay method for lipoprotein-associated coagulation inhibitor -

PT using monoclonal antibodies; used for diagnosis of thrombosis

PS Claim 1; Page 12; 17pp; English.

CC This sequence (K1) is recognised by a monoclonal antibody used in an

CC immunological assay for free lipoprotein associated coagulation

CC inhibitor (LACI). The MAb is used together with another MAb

CC recognising peptide K3 (R34508).

SQ Sequence 21 AA;

SQ 3 A; 1 R; 1 N; 2 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 1 G; 0 H;

SQ 2 I; 0 L; 3 K; 1 M; 5 F; 1 P; 0 S; 0 T; 0 W; 0 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:03-PDT using FindSeq

Initial Score = 3 Optimized Score = 3 Significance = 3.07
Residue Identity = 100% Matches = 3 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||

AFKADDGPCKAIMKRFFFNIF
X X 10 20 

> O < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file us-08-249-182-4.res made by on Wed 21 Sep 94 12:08:17-PDT.

Query sequence being compared: US-08-249-182-4 (1-5)
Number of sequences searched: 5543
Number of scores above cutoff: 2112

Results of the initial comparison of US-08-249-182-4 (1-5) with:
File : /home/shears/loring/lorin*.pep

[Histogram: NUMBER OF SEQUENCES (log scale, ticks at 0, 10, 50, 100, 500, 1000, 5000, 10000) plotted against SCORE (0 to 5) and STDEV (1 to 4); plotted points not recoverable.]



PARAMETERS

Similarity matrix         Unitary    K-tuple                2
Mismatch penalty          1          Joining penalty        20
Gap penalty               1.00       Window size            5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40         Alignments to save     15
Optimized scores to save  0          Display context        50




SEARCH STATISTICS

Scores:     Mean           Median          Standard Deviation
            0              1               1.07

Times:      CPU            Total Elapsed
            00:00:08.90    00:00:09.00

Number of residues:               482836
Number of sequences searched:     5543
Number of scores above cutoff:    2112



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
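
The significance values in this listing are consistent with a z-score, that is, the initial score minus the mean score, divided by the standard deviation. The following Python sketch assumes that definition (the exact FastDB formula is not given in this printout) and uses the mean of 0 and standard deviation of 1.07 reported under SEARCH STATISTICS. The function name `significance` is illustrative only.

```python
def significance(score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean.

    Assumed definition; it reproduces the Sig. column of this listing.
    """
    return (score - mean) / stdev


# Values reported under SEARCH STATISTICS for this search.
MEAN, STDEV = 0.0, 1.07

for initial_score in (5, 4, 3):
    print(initial_score, f"{significance(initial_score, MEAN, STDEV):.2f}")
```

Under these assumptions, initial scores of 5, 4 and 3 give 4.67, 3.74 and 2.80, which are the values shown in the Sig. column.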

A 100% identical sequence to the query sequence was found:

                                                       Init.   Opt.
    Sequence Name Description                  Length  Score  Score   Sig. Frame

 1. R37446     Autotaxin peptide ATX 24.            5      5      5   4.67     0

4 100% similar sequences to the query sequence were found:

                                                       Init.   Opt.
    Sequence Name Description                  Length  Score  Score   Sig. Frame

 2. R26150     A5B7 gH-2 antibody grafted he      146      5      5   4.67     0
 3. R20793     CDR-grafted, humanised heavy       146      5      5   4.67     0
 4. R41469     MAb 25D2 humanised heavy chai      140      5      5   4.67     0
 5. R40179     Humanised antibody CMX5-3 hea      135      5      5   4.67     0

The list of other best scores is:

                                                       Init.   Opt.
    Sequence Name Description                  Length  Score  Score   Sig. Frame

               **** 3 standard deviations above mean ****

 6. R41113     HCV peptide XIV or HCV8 (aa 1       22      4      4   3.74     0
 7. R41186     HCV NS4 protein HCV7/8.             26      4      4   3.74     0
 8. R40053     Hib OMP P1 peptide HIBP1-1 (1       30      4      4   3.74     0
 9. R13186     Peptide VII immunoreactive wi       38      4      4   3.74     0
10. R06408     HTLV-1 corresponding peptide        38      4      4   3.74     0
11. R13185     Peptide (V) immunoreactive wi       40      4      4   3.74     0
12. R13591     HTLV-I env precursor epitope        40      4      4   3.74     0
13. R33871     Polypeptide p1684 comprising        67      4      4   3.74     0
14. R33872     Polypeptide p1689 comprising       117      4      4   3.74     0
15. R11717     ENV93/HTLV-1-II fusion protei      217      4      4   3.74     0
16. R39336     scFv fragment encoded by pLIS      277      4      4   3.74     0
17. R11713     ENV93/HTLV-1-II+I fusion prot      344      4      4   3.74     0
18. R23999     Open reading frame of the hep      363      4      4   3.74     0
19. P70416     Polypeptide with IgE binding       557      4      4   3.74     0
20. R26207     Human serum albumin.               585      4      4   3.74     0
21. P70417     Polypeptide with IgE binding       775      4      4   3.74     0
22. FRZE_MYXXA GLIDING MOTILITY REGULATORY P      777      4      4   3.74     0
23. R04574     Derived amino acid sequence o      810      4      4   3.74     0
24. P98500     Partial sequence encoded by a     1091      4      4   3.74     0
25. R28582     HCV amino acid sequence contg     2436      4      4   3.74     0
26. R24440     Composite HCV HC-J1/CDC/CHI p     2894      4      4   3.74     0
27. R31621     Hepatitis C virus (HCV) polyp     3011      4      4   3.74     0
28. P90373     Sequence encoded by human mus     3685      4      4   3.74     0

               **** 2 standard deviations above mean ****

29. R41316     PEP (84-91).                         8      3      3   2.80     0
30. R36072     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
31. R36071     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
32. R36070     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
33. R36069     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
34. R36068     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
35. R36067     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
36. R36024     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
37. R36023     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
38. R36022     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
39. R36021     Hepatitis C virus (HCV) epito        8      3      3   2.80     0
40. R36020     Hepatitis C virus (HCV) epito        8      3      3   2.80     0




1. US-08-249-182-4 (1-5) 

R37446 Autotaxin peptide ATX 24. 

ID R37446 standard; peptide; 5 AA.

AC R37446;

DT 22-JUL-1993 (first entry)

DE Autotaxin peptide ATX 24.

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic.

PN US7822043-A.

PD 01-JAN-1993.

PF 17-JAN-1992; 822043.

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer

PT diagnosis and therapy

PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 24. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 5 AA;

SQ 1 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 1 S; 0 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.67
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS

|||||
QAEVS
X X 
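
Each record carries two SQ composition lines that count the residues in a fixed order (A R N D B C Q E Z G H, then I L K M F P S T W Y V). The sketch below shows one way such lines could be regenerated from a sequence, which is also a handy check on a scanned record. The helper names `composition` and `sq_lines` are hypothetical and the output spacing is an assumption, not the database's own formatter.

```python
from collections import Counter

# Residue order as it appears on the SQ composition lines of these records.
ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(seq: str) -> dict:
    """Count each residue of ORDER in seq (absent residues count as 0)."""
    counts = Counter(seq.upper())
    return {aa: counts.get(aa, 0) for aa in ORDER}


def sq_lines(seq: str) -> list:
    """Build the three SQ lines for a sequence (layout is an assumption)."""
    comp = composition(seq)
    first = " ".join(f"{comp[aa]} {aa};" for aa in ORDER[:11])
    second = " ".join(f"{comp[aa]} {aa};" for aa in ORDER[11:])
    return [f"SQ Sequence {len(seq)} AA;", f"SQ {first}", f"SQ {second}"]


for line in sq_lines("QAEVS"):
    print(line)
```

For the five-residue query QAEVS this yields one each of A, Q, E, S and V, and the counts sum to the stated sequence length. A mismatch between the summed counts and the length is a quick sign that a scanned line has been misread.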



2. US-08-249-182-4 (1-5) 

R26150 A5B7 gH-2 antibody grafted heavy chain. 

ID R26150 standard; Protein; 146 AA. 

AC R26150; 

DT 03-FEB-1993 (first entry) 

DE A5B7 gH-2 antibody grafted heavy chain. 

KW humanised antibody; chimaeric; carcino-embryonic antigen; therapy;

KW diagnosis; carcinomas; CDR; IgG; human; murine; ss.

OS Chimaeric.

FH Key Location/Qualifiers

FT Region 26..35

FT /note= "grafted murine CDR1"

FT Region 50..65

FT /note= "grafted murine CDR2"

FT Region 95..102

FT /note= "grafted murine CDR3"

PN WO9201059-A.

PD 23-JAN-1992.

PF 05-JUL-1991; G01108.

PR 05-JUL-1990; GB-014932.

PR 21-DEC-1990; WO-G02017.

PR 05-JUL-1991; WO-G01108.

PA (CELL-) CELLTECH LTD.

PI Adair JR, Bodmer MW, Mountain A, Owens RJ;

DR WPI; 92-284316/34.

DR N-PSDB; Q27354.

PT Humanised antibody molecules - comprising murine and human regions,

PT specific for carcino-embryonic antigen, useful for diagnosis and

PT therapy

PS Example 4; Figure 10; 71pp; English.

CC This sequence is CDR-grafted A5B7 human antibody having

CC murine CDRs at amino acids 26-35 (CDR1), 50-65 (CDR2), and 95-102

CC (CDR3) and additional murine framework residues at 1, 48, 49, 72,

CC 73, 76, and 93. The LAY framework was chosen when making the coding

CC construct (Q27354) as it shows the highest homology to A5B7. The

CC antibody has specificity for carcinoembryonic antigen, produced by

CC [illegible] useful in [illegible] therapy and diagnosis of

CC certain carcinomas.

SQ Sequence 146 AA;

SQ 7 A; 7 R; 4 N; 4 D; 0 B; 2 C; 6 Q; 6 E; 0 Z; 18 G; 1 H;

SQ 3 I; 14 L; 6 K; 3 M; 9 F; 3 P; 16 S; 13 T; 5 W; 9 Y; 10 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:22-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.67
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS

|||||

QAPGKGLEWLGFIGNKANGYTTEYSASVKGRFTISRDKSKSTLYLQMNGLQAEVSAIYYCTRDRGLRFYFDY 
60 70 80 90 100 110 X 120 

WGQGTLVTVSSASTKGP 
130 140 



3. US-08-249-182-4 (1-5) 

R20793 CDR-grafted, humanised heavy chain gH1.

ID R20793 standard; Protein; 146 AA.

AC R20793;

DT 19-MAY-1992 (first entry)

DE CDR-grafted, humanised heavy chain gH1.

KW murine monoclonal antibody; MAb; A5B7; humanised antibody; CEA;

KW complementarity determining region.

OS Homo sapiens.

OS Mus musculus.

FH Key Location/Qualifiers

FT Peptide 1..19

FT /label= signal

FT Protein 20..146

FT /label= VH

FT /note= "human LAY framework with A5B7 CDRs"

FT Region 45..54

FT /label= CDR1

FT /note= "murine residues"

FT Region 69..87

FT /label= CDR2

FT /note= "murine residues"

FT Region 120..129

FT /label= CDR3

FT /note= "murine residues"

FT Misc_difference 20

FT /note= "murine residue"

FT Misc_difference 67..68

FT /note= "murine residues"

FT Misc_difference 94..95

FT /note= "murine residues"

FT Misc_difference 98

FT /note= "murine residue"

FT Misc_difference 118

FT /note= "murine residue"

PN WO9201059-A.

PD 23-JAN-1992.

PF 05-JUL-1991; G01108.

PR 05-JUL-1990; GB-014932.

PR 21-DEC-1990; WO-G02017.

PR 05-JUL-1991; WO-G01108.

PA (CELL-) CELLTECH LTD.

PI Adair JR, Bodmer MW, Mountain A, Owens RJ;

DR WPI; 92-056874/07.

PT New CDR-grafted anti carcinoembryonic antigen antibodies - useful

PT in therapy and diagnosis of carcinoma

PS Claim 14; Fig 10; 70pp; English.

CC This heavy chain sequence comprises a human framework (i.e. the LAY

CC region) which contains murine sequences (from the murine anti-CEA

CC A5B7 MAb) in the CDRs and at other positions predicted to be

CC important for antigen-binding and at which human and A5B7 sequences

CC differ. (See Q20984 for A5B7 heavy chain coding sequence).

SQ Sequence 146 AA;

SQ 7 A; 7 R; 4 N; 4 D; 0 B; 2 C; 6 Q; 6 E; 0 Z; 18 G; 1 H;

SQ 3 I; 14 L; 6 K; 3 M; 9 F; 3 P; 16 S; 13 T; 5 W; 9 Y; 10 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:50-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.67
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS

|||||

QAPGKGLEWLGFIGNKANGYTTEYSASVKGRFTISRDKSKSTLYLQMNGLQAEVSAIYYCTRDRGLRFYFDY
60 70 80 90 100 110 X 120

WGQGTLVTVSSASTKGP
130 140 



4. US-08-249-182-4 (1-5) 

R41469 MAb 25D2 humanised heavy chain. 

ID R41469 standard; Protein; 140 AA.

AC R41469;

DT 03-MAR-1994 (first entry)

DE MAb 25D2 humanised heavy chain.

KW Heavy; VH; light; VL; chain; variable region; antihuman; interleukin-4;

KW IL-4; monoclonal antibody; MAb; 25D2; single chain binding protein;

KW complementarity determining region; CDR; humanised; Fv region; BABS;

KW antagonist; polymerase chain reaction; PCR; primer; amplify; gamma4;

KW pSV.SPORT.

OS Rattus rattus.

OS Homo sapiens.

FH Key Location/Qualifiers

FT Peptide 1..19

FT /note= "Leader sequence"

FT Protein 20..140

FT /note= "25D2 H chain"

PN WO9317106-A.

PD 02-SEP-1993.

PF 18-FEB-1993; U01301.

PR 19-FEB-1992; US-841659.

PA (SCHE ) SCHERING CORP.

PI Abrams JS, Dalie B, Le HV, Miller K, Murgolo NJ;

PI Nguyen H, Pearce M, Tindall S, Zavodny PJ;

DR WPI; 93-288412/36.

DR N-PSDB; Q48350.

PT Monoclonal antibodies against human interleukin-4 corresp. DNA

PT and CDRs - are useful for detection of interleukin-4 and treatment

PT of related diseases

PS Example 9; Page 89-90; 114pp; English.

CC This sequence represents the humanised heavy (H) chain of the antihuman

CC interleukin-4 (IL-4) monoclonal antibody (MAb) 25D2. The 25D2 H

CC chain coding region was cloned in three fragments using the primers

CC given in Q48351-60. The amplified fragments were designed to

CC contain silent restriction sites; however several codons had to

CC be changed to incorporate further restriction sites. The primers

CC [illegible] were used to amplify the entire H chain variable

CC region (VH) of an unrelated humanised antibody. The amplified fragments

CC were then cloned into pSV.Sport which already contained the 25D2 H

CC chain fragments. The primers given in Q48367-72 were used in

CC further manipulations to amplify a human gamma4 constant region cDNA

CC which was used to replace the genomic DNA. The humanised MAb is an

CC IL-4 antagonist. It may be used in a pharmaceutical composition for

CC detecting, measuring and immunopurifying human IL-4 and blocking IL-4

CC activity in IL-4-related diseases.

SQ Sequence 140 AA;

SQ 9 A; 7 R; 4 N; 5 D; 0 B; 3 C; 6 Q; 4 E; 0 Z; 16 G; 2 H;

SQ 6 I; 12 L; 2 K; 3 M; 7 F; 4 P; 18 S; 8 T; 5 W; 9 Y; 10 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:27-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.67
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

VRQAPGKGLEWVASISISGDNTYYPDSVRGRFTISRNDSKNTLYLQMNGLQAEVSAIYYCARDPYYFSGHYF
60 70 80 90 100 X 110 120

DFWGQGTLVTVSS
130 140 
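
The records in this printout use two-letter line codes (ID, AC, DT, DE, KW, OS, PN, PD, PF, PR, PA, PI, DR, PT, PS, CC, SQ) followed by free text. A minimal sketch of grouping such lines by code is shown below. The function name `parse_record` is hypothetical, and the rule used to recognise a coded line (two capital letters followed by a space) is an assumption based only on the records shown here.

```python
from collections import defaultdict


def parse_record(text: str) -> dict:
    """Group the lines of one record by their two-letter line code.

    Lines without a two-capital-letter code followed by a space
    (alignments, rulers, score summaries, blank lines) are skipped.
    """
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.rstrip()
        if len(line) < 4 or line[2] != " ":
            continue
        code, value = line[:2], line[3:].strip()
        if code.isalpha() and code.isupper():
            fields[code].append(value)
    return dict(fields)


sample = """ID R41469 standard; Protein; 140 AA.
AC R41469;
DE MAb 25D2 humanised heavy chain.
OS Rattus rattus.
OS Homo sapiens.
Initial Score = 5 Optimized Score = 5 Significance = 4.67
60 70 80 90 100 X 110 120
"""

record = parse_record(sample)
print(record["AC"], record["OS"])
```

Repeated codes such as OS, PR or CC are kept as lists in their original order, so multi-line comments and multiple priority entries can be joined later as needed.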



5. US-08-249-182-4 (1-5) 

R40179 Humanised antibody CMX5-3 heavy chain variable reg 

ID R40179 standard; Protein; 135 AA.

AC R40179;

DT 14-FEB-1994 (first entry)

DE Humanised antibody CMX5-3 heavy chain variable region.

KW Primer; polymerase chain reaction; amplify; PCR; human; kappa; L;

KW constant region; heavy; H; chain; pUC19; humanised; antibody;

KW light; REI; VL3 fragment; CMX5-1; CMX5-3.

OS Synthetic.

FH Key Location/Qualifiers

FT Peptide 1..19

FT /note= "Secretory leader peptide"

FT Protein 20..135

FT /note= "CMX5-3 heavy chain variable region"

PN WO9316184-A.

PD 19-AUG-1993.

PF 04-FEB-1993; U00759.

PR 06-FEB-1992; US-832842.

PA (SCHE ) SCHERING CORP.

PI Abrams JS, Chou C, Jenh C, Murgolo NJ, Petro ME;

PI Silver JE, Tindall S, Windsor WT, Zavodny PJ;

DR WPI; 93-272888/34.

PT Humanised monoclonal antibody - comprises variable animal region

PT and constant human region, binds to human interleukin-5

PS Example; Page 91-92; 118pp; English.

CC The sequences given in R40179-80 represent the variable regions of

CC the heavy and light chains of the humanised antibody CMX5-3

CC respectively. These sequences were based on the humanised antibody

CC CMX5-1. These sequences were generated using the primer sequences

CC given in Q48068-71. These primers were based on sequences derived

CC from antibody JES1-39D10 and human LAY VH framework sequences. The

CC amplification products were used to replace the VH1 and VH3 fragments

CC of CMX5-1 H chain cDNA in pSV.Sport (see also R40175).

SQ Sequence 135 AA;

SQ 9 A; 5 R; 7 N; 4 D; 0 B; 3 C; 6 Q; 5 E; 0 Z; 15 G; 1 H;

SQ 7 I; 15 L; 3 K; 2 M; 4 F; 2 P; 17 S; 8 T; 5 W; 7 Y; 10 V;



CC and/or HTLV-I or II. 

SQ Sequence 22 AA; 

SQ 5 A; 1 R; 0 N; 0 D; 0 B; 0 C; 3 Q; 1 E; 0 Z; 1 G; 0 H;

SQ 1 I; 3 L; 1 K; 0 M; 0 F; 1 P; 1 S; 1 T; 0 W; 0 Y; 1 V;

SQ 2 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:59:30-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS

||||

XQKALGLLQTASRQAEVIAPAX 
10 X 20 
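
The Matches, Mismatches and Residue Identity figures reported with each alignment can be reproduced by a simple ungapped comparison under a unitary (identity) matrix. The sketch below is a brute-force illustration and not the FastDB k-tuple algorithm: it slides the query along the target, allows the query to overhang either end, and keeps the placement with the most identical residues. The names `best_ungapped` and `identity_percent` are hypothetical.

```python
def best_ungapped(query: str, target: str) -> tuple:
    """Return (matches, mismatches) for the best ungapped placement.

    Only overlapping positions are counted; ties on matches are broken
    in favour of fewer mismatches.
    """
    best = (0, 0)
    for offset in range(-len(query) + 1, len(target)):
        matches = mismatches = 0
        for i, residue in enumerate(query):
            j = offset + i
            if 0 <= j < len(target):
                if residue == target[j]:
                    matches += 1
                else:
                    mismatches += 1
        if (matches, -mismatches) > (best[0], -best[1]):
            best = (matches, mismatches)
    return best


def identity_percent(matches: int, mismatches: int) -> int:
    """Identity as matches over compared positions, in percent."""
    compared = matches + mismatches
    return round(100 * matches / compared) if compared else 0


m, mm = best_ungapped("QAEVS", "XQKALGLLQTASRQAEVIAPAX")
print(m, mm, identity_percent(m, mm))
```

For the query QAEVS against XQKALGLLQTASRQAEVIAPAX the best placement is over QAEVI, giving 4 matches, 1 mismatch and 80% identity, in line with the figures reported for this hit.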



7. US-08-249-182-4 (1-5) 

R41186 HCV NS4 protein HCV7/8. 

ID R41186 standard; peptide; 26 AA. 

AC R41186; 

DT 22-MAR-1994 (first entry)

DE HCV NS4 protein HCV7/8.

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine.

OS Synthetic.

FH Key Location/Qualifiers

FT Region 10..16

FT /label= epitope

PN WO9318054-A.

PD 16-SEP-1993.

PF 08-MAR-1993; E00517.

PR 06-MAR-1992; EP-400598.

PA (INNO-) INNOGENETICS NV SA.

PI De LEYS R;

DR WPI; 93-303397/38.

PT New biotinylated peptide(s) corresp. to immuno-dominant

PT epitope(s) - with increased antigenicity, useful in antibodies

PT detection and vaccines against hepatitis C, HIV and HTLV

PS Disclosure; Page 79; 133pp; English.

CC Peptide compsns. comprise at least one and pref. a combination of

CC two, three, four or more biotinylated peptides chosen from the

CC sequences given in R41058-R41166.

CC The peptides may be hybrids consisting of combinations of the core

CC epitopes of the HCV core (R41171-R41180), HCV NS4 (R41181-R41186) or

CC the HCV NS5 (R41187-R41193) region separated by Gly and/or Ser residues.

CC Pref. hybrid peptides are given in R41161-R41163.

CC The peptides represent immunologically important regions of viral

CC proteins and are prepd. by solid phase peptide synthesis. The compsns.

CC are useful for the detection of antibodies to HCV, and/or HIV,

CC and/or HTLV-I or II.

SQ Sequence 26 AA;

SQ 6 A; 1 R; 0 N; 0 D; 0 B; 0 C; 4 Q; 2 E; 0 Z; 1 G; 0 H;

SQ 1 I; 4 L; 2 K; 0 M; 1 F; 1 P; 1 S; 1 T; 0 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:29-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
QAEVS 



CC Retrieved by shears on Wed 21 Sep 94 [time illegible]-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.67
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

WIRQAPGKGLEWVALIWSNGDTDYNSAIKSRFTISRNDSKNTLYLQMNGLQAEVSAIYFCAREYYGYFDYWG
60 70 80 90 100 X 110 120 

QGTLVTVSS 
130 



6. US-08-249-182-4 (1-5) 

R41113 HCV peptide XIV or HCV8 (aa 1730-1749). 

ID R41113 standard; peptide; 22 AA. 

AC R41113; 

DT 22-MAR-1994 (first entry) 

DE HCV peptide XIV or HCV8 (aa 1730-1749). 

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine.

OS Synthetic.

FH Key Location/Qualifiers

FT Modified_site 1

FT /note= "the N-terminal comprises (A)-(B)-(X)-Y; where

FT B= biotin;

FT X= biotinylation cpd. incorporated

FT during synthesis;

FT Y= bond or linking gp(s). which

FT minimises steric hindrance,

FT where Y is not a bond it is pref. 1-10

FT residues of (same or different) glycine,

FT beta-alanine, 4-aminobutyric acid,

FT 5-aminovaleric acid or 6-aminohexanoic acid;

FT parenthesis around B and X indicate opt. presence

FT at the specified positions but B or X must be

FT present in at least one of the positions shown,

FT B interacts with the peptide to give a cpd.

FT with greater diagnostic sensitivity;

FT A (optional), one or more amino acids, NH2 or

FT gp. which modifies the N-terminus;

FT Z= one or more amino acids, OH, NH2, or a

FT linkage involving either of these 2 gps."

FT Modified_site 22

FT /note= "the C-terminal comprises Y-(X)-Z"

PN WO9318054-A.

PD 16-SEP-1993.

PF 08-MAR-1993; E00517.

PR 06-MAR-1992; EP-400598.

PA (INNO-) INNOGENETICS NV SA.

PI De LEYS R;

DR WPI; 93-303397/38.

PT New biotinylated peptide(s) corresp. to immuno-dominant

PT epitope(s) - with increased antigenicity, useful in antibodies

PT detection and vaccines against hepatitis C, HIV and HTLV

PS Claim 4; Page 90-98; 133pp; English.

CC Peptide compsns. comprise at least one and pref. a combination of

CC two, three, four or more biotinylated peptides chosen from the

CC sequences given in R41058-R41166. The peptides represent 

CC immunologically important regions of viral proteins and are 

CC prepd. by solid phase peptide synthesis. The compsns. are 



||||

LAEQFKQKALGLLQTASRQAEVIAPA
10 20 X 



8. US-08-249-182-4 (1-5) 

R40053 Hib OMP P1 peptide HIBP1-1 (1-29).

ID R40053 standard; peptide; 30 AA.

AC R40053;

DT 04-FEB-1994 (first entry)

DE Hib OMP P1 peptide HIBP1-1 (1-29).

KW Haemophilus influenzae; type b; Hib; outer membrane protein; P1; P2;

KW P6; vaccine; antibody; detection; lipoglycopeptide conjugate;

KW immunogen.

OS Synthetic.

FH Key Location/Qualifiers

FT Misc_difference 30

FT /note= "May be absent"

PN WO9315205-A.

PD 05-AUG-1993.

PF 03-FEB-1993; CA0041.

PR 03-FEB-1992; GB-002219.

PA (CONN-) CONNAUGHT LAB LTD.

PI Chong P, Kandil A, Klein MH, Sia C;

DR WPI; 93-258681/32.

PT Synthetic Haemophilus influenzae conjugate vaccine - comprising

PT T-helper cell determinants and B-cell epitope(s) linked to

PT synthetic oligosaccharide(s)

PS Table 1; Page 47; 99pp; English.

CC The sequences given in R40053-101 are peptide fragments derived from

CC the Haemophilus influenzae type b (Hib) outer membrane proteins P1,

CC P2 and P6. These peptides may be used in a vaccine against Hib

CC infection and antibodies against these peptides may be used in test

CC kits to detect H. influenzae in a sample. The vaccine may further

CC comprise a immunogenic or immunostimulatory molecule or the peptides

CC may be modified with lipids, or linked to synthetic PRP as synthetic

CC lipoglycopeptide conjugates to produce alternative vaccines.

SQ Sequence 30 AA;

SQ 9 A; 1 R; 1 N; 1 D; 0 B; 1 C; 1 Q; 2 E; 0 Z; 3 G; 0 H;

SQ 1 I; 2 L; 0 K; 0 M; 1 F; 0 P; 3 S; 1 T; 0 W; 1 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:22-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
||||

AAFQLAEVSTSGLGRAYAGEAAIADNASVC
X 10 20 30 



9. US-08-249-182-4 (1-5) 

R13186 Peptide VII immunoreactive with anti-HTLV antibodi 

ID R13186 standard; Protein; 38 AA. 

AC R13186; 

DT 09-OCT-1991 (first entry)

DE Peptide VII immunoreactive with anti-HTLV antibodies. 

KW human T-cell leukaemia virus; AIDS; ATL; detection; 

KW envelope protein gp[illegible]; acquired immunodeficiency syndrome.

OS Synthetic. 

PN EP-439077-A. 

PD 31-JUL-1991. 



||||

PAIIPDREVLYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAV
10 20 30 40 50 60 X



14. US-08-249-182-4 (1-5)

R33872 Polypeptide p1689 comprising HCV viral antigen.

ID R33872 standard; peptide; 117 AA.

AC R33872;

DT 19-JUL-1993 (first entry)

DE Polypeptide p1689 comprising HCV viral antigen.

KW Hepatitis C virus; NANBH; assay; antibody; p380-JH1; p380-J; p380LG;

KW p408.

OS Synthetic.

PN WO9306247-A.

PD 01-APR-1993.

PF 16-SEP-1992; U07813.

PR 16-SEP-1991; US-760292.

PA (ABBO ) ABBOTT LAB.

PI Lesniewski RR, Leung TK;

DR WPI; 93-117563/14.

PT Assay for detecting presence of antibody to hepatitis C viral

PT antigen - by contacting sample with polypeptide contg. at least

PT one epitope of virus antigen

PS Disclosure; Page 13; 63pp; English.

CC The synthetic peptide p1689 represents amino acid residues 1689-1805 of

CC the hepatitis C viral antigen. The peptide may be used in an assay to

CC detect antibodies to HCV and thus to diagnose chronic HCV infection.

CC See also R33861-87.

SQ Sequence 117 AA;

SQ 14 A; 3 R; 3 N; 2 D; 0 B; 1 C; 10 Q; 9 E; 0 Z; 6 G; 2 H;

SQ 7 I; 12 L; 5 K; 5 M; 5 F; 7 P; 8 S; 8 T; 3 W; 3 Y; 4 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:46-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS

||||

PAIIPDREVLYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAVQTNWQKLETFWAK
10 20 30 40 50 X 60 70

HMWNFISGIQYLAGLSTLPGNPAIASLMAFTAA
80 90 100 



15. US-08-249-182-4 (1-5)

R11717 ENV93/HTLV-1-II fusion protein.

ID R11717 standard; Protein; 217 AA.

AC R11717;

DT 27-JUN-1991 (first entry)

DE ENV93/HTLV-1-II fusion protein.

KW Human T-cell leukaemia virus; HTLV-1; fusion protein; antibodies.

OS Human T-cell leukaemia virus.

FH Key Location/Qualifiers

FT Region 1..104

FT /label= ENV93 epitope

FT Region 105..217

FT /label= HTLV-1-II epitope 

PN EP-424748-A. 

PD 02-MAY-1991. 



PA (TRIT-) TRITON BIOSCIENCES. 

PI Akita RW, Florine DL, Ralston JS;

DR WPI; 91-221557/30.

PT Synthetic peptide(s) and antibodies corresp. to an epitope of

PT HTLV-I - used in diagnosis, therapy, prepn. of vaccines and

PT prognostic indicators of HTLV-I infection

PS Disclosure; Page 3; 10pp; English.

CC The peptide has specific binding affinity for 0.5alpha monoclonal

CC antibody. It represents an epitopic site on the major HTLV-I

CC envelope precursor, i.e. it corresponds to residues 281-320. The

CC peptide and its antibody can be used in diagnosing the presence of

CC HTLV-I associated diseases, as vaccines against HTLV-I infection or

CC as prognostic indicators after HTLV-I infection.

CC See also R13077, R13590 and R13591.

SQ Sequence 40 AA;

SQ 3 A; 3 R; 0 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 3 I; 5 L; 0 K; 0 M; 1 F; 6 P; 7 S; 1 T; 1 W; 0 Y; 5 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:30-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
QAEVS 
|| ||

IQAIVSSPCHCSLILPPFSLSPVPTLGSRSRRAVPVAVWL
X X 10 20 30 40 



13. US-08-249-182-4 (1-5) 

R33871 Polypeptide p1684 comprising HCV viral antigen.

ID R33871 standard; peptide; 67 AA.

AC R33871;

DT 19-JUL-1993 (first entry)

DE Polypeptide p1684 comprising HCV viral antigen.

KW Hepatitis C virus; NANBH; assay; antibody; p380-JH1; p380-J; p380LG;

KW p408.

OS Synthetic.

PN WO9306247-A.

PD 01-APR-1993.

PF 16-SEP-1992; U07813.

PR 16-SEP-1991; US-760292.

PA (ABBO ) ABBOTT LAB.

PI Lesniewski RR, Leung TK;

DR WPI; 93-117563/14.

PT Assay for detecting presence of antibody to hepatitis C viral

PT antigen - by contacting sample with polypeptide contg. at least

PT one epitope of virus antigen

PS Disclosure; Page 13; 63pp; English.

CC The synthetic peptide p1684 represents amino acid residues 1684-1750 of

CC the hepatitis C viral antigen. The peptide may be used in an assay to

CC detect antibodies to HCV and thus to diagnose chronic HCV infection.

CC See also R33861-87.

SQ Sequence 67 AA;

SQ 7 A; 4 R; 0 N; 2 D; 0 B; 1 C; 6 Q; 8 E; 0 Z; 4 G; 1 H;

SQ 4 I; 7 L; 3 K; 3 M; 2 F; 4 P; 3 S; 1 T; 0 W; 2 Y; 5 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:46-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 



1 
0 



X X 
QAEVS 
|| ||

CFDPQIQAIVSSPCHNSLILPPFSLSPVPTLGSRSRRA 
X 10 20 30 



11. US-08-249-182-4 (1-5)

R13185 Peptide (V) immunoreactive with anti-HTLV antibodi

ID R13185 standard; Protein; 40 AA.

AC R13185;

DT 09-OCT-1991 (first entry)

DE Peptide (V) immunoreactive with anti-HTLV antibodies.

KW human T-cell leukaemia virus; AIDS; ATL; detection;

KW envelope protein gp61; acquired immunodeficiency syndrome.

OS Synthetic.

PN EP-439077-A.

PD 31-JUL-1991.

PF 18-JAN-1991; 100616.

PR 24-JAN-1990; US-469291.

PA (UNBI-) UNITED BIOMEDICAL.

PI Wang CY;

DR WPI; 91-224505/31.

PT Peptide compsns. corresp. to envelope fragments of HTLV-1,2 - for

PT detecting antibodies to these viruses and diagnosing HIV and

PT adult T-cell leukaemia infections

PS Claim 1; Page 17; 27pp; English.

CC This peptide is one of 16 peptides useful for detecting antibodies to

CC HTLV or HIV viruses. The peptides correspond to partial sequences of

CC the HTLV virus designated gp21 and gp64, both part of gp61, which

CC defines the envelope protein of the HTLV-I or HTLV-II virus. The

CC peptides can be amidated at the C-terminal. This particular peptide

CC is used in a composition with at least two of the other peptides of

CC the invention. See also R13184, R13186-R13192 and R13861-6.

SQ Sequence 40 AA;

SQ 3 A; 0 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 0 G; 3 H;

SQ 2 I; 6 L; 0 K; 0 M; 2 F; 7 P; 5 S; 3 T; 1 W; 1 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:31-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
QAEVS 
|| ||

SSTPLLYPSLALPAPHLTLPFNWTHCFDPQIQAIVSSPCH 
10 20 30 X X 40 



12. US-08-249-182-4 (1-5) 

R13591 HTLV-I env precursor epitope peptide. 

ID R13591 standard; Protein; 40 AA. 

AC R13591; 

DT 03-OCT-1991 (first entry)

DE HTLV-I env precursor epitope peptide. 

KW HTLV-I; epitope; diagnosis; env protein; gp46; antibody; vaccine. 

OS Synthetic. 

PN US5003043-A. 

PD 26-MAR-1991. 

PF 25-MAY-1988; 198416. 



PR 24-JAN-1990; US-469291.

PA (UNBI-) UNITED BIOMEDICAL.

PI Wang CY;

DR WPI; 91-224505/31.

PT Peptide compsns. corresp. to envelope fragments of HTLV-1,2 - for

PT detecting antibodies to these viruses and diagnosing HIV and

PT adult T-cell leukaemia infections

PS Claim 1; Page 17; 27pp; English.

CC This peptide is one of 16 peptides useful for detecting antibodies to

CC HTLV or HIV viruses. The peptides correspond to partial sequences of

CC the HTLV virus designated gp21 and gp64, both part of gp61, which

CC defines the envelope protein of the HTLV-I or HTLV-II virus. The

CC peptides can be amidated at the C-terminal. This particular peptide

CC is used in a composition with at least two of the other peptides of

CC the invention. See R13184-R13193 and R13861-6.

SQ Sequence 38 AA;

SQ 2 A; 3 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 3 I; 4 L; 0 K; 0 M; 2 F; 6 P; 7 S; 1 T; 0 W; 0 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:31-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
QAEVS 
|| ||

CFDPQIQAIVSSPCHNSLILPPFSLSPVPTLGSRSRRA
X 10 20 30 



10. US-08-249-182-4 (1-5) 

R06408 HTLV-1 corresponding peptide (VI). 

ID R06408 standard; protein; 38 AA. 

AC R06408;

DT 21-DEC-1990 (first entry)

DE HTLV-1 corresponding peptide (VI).

KW HTLV-1; HIV; antibodies; vaccines; polymers;

OS Synthetic.

PN WO9008162-A.

PD 26-JUL-1990.

PF 16-JAN-1990; U00260.

PR 13-JAN-1989; US-297635.

PA (UNBI-) UNITED BIOMED INC.

PI Wang CY;

DR WPI; 90-254015/33.

PT Synthetic peptide(s) corresponding to HTLV-1 and opt. HIV - used

PT for detection of antibodies, in vaccines and for development of

PT antibodies

PS Claim 1 (VI); Page 38; 52pp; English.

CC Peptides having specific immunoreactivity to antibodies to HTLV-1

CC comprise this sequence on its own, or an analogue of it in which

CC amino acids may be added, deleted or substd., or segments, mixts.,

CC conjugates or polymers of the peptides represented in R06403-08.

CC The peptides are safe, sensitive and specific in the detection of

CC antibodies. This peptide corresponds to a partial segment of the

CC amino acid sequence of the HTLV-1 virus gp.21 or gp.46 and are

CC prepared by solid phase synthesis.

SQ Sequence 38 AA;

SQ 2 A; 3 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 3 I; 4 L; 0 K; 0 M; 2 F; 6 P; 7 S; 1 T; 0 W; 0 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:07-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 3.74 



OS Synthetic. 

PN WO9003984-A.

PD 19-APR-1990.

PF 29-SEP-1989; U04302.

PR 19-SEP-1989; US-407663, US-252949; WO-U04302.

PA (REPL-) Repligen Corp.

PI Rusche JR, Putney SD, Javaherian K, Farley J, Grimaila R, Lynn D,

PI Petro-Breyer J,

DR WPI; 90-147824/19.

PT Principal neutralising domain of HIV variants - used for producing

PT peptide(s) and antibodies for diagnosis, prophylaxis, and/or therapy

PT of HIV infection.

PS Claim 8 (75); Page 77; 108pp; English.

CC Peptide RP70 comprises segments of the Principal Neutralising Domain

CC (envelope protein) from isolate HIV-MN. Cysteines may be added,

CC so that the residues at or near both ends to form a disulfide bond,

CC giving the peptide a loop-like configuration.

CC The loop configuration can be utilised to enhance the

CC immunogenic properties of the peptides. The protein is capable of

CC eliciting, and/or binding with, neutralising antibodies.

CC The neutralising domain is bounded by cysteine residues which occur at

CC positions 296 and 331. Peptides can be used as immunogens or screening

CC reagents to generate or identify poly- or monoclonal antibodies.

CC See also R04427-R04506 and Q04273-Q04279.

SQ Sequence 40 AA;

SQ 2 A; 5 R; 5 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;

SQ 7 I; 0 L; 3 K; 0 M; 1 F; 2 P; 1 S; 4 T; 0 W; 2 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:56:55-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.33
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 
|||||

INCTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHCNIS
X 10 20 30 40 



6. US-08-249-182-5 (1-10) 

R31521 Cysteine protease #2. 

ID R31521 standard; Protein; 215 AA.

AC R31521;

DT 20-MAY-1993 (first entry)

DE Cysteine protease #2.

KW Generic; cysteine protease; parasite; helminth; immune system;

KW hypersensitivity reaction; antibody; antigen; graft; tissue;

KW footpad; swelling; booster; sheep; red blood cell; immunisation;

KW organ; transplant; rejection; autoimmune; disease.

OS Paragonimus westermani.

PN EP-524834-A.

PD 27-JAN-1993.

PF 24-JUL-1992; 306803.

PR 25-JUL-1991; JP-208546.

PR 12-FEB-1992; JP-057189.

PA (HAMA/) HAMAJIMA F.

PA (TSUR/) TSURU S.

PA (YAMA/) YAMAKAMI K.

PA (YAMA/) YAMAMOTO M.

PI Hamajima F, Tsuru S, Yamakami K, Yamamoto M;

DR WPI; 93-028881/04.

DR N-PSDB; Q35446.

PT Immunosuppressive cysteine protease obtained from larvae of

PT parasitic helminths - used for suppression of graft rejection in

PT organ transplantation and suppression of auto-immune diseases

PS Disclosure; Page 19-20; 26pp; English.

CC The sequence given in R31519 is a generic version of the sequences

CC given in R31520-21. These sequences represent a cysteine protease

CC which was isolated from the parasitic helminth, Paragonimus

CC westermani. This cysteine protease suppresses the immune system,

CC both cell mediated and humoral. It inhibits delayed hypersensitivity

CC reactions and production of antibodies against specific antigens and

CC graft tissues. This protease inhibits footpad swelling caused by a

CC booster injection of sheep red blood cells in the footpad after an

CC initial immunisation intraperitoneally. The proteinase is useful for

CC preventing organ transplant rejection and for control of autoimmune

CC disease.

SQ Sequence 215 AA;

SQ 18 A; 5 R; 8 N; 14 D; 0 B; 8 C; 8 Q; 18 E; 0 Z; 21 G; 3 H;

SQ 10 I; 17 L; 12 K; 5 M; 5 F; 9 P; 14 S; 11 T; 7 W; 9 Y; 13 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:38-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.33
Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0 



X 10 
PEEVTRPNYL 
||| ||

GAMPSSSYLEIIDMGGLESESDYPYVGVE8TCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNA 
70 80 90 100 110 X 120 X 130 



VALQYYQSGVLKPTFEECPDTELNHAVLTVGYDKEGDH 
140 150 160 170 



7. US-08-249-182-5 (1-10) 

R31520 Cysteine protease #1.



ID R31520 standard; Protein; 215 AA.

AC R31520;

DT 20-MAY-1993 (first entry)

DE Cysteine protease #1.

KW Generic; cysteine protease; parasite; helminth; immune system;

KW hypersensitivity reaction; antibody; antigen; graft; tissue;

KW footpad; swelling; booster; sheep; red blood cell; immunisation;

KW organ; transplant; rejection; autoimmune; disease.

OS Paragonimus westermani.

PN EP-524834-A. 

PD 27-JAN-1993. 

PF 24-JUL-1992; 306803. 

PR 25-JUL-1991; JP-208546.

PR 12-FEB-1992; JP-057189.

PA (HAMA/) HAMAJIMA F.

PA (TSUR/) TSURU S.

PA (YAMA/) YAMAKAMI K.

PA (YAMA/) YAMAMOTO M.

PI Hamajima F, Tsuru S, Yamakami K, Yamamoto M;

DR WPI; 93-028881/04.

DR N-PSDB; Q35445.

PT Immunosuppressive cysteine protease obtained from larvae of

PT parasitic helminths - used for suppression of graft rejection in

PT organ transplantation and suppression of auto-immune diseases

PS Disclosure; Page 18; 26pp; English.

CC The sequence given in R31519 is a generic version of the sequences

CC given in R31520-21. These sequences represent a cysteine protease

CC which was isolated from the parasitic helminth, Paragonimus

CC westermani. This cysteine protease suppresses the immune system,

CC both cell mediated and humoral. It inhibits delayed hypersensitivity



PEEVTRPNYL 



INCTRPNYNKRKRIHIGPGRAFYTTKNI IGTIRSAHCNIS 
X 10 20 30 40 



4. US-08-249-182-5 (1-10) 

R29228 Heteroconjugate antibody immunogen RP70.

ID R29228 standard; peptide; 40 AA.

AC R29228;

DT 14-APR-1993 (first entry)

DE Heteroconjugate antibody immunogen RP70.

KW V3 loop; gp41; envelope protein; MN; prototype; virus; variant; HIV;

KW homology; heteroconjugate; enzyme; epitope mapping; replication;

KW conjugate; immunogenic carrier; keyhole limpet hemocyanin; KLH;

KW ovalbumin; succinyl maleimidomethyl cyclohexanylcarboxylate; SMCC.

OS Synthetic.

PN WO9220373-A.

PD 26-NOV-1992.

PF 29-APR-1992; U03616.

PR 14-MAY-1991; US-699773.

PA (REPK ) REPLIGEN CORP.

PI Higgins PJ, Potts BJ;

DR WPI; 92-415475/50.

PT Hetero-conjugate antibodies for treating HIV infections -

PT comprise an antibody specific for an effector cell surface

PT antigen and an antibody to a V3 loop of gp-120 envelope protein

PT of HIV

PS Disclosure; Page 19; 69pp; English.

CC The sequences given in R29226-35 represent peptides which were used

CC as immunogens for the production of antibodies against HIV. These

CC peptides may be either unconjugated or conjugated to an immunogenic

CC carrier, eg. a keyhole limpet hemocyanin (KLH) or ovalbumin, using

CC succinyl maleimidomethyl cyclohexanylcarboxylate (SMCC) as a

CC conjugating agent. Viruses containing these or similar sequences may

CC be recognised by the heteroconjugate enzymes of the invention. The

CC antibodies raised against these sequences may be identified by standard

CC epitope mapping techniques. These antibodies are capable, even at low

CC concentrations, of nearly eliminating viral replication of different

CC strains of HIV.

SQ Sequence 40 AA;

SQ 2 A; 5 R; 5 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;

SQ 7 I; 0 L; 3 K; 0 M; 1 F; 2 P; 1 S; 4 T; 0 W; 2 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:30-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.33
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 
|||||

INCTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHCNIS
X 10 20 30 40 



5. US-08-249-182-5 (1-10) 

R04461 Human immunodeficiency virus peptide RP70.

ID R04461 standard; protein; 40 AA.

AC R04461;

DT 20-SEP-1990 (first entry)

DE Human immunodeficiency virus peptide RP70.

KW HIV-MN; peptide RP70; principal neutralising domain; antibodies;



CC HIV-1 isolates having the amino acid sequence [illegible]. NM-01 is also

CC putatively reactive with the RF-like peptide set out in R33344.

CC The variable region of the heavy and light chain of monoclonal

CC antibody NM-01 were cloned by PCR and sequenced. Nucleotides 1-21

CC and 334-363 of Q37472 corresp. to the PCR primers used to amplify

CC NM-01 light chain sequences and nucleotides 1-27 and 385-402 of

CC Q37471 corresp. to the PCR primers used to amplify NM-01 heavy chain

CC sequences.

SQ Sequence 15 AA;

SQ 0 A; 3 R; 2 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 1 G; 1 H;

SQ 2 I; 0 L; 2 K; 0 M; 0 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:45-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.33
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0 

X 10 
PEEVTRPNYL 

|||||
CTRPNYNKRKRIHIG 
X X 10 
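
The two SQ composition lines of each entry list residue counts in a fixed order (A R N D B C Q E Z G H, then I L K M F P S T W Y V), with anything else reported as "Others". The following is a minimal sketch, not FindSeq's own code, of how such lines can be regenerated from a sequence in order to cross-check a record against its alignment; the function names `composition` and `format_sq` are hypothetical.

```python
from collections import Counter

# Residue order as it appears on the SQ composition lines of the entries.
SQ_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence):
    """Return ([(residue, count), ...] in SQ order, count of other symbols)."""
    counts = Counter(sequence.upper())
    in_order = [(aa, counts.get(aa, 0)) for aa in SQ_ORDER]
    others = sum(n for aa, n in counts.items() if aa not in SQ_ORDER)
    return in_order, others


def format_sq(sequence):
    """Rebuild the SQ lines of a record for the given sequence."""
    in_order, others = composition(sequence)
    first = "; ".join(f"{n} {aa}" for aa, n in in_order[:11]) + ";"
    second = "; ".join(f"{n} {aa}" for aa, n in in_order[11:]) + ";"
    lines = [f"SQ Sequence {len(sequence)} AA;", f"SQ {first}", f"SQ {second}"]
    if others:
        lines.append(f"SQ {others} Others;")
    return lines


if __name__ == "__main__":
    # 15-residue peptide from the alignment directly above.
    for line in format_sq("CTRPNYNKRKRIHIG"):
        print(line)
```

Because the counts must add up to the stated length, the same check helps decide how a damaged composition line in a scan should read.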



3. US-08-249-182-5 (1-10)

R33217 HIV gp120 V3 loop immunogenic peptide RP70.

ID R33217 standard; peptide; 40 AA.

AC R33217;

DT 13-JUL-1993 (first entry)

DE HIV gp120 V3 loop immunogenic peptide RP70.

KW HIV-1; human immunodeficiency virus; antibody generation; AIDS;

KW infection; CD4 binding site; soluble CD4.

OS Synthetic.

FH Key Location/Qualifiers

FT Region 16..20

FT /note= "conserved subsequence in centre of V3 loop"

PN WO9304693-A.

PD 18-MAR-1993.

PF 02-SEP-1992; U07511.

PR 09-SEP-1991; US-756677.

PR 20-JUL-1992; US-916542.

PA (REPK ) REPLIGEN CORP.

PI Field KG, Herlihy WC, Potts BJ, White-Scharf ME;

DR WPI; 93-100653/12.

PT Synergistic compsn. for treating HIV-1 infection - comprises antibody

PT to V3 loop of gp120 and antibody to CD4 binding site of gp120 or

PT soluble CD4 polypeptide

PS Example; Page 12; 56pp; English.

CC The sequence is that of peptide RP70 used as an immunogen for the

CC generation of antibodies directed against the V3 loop of HIV gp120.

CC These antibodies can be used as part of a compsn. with antibodies

CC directed against the CD4 binding site of gp120. The antibodies act

CC synergistically to neutralise HIV-1 in the treatment of HIV

CC infection caused by different strains. It can be formed into a

CC closed loop by creation of a disulphide bond between the two

CC cysteine residues near the ends of the amino acid sequence.

SQ Sequence 40 AA;

SQ 2 A; 5 R; 5 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;

SQ 7 I; 0 L; 3 K; 0 M; 1 F; 2 P; 1 S; 4 T; 0 W; 2 Y; 0 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.33
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 



PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer

PT diagnosis and therapy

PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 29. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 10 AA;

SQ 0 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 0 G; 0 H;

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 2 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

SQ 1 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 9 Optimized Score = 9 Significance = 6.66
Residue Identity = 90% Matches = 9 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 

||||| ||||
PEEVTXPNYL 
X 10 
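
The Matches, Mismatches and Residue Identity figures printed with each alignment agree with a simple position-by-position count over the ungapped overlap, with the percentage truncated to a whole number (5 matches out of 9 is printed as 55%). The sketch below illustrates that reading; it is an assumption drawn from the printed values, not a description of FastDB's internals, and `ungapped_identity` is a hypothetical name.

```python
def ungapped_identity(query, target):
    """Count matches and mismatches between two equal-length, ungapped
    aligned segments and return (matches, mismatches, identity_percent).

    The percentage is truncated, which is an assumption chosen to agree
    with the values printed in the listing.
    """
    if len(query) != len(target):
        raise ValueError("aligned segments must have the same length")
    matches = sum(q == t for q, t in zip(query, target))
    mismatches = len(query) - matches
    identity = 100 * matches // len(query)
    return matches, mismatches, identity


if __name__ == "__main__":
    # Query US-08-249-182-5 (1-10) against the top-scoring entry above.
    print(ungapped_identity("PEEVTRPNYL", "PEEVTXPNYL"))
```

For gapped alignments the denominator would need a convention for gap positions; every alignment in this listing reports Gaps = 0, so the question does not arise here.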



2. US-08-249-182-5 (1-10) 

R33332 Sequence of peptide which corresp. to residues 302 

ID R33332 standard; peptide; 15 AA. 

AC R33332; 

DT 06-JUL-1993 (first entry) 

DE Sequence of peptide which corresp. to residues 302-316 of the V3 

DE loop region of HIV-1MN gp120.

KW Monoclonal antibody; NM-01; HIV-1; gp120; gp160.

OS Synthetic.

PN WO9304090-A.

PD 04-MAR-1993.

PF 24-AUG-1992; U07111.

PR 22-AUG-1991; US-748562.

PA (NISP ) NISSIN SHOKUHIN KAISHA LTD.

PI Ohno T;

DR WPI; 93-093943/11.

PT Monoclonal antibodies against HIV-1 gp120 and gp160 proteins -

PT for treating and preventing HIV-1 infection

PS Example; Page 19; 57pp; English.

CC Hybridoma cell line HB 10726 secretes MAb NM-01. In order to

CC characterize the viral epitope recognized by NM-01, the Ab was

CC screened reactivity with overlapping peptides corresp. to the amino

CC acid sequence of the V3 loop region of HIV-1 gp120 (R33332, R33333,

CC R33334). While there was no detectable reactivity over background of

CC MAb-01 with the peptides corresp. to AAs 302-316 or 322-336 of the

CC V3 loop, binding of the antibody to the peptide representing AAs

CC 312-326 was apparent. The extent of this reactivity with other

CC HIV-1 isolates was screened with peptides corresp. to the V3 loop

CC region of HIV-1 isolates IIIB, RF, CDC4, NY/5, Z6, Z2 and ELI

CC (R33335-R33342). These results indicate that monoclonal antibody

CC NM-01 recognizes an epitope of the V3 loop of gp120 of multiple



The list of best scores is:

                                                          Init.  Opt.
      Sequence Name  Description                  Length  Score  Score  Sig.   Frame
      (illegible)    ...H20B receptor.               731      ?      ?  3.33       0
14.   (illegible)    Sequence of human factor XIII   732      ?      ?  3.33       0

**** 2 standard deviations above mean ****

15.   (illegible)    Oligopeptide K24-34.             11      4      ?  2.50       0
16.   (illegible)    D32.37 antibody isolated pept    12      ?      ?  2.50       0
17.   (illegible)    Human immunodeficiency virus     14      4      ?  2.50       0
18.   (illegible)    Human immunodeficiency virus     1?      4      ?  2.50       0
19.   (illegible)    Sequence of peptide fragment     21      4      ?  2.50       0
20.   (illegible)    Gp120-29, for activating T ce    23      4      ?  2.50       0
21.   (illegible)    Beta tumour cell growth facto    24      4      ?  2.50       0
22.   (illegible)    HIV SP-10 region corresp. pep    24      4      ?  2.50       0
23.   (illegible)    Peptide corresp. to an epitop    25      4      ?  2.50       0
24.   (illegible)    Synthetic peptide which elici    25      4      4  2.50       0
25.   (illegible)    Human immunodeficiency virus     25      4      ?  2.50       0
26.   (illegible)    Peptide corresp. to an epitop    26      4      4  2.50       0
27.   (illegible)    Peptide corresp. to an epitop    26      4      4  2.50       0
28.   (illegible)    Synthetic peptide which elici    26      4      4  2.50       0
29.   (illegible)    Peptide corresp. to an epitop    27      4      4  2.50       0
30.
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1. US-08-249-182-5 (1-10) 

R37447 Autotaxin peptide ATX 29. 

ID R37447 standard; peptide; 10 AA. 

AC R37447;

DT 22-JUL-1993 (first entry)

DE Autotaxin peptide ATX 29.

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic.

FH Key Location/Qualifiers

FT Modified_site 6

FT /note= "potentially glycosylated residue"

PN US7822043-A. 



[Histogram of the score distribution, lower portion: NUMBER OF SEQUENCES (axis ticks at 500, 100, 50 and 10) against SCORE (0 to 8) and STDEV (0 to 6).]



PARAMETERS

Similarity matrix          Unitary    K-tuple                  2
Mismatch penalty                 1    Joining penalty         20
Gap penalty                   1.00    Window size              5
Gap size penalty              0.05
Cutoff score                     0
Randomization group              0
Initial scores to save          40    Alignments to save      15
Optimized scores to save         0    Display context         50





SEARCH STATISTICS

Scores:     Mean           Median         Standard Deviation
            1              3              1.20

Times:      CPU            Total Elapsed
            00:00:08.91    00:00:09.00

Number of residues:             482836

Number of sequences searched:   5543

Number of scores above cutoff:  2895



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
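
The printout does not state how the significance value is derived. The figures for this search are consistent with a plain z-score of the initial score against the reported score statistics (mean 1, standard deviation 1.20): scores of 9, 5 and 4 are listed with significances of 6.66, 3.33 and 2.50. The sketch below works under that assumption only; `significance` is a hypothetical helper, and the mean is used as printed, so the last digit can differ slightly.

```python
def significance(initial_score, mean, stdev):
    """Standard deviations by which a score exceeds the mean of all scores.

    Assumed formula (not stated in the printout):
    (initial score - mean) / standard deviation.
    """
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


if __name__ == "__main__":
    # Score statistics as printed under SEARCH STATISTICS for this run.
    mean, stdev = 1, 1.20
    for score in (9, 5, 4):
        print(score, significance(score, mean, stdev))
```

On this reading, the "2 standard deviations above mean" divider in the list of best scores marks the group of entries whose significance lies between 2 and 3.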



A 100% identical sequence to the query sequence was not found.



PR 23-OCT-1989; US-425252.

PA (HOFF ) Hoffmann-La Roche AG.

PI Buonagurio DA, Longiaru M;

DR WPI; 91-126308/18.

DR N-PSDB; Q11554.

PT New nucleic acid encoding conserved epitope of HTLV gp.21 - and

PT hybrids with other epitopes and the derived polypeptide(s) useful as

PT immunoassay reagents for detecting specific HTLV antibodies in serum

PS Disclosure; Fig 11; 31pp; English.

CC This ENV93/HTLV-1-II fusion protein comprises an epitope from the

CC immunodominant conserved region of HTLV-1 envelope (env)

CC glycoprotein (gp) 21 fused to an epitope from the gp46 region of HTLV-1

CC env protein. The ENV93 epitope constitutes residues 342-432 of

CC the gp21 sequence and the HTLV-1-II epitope constitutes residues

CC 201-307 of gp46. The ENV93 gene construct encoding this fusion

CC protein is used as a vehicle for the high level expression

CC of this other epitope of HTLV-1 env as a fusion protein. The

CC fusion protein is useful for detecting antibodies to HTLV-1 in

CC body fluids, eg blood, where it provides a more sensitive and

CC selective assay than current viral lysate tests.

CC See also Q11552-53 and Q11555-58.

SQ Sequence 217 AA;

SQ 10 A; 7 R; 11 N; 7 D; 0 B; 6 C; 14 Q; 6 E; 0 Z; 10 G; 7 H;

SQ 11 I; 34 L; 8 K; 1 M; 5 F; 20 P; 25 S; 12 T; 6 W; 4 Y; 13 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:21-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.74
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
QAEVS
|| ||

ASLSTWHVLYSPNVSVPSSSSTPLLYPSLALPAPHLTLPFNWTHCFDPQIQAIVSSPCHNSLILPPFSLSPV
140 150 160 170 180 190 X 200 210 



PTLGSQA 



> 0 < 

0| |0 IntelliGenetics 

> 0 < 



FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 



Results file us-08-249-182-5.res made by on Wed 21 Sep 94 12:09:17-PDT.



Query sequence being compared: US-08-249-182-5 (1-10)
Number of sequences searched: 5543
Number of scores above cutoff: 2895

Results of the initial comparison of US-08-249-182-5 (1-10) with:
File : /home/shears/loring/lorin*.pep

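[Histogram of the score distribution, upper portion: NUMBER OF SEQUENCES axis with ticks at 10000 and 5000; the plotted points are not recoverable.]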



X X 
YDVPWNETI

||||
CTTAVPWNASWS
X 10 



7. US-08-249-182-6 (1-9)

P91670 HIV-1 TriP related polypeptide 

ID P91670 standard; protein; 12 AA. 

AC P91670; 

DT 29-JUN-1990 (first entry) 

DE HIV-1 TUP related polypeptide 

KW HIV-1; TUP related polypeptide; assay; anti-HIV antibodies.

OS Human immunodeficiency virus.

PN WO8901494-A.

PD 23-FEB-1989.

PF 19-AUG-1988; U02870.

PR 21-AUG-1987; US-088067.

PA (SCRI) Scripps Clinic & Res.

PI Oldstone M, Gnann J;

DR WPI; 89-068856/09.

PT Human immuno-deficiency virus related polypeptide(s) -

PT used for assaying anti-HIV antibodies in samples and for

PT inducing antibody prodn.

PS Disclosure; page 16; 43pp; English

CC A compsn. is claimed comprising an admixture of at least two polypeptides

CC where the 1st polypeptide is p91665 or p92120 and the 2nd is

CC p91666, p91667 or p91668. The compsn. is characterised as being

CC substantially free of antibodies (Abs) that immunoreact with HIV-1

CC TUP related polypeptides such as p91670.

SQ Sequence 12 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:56:50-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 44% Matches = 4 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI 

||||
CTTAVPWNASWS

X 10 



8. US-08-249-182-6 (1-9)

R13191 Peptide (XIV) immunoreactive with anti-HIV-1 antib

ID R13191 standard; Protein; 19 AA.

AC R13191;

DT 09-OCT-1991 (first entry)

DE Peptide (XIV) immunoreactive with anti-HIV-1 antibodies.

KW human T-cell leukaemia virus; AIDS; ATL; detection;

KW envelope protein gp61; acquired immunodeficiency syndrome;

KW human immunodeficiency virus.

OS Synthetic. 

PN EP-439077-A. 

PD 31-JUL-1991. 

PF 18-JAN-1991; 100616. 

PR 24-JAN-1990; US-469291. 

PA (UNBI-) UNITED BIOMEDICAL. 



it ftuuit* ui uiLDhrt urtiL imn uui at uscu tu ridKcr dribiuuuiirs i tfUL. 

PT includes forming 1st and 2nd PCR admixtures, subjecting them to

PT PCR thermo-cycles, sepg. double stranded DNA, hybridising, etc.

PS Disclosure; Fig 6; 143pp; English.

CC This sequence is the C terminal of R27568 encoded by a sequence

CC inserted into lambda ZAP to give ImmunoZAP H expression vector. The VH

CC or VL homologue is fused to the N terminal of this sequence.

SQ Sequence 12 AA;

SQ 0 A; 0 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 1 G; 0 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 2 P; 2 S; 1 T; 0 W; 3 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:26-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 50% Matches = 4 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI 

||||

TSYPYDVPDYGS 
X 10 X 



6. US-08-249-182-6 (1-9) 

R23616 Peptide able to induce in vivo prodn. of antibodie 

ID R23616 standard; peptide; 12 AA.

AC R23616;

DT 29-OCT-1992 (first entry)

DE Peptide able to induce in vivo prodn. of antibodies.

KW Mycoplasm; adhesion; molecule; HIV; AIDS; gp160; nef; T-lymphocytes;

KW vaccine; passive; immunotherapy.

OS Human immunodeficiency virus.

PN WO9206199-A.

PD 16-APR-1992.

PF 27-SEP-1991; F00758.

PR 27-SEP-1990; FR-011951.

PR 26-OCT-1990; FR-013324.

PA (INSP ) INST PASTEUR.

PI Bahraoui E, Berneman D, Blanchard A, Chamaret S, Guetard D,

PI Montagnier L, Van Rietschoten J;

DR WPI; 92-150884/18.

PT New peptide(s) common to mycoplasm adhesion molecule and HIV

PT proteins - useful as vaccines against AIDS and mycoplasma

PT infections and in their diagnosis

PS Claim 21; Page 36; 123pp; French.

CC The peptide is derived from the HIV envelope protein gp160 and has

CC homology to a peptide derived from mycoplasma genitalium, a

CC mycoplasma adhesin. The synthetic peptide, coupled to keyhole

CC limpet haemocyanin to form a conjugate, is useful as an

CC immunogen to induce antibodies which neutralise HIV infection.

CC It can be formulated in a vaccine, opt. with other peptides able

CC to induce neutralising antibodies (specifically one corresp. to

CC the V3 region of gp120). The peptide can also be used

CC diagnostically to detect antibodies against mycoplasma and/or HIV and also

CC to monitor the progress of disease treatment. Antibodies raised

CC against the peptide may be used in passive immunotherapy to inhibit

CC infection by HIV-T cell lymphocytes.

CC See also R23604-17.

SQ Sequence 12 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:04-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 44% Matches = 4 Mismatches = 5
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KW binding domain; antibody; diarrhoea; liganrf; antagonist. 

OS Rattus rattus. 

PN US5237051-A. 

PD 17-AUG-1993. 

PF 06-DEC-1990; 623033. 

PR 06-DEC-1990; US-623033.

PA (UYVA-) UNIV VANDERBILT.

PI Garbers DL, Schulz S;

DR WPI; 93-272183/34.

PT New purified enterotoxin receptor protein - used to develop

PT prods. for treating abnormal conditions caused by bacterially

PT released enterotoxin, partic. diarrhoea

PS Disclosure; Fig 3; 26pp; English.

CC The sequences given in R38862-63 represent the guanylyl cyclases,

CC GC-A and GC-B, which binds heat stable enterotoxin. These proteins

CC are enterotoxin receptors which may be used as a therapeutic to control

CC intestinal fluid permeation as well as abnormal conditions caused

CC by bacterially released enterotoxin. The binding domain of the

CC proteins, or antibodies to the proteins, can be used to eliminate

CC diarrhoea. The proteins may be used to isolate ligands and to screen

CC for antagonists of toxin binding.

SQ Sequence 1029 AA;

SQ 74 A; 73 R; 37 N; 56 D; 0 B; 16 C; 40 Q; 72 E; 0 Z; 75 G; 23 H;

SQ 38 I; 124 L; 44 K; 20 M; 46 F; 50 P; 61 S; 53 T; 16 W; 34 Y; 77 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 4.59
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI 
|| ||| 

ETVQAEAFDSVTIYFSDIVGFTALSAESTPHQVVTLLNDLYTCFDAVIDNFDVYKVETIGDAYHVVSGLPVR 
840 850 860 870 880 X 890 900 

NGQLHAREVARWALALLDAVRSFRIRHRPGEGLRLRI 
910 920 930 940 



4. US-08-249-182-6 (1-9) 
R38861 GC-C. 

ID R38861 standard; Protein; 1075 AA. 

AC R38861; 

DT 08-FEB-1994 (first entry) 

DE GC-C. 

KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine; 

KW mucosa; polyA+ RNA; PCR; enterotoxin receptor; bacterial enterotoxin; 

KW binding domain; antibody; diarrhoea; ligand; antagonist. 

OS Rattus rattus. 

FH Key Location/Qualifiers 

FT Peptide 1..22 

FT /note= "Signal peptide" 

FT Protein 23..1075 

FT /note= "Mature CG-C" 

FT Modified_site 31..33 

FT /note= "N-linked glycosylation site" 

FT Modified_site 74..76 

FT /note= "N-linked glycosylation site" 

FT Modified_site 78..80 

FT /note= "N-linked glycosylation site" 

FT Modified_site 187..189 

FT /note= "N-linked glycosylation site" 

FT Modified_site 194..196 



X X 
YDVPWNETI 
||||||||| 
YDVPWNETI 
X X 



2. US-08-249-182-6 (1-9) 
R38863 GC-B. 

ID R38863 standard; Protein; 1025 AA. 

AC R38863; 

DT 08-FEB-1994 (first entry) 

DE GC-B. 

KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine; 

KW mucosa; polyA+ RNA; PCR; enterotoxin receptor; bacterial enterotoxin; 

KW binding domain; antibody; diarrhoea; ligand; antagonist. 

OS Rattus rattus. 

PN US5237051-A. 

PD 17-AUG-1993. 

PF 06-DEC-1990; 623033. 

PR 06-DEC-1990; US-623033. 

PA (UYVA-) UNIV VANDERBILT. 

PI Garbers DL, Schulz S; 

DR WPI; 93-272183/34. 

PT New purified enterotoxin receptor protein - used to develop 

PT prods. for treating abnormal conditions caused by bacterially 

PT released enterotoxin, partic. diarrhoea 

PS Disclosure; Fig 3; 26pp; English. 

CC The sequences given in R38862-63 represent the guanylyl cyclases, 

CC GC-A and GC-B, which bind heat stable enterotoxin. These proteins 

CC are enterotoxin receptors which may be used as a therapeutic to control 

CC intestinal fluid permeation as well as abnormal conditions caused 

CC by bacterially released enterotoxin. The binding domain of the 

CC proteins, or antibodies to the proteins, can be used to eliminate 

CC diarrhoea. The proteins may be used to isolate ligands and to screen 

CC for antagonists of toxin binding. This sequence is given as it is 

CC represented in the specification. 

SQ Sequence 1025 AA; 

SQ 82 A; 69 R; 39 N; 54 D; 0 B; 15 C; 40 Q; 64 E; 0 Z; 78 G; 28 H; 

SQ 49 I; 122 L; 39 K; 21 M; 46 F; 53 P; 56 S; 53 T; 16 W; 37 Y; 64 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 4.59 
Residue Identity = 55% Matches = 5 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI 
|| ||| 

ETVQAEAFDSVTIYFSDIVGFTALSAESTPMSVVTLLNDLYTCFDAIIDNFDVYKVETIGDAYMVVSGLPGR 
830 840 850 860 870 880 X 890 

NGQRHAPE 1 ARMALALLDA VSSFR I RHRPHDfilRLR I 
900 910 920 930 



3. US-08-249-182-6 (1-9) 
R38862 GC-A. 

ID R38862 standard; Protein; 1029 AA. 

AC R38862; 

DT 08-FEB-1994 (first entry) 

DE GC-A. 

KW Guanylyl cyclase; GC-C; heat stable; enterotoxin; rat; small intestine; 



15. R44497    Sequence of the HIV-1 epitope     35     4     4  3.67     0
16. R31964    BCH-87ck.                         35     4     4  3.67     0
17. R31963    BCH-87c.                          35     4     4  3.67     0
18. R26492    Fragment of HIV-1 gp41 (resid     35     4     4  3.67     0
19. R31966    BCH-26&.                          36     4     4  3.67     0
20. R31967    BCH-408.                          39     4     4  3.67     0
21. R20661    HIV antibody detecting peptid     39     4     4  3.67     0
22. R31968    BCH-408k.                         40     4     4  3.67     0
23. R20659    HIV antibody detecting peptid     44     4     4  3.67     0
24. P60067    Sequence of HTLV-III polypept     82     4     5  3.67     0
25. P70239    Polypeptide ENV (80).             96     4     4  3.67     0
26. R25407    Light chain variable domain o    113     4     4  3.67     0
27. R28743    Light chain variable domain o    114     4     4  3.67     0
28. P93342    HIV-1 env protein from CBL-1     140     4     4  3.67     0
29. P60068    Sequence of HTLV-III polypept    146     4     4  3.67     0
30. R41320    PEP (1-160).                     160     4     4  3.67     0
31. R43910    Nerve growth factor.             239     4     4  3.67     0
32. R37647    Sequence of a 4-4-20/212 sing    250     4     4  3.67     0
33. R30723    Protein C heavy chain.           261     4     4  3.67     0
34. R34030    Fc-alpha-R.                      287     4     4  3.67     0
35. P80297    Sequence encoded by the env <    291     4     4  3.67     0
36. P82053    Outer membrane protein F of P    350     4     4  3.67     0
37. R05637    Placenta-specific protein PP1    369     4     4  3.67     0
38. R21846    Gal alpha-2,6-ST (from clone     406     4     4  3.67     0
39. P91235    (ENV-30MGAG-VII)<Hexahis> pr     515     4     4  3.67     0
40. P70541    HTLV-III gag/env gene protein    600     4     4  3.67     0

1. US-08-249-182-6 (1-9) 

R37448 Autotaxin peptide ATX 47. 

ID R37448 standard; peptide; 9 AA. 

AC R37448; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 47. 

KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW immunostains; disease outcome prediction; therapy choice; 

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M; 

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 47. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 9 AA; 

SQ 0 A; 0 R; 1 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H; 

SQ 1 I; 0 L; 0 K; 0 M; 0 F; 1 P; 0 S; 1 T; 1 W; 1 Y; 1 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq 

Initial Score = 9 Optimized Score = 9 Significance = 8.27 
Residue Identity = 100% Matches = 9 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 
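
The Residue Identity figure printed with each alignment appears to be the number of matches divided by the number of aligned positions (matches plus mismatches), truncated to a whole percent. That rule is inferred from the values in this printout, not taken from FastDB documentation, and the function name below is hypothetical. A minimal Python sketch for ungapped alignments such as these (Gaps = 0):

```python
def residue_identity_percent(matches: int, mismatches: int) -> int:
    """Percent identity over the aligned positions, truncated to an integer.

    Assumption: identity = matches / (matches + mismatches), with no gap
    columns counted. This is inferred from the printed values only.
    """
    compared = matches + mismatches
    if compared <= 0:
        raise ValueError("alignment must cover at least one position")
    # Integer division truncates, which is what the printed figures suggest.
    return (100 * matches) // compared


if __name__ == "__main__":
    # (matches, mismatches) pairs taken from alignments in this printout.
    for m, mm in [(9, 0), (5, 4), (4, 5), (7, 9), (6, 10), (5, 5)]:
        print(m, mm, residue_identity_percent(m, mm))
```

Under this assumption the pairs above reproduce the printed identities for the same alignments, which is why truncation rather than rounding is used.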



SCORE 0| 1| 2| 3| 4| 5| 6| 7| 8 
STDEV 1 2 3 4 5 6 7 



PARAMETERS 



Similarity matrix          Unitary     K-tuple                 2
Mismatch penalty                 1     Joining penalty        20
Gap penalty                   1.00     Window size             5
Gap size penalty              0.05
Cutoff score                     0
Randomization group              0
Initial scores to save          40     Alignments to save     15
Optimized scores to save         0     Display context        50




SEARCH STATISTICS 




Scores:                 Mean    Median    Standard Deviation
                           0         1                  1.09

Times:                   CPU    Total Elapsed
                 00:00:08.90      00:00:09.00

Number of residues:               482836
Number of sequences searched:       5543
Number of scores above cutoff:      2303

The scores below are sorted by initial score.
Significance is calculated based on initial score.
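
The printout does not state how Significance is derived. The "standard deviations above mean" banners in the score list suggest it is the initial score expressed in standard deviations above the mean score. The sketch below works under that assumption; the function name is hypothetical, and the mean and standard deviation are the rounded values printed in the search statistics, so results can differ from the printed significances in the last digit.

```python
def significance(initial_score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean.

    Assumption: significance = (score - mean) / stdev. This is inferred
    from the printout, not taken from FastDB documentation.
    """
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


if __name__ == "__main__":
    # Rounded statistics from the printout: mean 0, standard deviation 1.09.
    for score in (9, 5, 4):
        print(score, round(significance(score, 0.0, 1.09), 2))
```

For initial scores of 9, 5, and 4 this gives values close to the printed 8.27, 4.59, and 3.67.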





A 100% identical sequence to the query sequence was found: 

                                                   Init.  Opt.
Sequence Name Description                   Length Score Score  Sig. Frame

 1. R37448    Autotaxin peptide ATX 47.          9     9     9  8.27     0

The list of other best scores is: 

                                                   Init.  Opt.
Sequence Name Description                   Length Score Score  Sig. Frame







**** 4 standard deviations above mean ****

 2. R38863    GC-B.                           1025     5     5  4.59     0
 3. R38862    GC-A.                           1029     5     5  4.59     0
 4. R38861    GC-C.                           1075     5     5  4.59     0






**** 3 standard deviations above mean ****

 5. R27567    C terminal end of sequence in     12              3.67     0
 6. R23616    Peptide able to induce in viv     12              3.67     0
 7. P91670    HIV-1 THP related polypeptide     12              3.67     0
 8. R13191    Peptide (XIV) immunoreactive      19              3.67     0
 9. R06410    HTLV-1 corresponding peptide      19              3.67     0
10. R41059    HIV-1 gp41 peptide (isolate H     21              3.67     0
11. R41063    HIV-1 gp41 peptide (isolate E     25              3.67     0
12. P80566    Peptide region of human immun     27              3.67     0
13. R27564    Insert D to prevent steric hi     29              3.67     0



SQ Sequence 11 AA; 

SQ 0 A; 1 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 1 H; 

SQ 2 I; 0 L; 1 K; 0 M; 0 F; 1 P; 0 S; 1 T; 1 W; 0 Y; 2 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:21-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 2.50 
Residue Identity = 44% Matches = 4 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 

|||| 
IIVTRPWKHVE 
X 10 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 

Results file us-08-249-182-6.res made by on Wed 21 Sep 94 12:10:24-PDT. 



Query sequence being compared: US-08-249-182-6 (1-9) 
Number of sequences searched: 5543 
Number of scores above cutoff: 2303 

Results of the initial comparison of US-08-249-182-6 (1-9) with: 
File : /home/shears/loring/lorin*.pep 

[Histogram of initial scores; vertical axis label begins "NUMBER OF", with tick labels 10000, 5000, 1000. Plot body not recoverable.]
PR 12-MAR-1986; DE-608280. 

PR 26-JUN-1986; DE-621371. 

PA (BEHW ) BEHRINGWERKE AG. 

PI Grundmann U, Amann E, Zettlmeissl G; 

DR WPI; 87-258275/37. 

DR N-PSDB; N70461. 

PT New DNA sequence coding for factor 13A and expressed proteins - 

PT useful as diagnostic reagents and for producing antibodies 

PS Claim 10; Table 3, pp16-20; 30pp; German. 

CC Human placental cDNA gene bank was screened by hybridisation with two 

CC synthetic oligonucleotides, corresponding to partial AA sequences of 

CC factor XIIIa (N70460, N70465). N70461 gives the coding strand 

CC sequence of clones lambda-gt10-11 and lambda-gt10-12. 

SQ Sequence 732 AA; 

SQ 37 A; 45 R; 40 N; 47 D; 0 B; 9 C; 26 Q; 49 E; 0 Z; 50 G; 14 H; 

SQ 38 I; 49 L; 38 K; 20 M; 31 F; 33 P; 46 S; 45 T; 15 W; 29 Y; 71 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:14-PDT using FindSeq 



Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X X 
PEEVTRPNYL 
||| || 

VVGSDMTVTVSFTNPLKETLRNVWVHLDGPGVTRPMKKMFREIRPNSTVQWEEVCRPWVSGHRKLIASHSSD 
650 660 670 680 690 X 700 710 



SLRHVYGELDVQIQRRPSM 
720 730 



15. US-08-249-182-5 (1-10) 

R26038 Oligopeptide P24-34. 

ID R26038 standard; peptide; 11 AA. 

AC R26038; 

DT 02-FEB-1993 (first entry) 

DE Oligopeptide P24-34. 

KW Human; granulocyte-macrophage colony stimulating factor; antibody; 

KW monoclonal; haemopoietic progenitor cells; neoplastic disease; 

KW lymphoma; leukaemia. 

OS Mus musculus. 

PN EP-499161-A. 

PD 19-AUG-1992. 

PF 07-FEB-1992; 102103. 

PR 11-FEB-1991; US-653428. 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 

PI Braslawsky GR, Bursuker I, Greenfield RS; 

DR WPI; 92-277909/34. 

PT New monoclonal antibodies which inhibit GM-CSF - for treating 

PT GM-CSF mediated inflammatory and autoimmune diseases e.g. 

PT systemic lupus erythematosus, temporal arteritis, etc.; also 

PT atherosclerosis 

PS Disclosure; Fig 4; 31pp; English. 

CC The sequence given in R26036 is a portion of the murine wild-type 

CC sequence from the granulocyte-macrophage colony stimulating factor 

CC (GM-CSF). The sequence given in R26037 is the "corresponding" 

CC sequence from the human GM-CSF polypeptide sequence. Sequences 

CC R26038-41 are other portions from the murine GM-CSF gene. All these 

CC peptide sequences can be used to raise antibodies against GM-CSF. 

CC These monoclonal antibodies (MAb) can be used to inhibit or prevent 

CC undesirable disease states which are caused by the colony stimulating 

CC activity of GM-CSF on haemopoietic progenitor cells. These MAb's can 

CC also be used for inhibiting or eliminating the growth of GM-CSF- 



FT Protein 38.. 731 

FT /label= mature_a'_subunit 

FT Active_site 314 

FT /note= "of the a subunit" 

FT Cleavage_site 37.. 38 

FT /note= "for conversion of a subunit to a' subunit 

FT by thrombin" 

PN WO9116931-A. 

PD 14-NOV-1991. 

PF 09-MAY-1991; U03212. 

PR 10-MAY-1990; US-521805. 

PR 18-MAY-1990; US-525556. 

PA (ZYMO-) ZYMOGENETICS INC. 

PI Bishop PD; 

DR WPI; 91-353537/48. 

DR N-PSDB; Q14687. 

PT New diagnostic compsns. contg. factor XIII or derivs. - 

PT are coupled to radioisotope or paramagnetic cpd. and are useful 

PT for detecting thrombosis in high risk patients 

PS Disclosure; Fig 1; 51pp; English. 

CC A diagnostic compsn. comprising factor XIII coupled to a detectable 

CC substance, and a physiologically acceptable carrier, are used to 

CC detect and image blood clots (venous and arterial thrombosis). 

CC Since they have a longer half life than fibrinogen, they are 

CC partic. useful before surgery to provide on-going monitoring (for up 

CC to 2 weeks after surgery) from a single dose. 

CC Mutant forms of factor XIII were made by in vitro site-directed 

CC mutagenesis, esp. ss pRS202 templates were modified by the single 

CC primer method, e.g. for TGC-Cys-314 to TCT-Ser exchange. 

CC See also Q14687-88. 

SQ Sequence 731 AA; 

SQ 37 A; 45 R; 40 N; 47 D; 0 B; 9 C; 27 Q; 48 E; 0 Z; 50 G; 14 H; 

SQ 39 I; 49 L; 38 K; 19 M; 31 F; 33 P; 46 S; 45 T; 15 W; 29 Y; 70 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:40-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 
||| || 

VVGSDMTVTVQFTNPLKETLRNVWVHLDGPGVTRPMKKMFREIRPNSTVQWEEVCRPWVSGHRKLIASHSSD 
650 660 670 680 690 700 710 

SLRHVYGELDVQIQRRPSM 
720 730 



14. US-08-249-182-5 (1-10) 

P70293 Sequence of human factor XIIIa. 

ID P70293 standard; Protein; 732 AA. 

AC P70293; 

DT 21-MAY-1991 (first entry) 

DE Sequence of human factor XIIIa. 

KW Diagnosis; antigen; anti-factor XIIIa antibody. 

OS Homo sapiens. 

FH Key Location/Qualifiers 

FT Region 228..249 

FT /note= "used to derive 66mer probe N70460" 

FT Region 475..481 

FT /note= "used to derive 20mer probe N70465" 

PN EP-236978-A. 

PD 16-SEP-1987. 



12. US-08-249-182-5 (1-10) 

R07131 H20A receptor. 



ID R07131 standard; protein; 416 AA. 

AC R07131; 

DT 23-JAN-1991 (first entry) 

DE H20A receptor. 

KW Picornavirus proteins; poliovirus; transgenic animals; vaccines; 

KW antibodies; imaging. 

FH Key Location/Qualifiers 

FT Peptide 1..20 

FT /label=signal peptide 

FT Domain 345..368 

FT /label=transmembrane domain 

PN WO9010699-A. 

PD 20-SEP-1990. 

PF 09-MAR-1990; U01320. 

PR 10-MAR-1989; US-321957. 

PA (UYCO-) COLUMBIA UNIV NY. 

PI Racaniello V, Mendelsohn C, Costantini F; 

DR WPI; 90-305023/40. 

DR N-PSDB; Q06070. 

PT DNA encoding picornavirus partic. poliovirus receptors proteins - 

PT for treating picornavirus infections or for expression in 

PT transgenic animals used to test vaccines 

PS Disclosure; fig 4; 88pp; English. 

CC This poliovirus receptor, H20A, has a sequence differing from the 

CC receptor, H20B at the cytoplasmic tail only. Antibodies (Abs) 

CC raised against it are useful for targetted delivery of the human 

CC poliovirus, conjugated to a drug. Transgenic animals contg. the 

CC corresp. DNA (genomic- or cDNA) can be used to test the efficiency 

CC and virulence of picornavirus vaccines. 

CC See also Q06069. 

SQ Sequence 416 AA; 

SQ 30 A; 18 R; 18 N; 10 D; 0 B; 9 C; 21 Q; 22 E; 0 Z; 32 G; 10 H; 

SQ 13 I; 42 L; 9 K; 9 M; 12 F; 32 P; 37 S; 31 T; 10 W; 11 Y; 40 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:08-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 
|| | || 

MARAMAAAWPLLLVALLVLSWPPPGTGDVVVQAPTSVPGFLGDSVTLPCYLQVPNMEVTHVSSLTHARHGES 
10 20 30 40 X 50 60 70 

GSMAVFHQTQGPSYSESKRLEFVAARLGA 
80 90 100 



13. US-08-249-182-5 (1-10) 

R14376 Factor XIII subunit a. 

ID R14376 standard; Protein; 731 AA. 

AC R14376; 

DT 14-FEB-1992 (first entry) 

DE Factor XIII subunit a. 

KW Factor XIII; subunit; antibody; diagnosis; thrombin; thrombosis; 

KW mutant. 

OS Homo sapiens. 

FH Key Location/Qualifiers 

FT Protein 1..731 



CC Retrieved by shears on Wed 21 Sep 94 [time illegible]-PDT using FindSeq 



Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
PEEVTRPNYL 
||||| 

PVETPTREIKKLDGLWAFSLDRERVADIIDGSENFTDNAKTIIVHLNESVQINCTRPNYNKRKRIHIGPCRA 
10 20 30 40 50 X 60 X 70 

FYTTKNIIGTIRQAHCNISRAKWNDTLRQIVSKLKEQF 
80 90 100 110 



11. US-08-249-182-5 (1-10) 

R07130 H20B receptor. 



ID R07130 standard; protein; 392 AA. 

AC R07130; 

DT 23-JAN-1991 (first entry) 

DE H20B receptor. 

KW Picornavirus proteins; poliovirus; transgenic animals; vaccines; 

KW antibodies; imaging. 

FH Key Location/Qualifiers 

FT Peptide 1..20 

FT /label=signal peptide 

FT Domain 345.. 368 

FT /label=transmembrane domain 

PN WO9010699-A. 

PD 20-SEP-1990. 

PF 09-MAR-1990; U01320. 

PR 10-MAR-1989; US-321957. 

PA (UYCO-) COLUMBIA UNIV NY. 

PI Racaniello V, Mendelsohn C, Costantini F; 

DR WPI; 90-305023/40. 

DR N-PSDB; Q06069. 

PT DNA encoding picornavirus partic. poliovirus receptors proteins - 

PT for treating picornavirus infections or for expression in 

PT transgenic animals used to test vaccines 

PS Disclosure; fig 4; 88pp; English. 

CC This poliovirus receptor, H20B, has a sequence differing from the 

CC receptor, H20A at the cytoplasmic tail only. Antibodies (Abs) 

CC raised against it are useful for targetted delivery of the human 

CC poliovirus, conjugated to a drug. Transgenic animals contg. the 

CC corresp. DNA (genomic- or cDNA) can be used to test the efficiency 

CC and virulence of picornavirus vaccines. 

CC See also Q06070. 

SQ Sequence 392 AA; 

SQ 26 A; 17 R; 17 N; 9 D; 0 B; 10 C; 20 Q; 20 E; 0 Z; 30 G; 10 H; 

SQ 13 I; 42 L; 9 K; 9 M; 12 F; 31 P; 31 S; 28 T; 10 W; 10 Y; 38 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:57:08-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X X 
PEEVTRPNYL 
|| | || 

MARAMAAAWPLLLVALLVLSWPPPGTGDVVVSAPTQVPGFLGDSVTLPCYLQVPNMEVTHVSQLTHARHGES 
10 20 30 40 X 50 60 70 



GSMAVFHQTQGPSYSESKRLEFVAARLGA 
80 90 100 



PS Claim 6; Page 13-14; 26pp; English. 

CC The sequence given in R31519 is a generic version of the sequences 

CC given in R31520-21. These sequences represent a cysteine protease 

CC which was isolated from the parasitic helminth, Paragonimus 

CC westermani. This cysteine protease suppresses the immune system, 

CC both cell mediated and humoral. It inhibits delayed hypersensitivity 

CC reactions and production of antibodies against specific antigens and 

CC graft tissues. This protease inhibits footpad swelling caused by a 

CC booster injection of sheep red blood cells in the footpad after an 

CC initial immunisation intraperitoneally. The proteinase is useful for 

CC preventing organ transplant rejection and for control of autoimmune 

CC disease. 

SQ Sequence 215 AA; 

SQ 17 A; 5 R; 8 N; 12 D; 0 B; 8 C; 8 Q; 17 E; 0 Z; 21 G; 3 H; 

SQ 10 I; 17 L; 12 K; 4 M; 5 F; 8 P; 13 S; 11 T; 7 W; 9 Y; 13 V; 

SQ 7 Others; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:38-PDT using FindSeq 



Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X 10 
PEEVTRPNYL 
||| || 

GGWPXSSYLEIMXMGGLESESDYPYVGVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNA 
70 80 90 100 110 X 120 X 130 



VALQYYGSGVLKPTFEECPDTELNHAVLTVGYDKEGDN 
140 150 160 170 



9. US-08-249-182-5 (1-10) 

R44495 Sequence of the immunoglobulin IgG (1C3/86) kappa 

ID R44495 standard; Protein; 219 AA. 

AC R44495; 

DT 26-MAY-1994 (first entry) 

DE Sequence of the immunoglobulin IgG (1C3/86) kappa chain 

DE derived from clone gamma1.1.1a and augmented using PCR. 

KW Particle-binding antibody fragment; kappa chain; 

KW monoclonal cell line 1C3/86; anti-erythrocyte IgGs. 

OS Synthetic. 

PN WO9324630-A. 

PD 09-DEC-1993. 

PF 19-MAY-1993; AU0228. 

PR 22-MAY-1992; AU-002551. 

PA (AGEN-) AGEN LTD. 

PI Hillyard CJ, Hudson PJ, Lilley GG; 

DR WPI; 93-405821/50. 

DR N-PSDB; Q53430. 

PT Bifunctional recombinant protein - contains particle and analyte 

PT binding moieties, used in agglutination assays pref. on whole 

PT blood 

PS Example; Figure 2; 42pp; English. 

CC mRNA was prepd. from monoclonal antibody cell line (1C3/86) which 

CC prods anti-erythrocyte IgGs which bind to RBCs. ds-cDNA was prepd. 

CC and cloned into lambda-gt10 arms and packaged into a phage library. 

CC The heavy chain clone gamma-M/1.1 and the light chain clone 

CC ph76-kappa-10 were used to source ds-DNA inserts for the screening 

CC of the gt10 library. Positive clones were amplified, and the positive 

CC insert cDNA subcloned into pUC18. As a result a near full-length gamma 

CC clone (gamma-1.1.1a) was identified, the nucleotide sequence was 

CC determined and from this the protein sequence deduced (Q53429/R44494). 

CC The sequences of a partial kappa clone (kappa-4ACl) which encoded 



CC graft tissues. This protease inhibits footpad swelling caused by a 

CC booster injection of sheep red blood cells in the footpad after an 

CC initial immunisation intraperitoneally. The proteinase is useful for 

CC preventing organ transplant rejection and for control of autoimmune 

CC disease. 

SQ Sequence 215 AA; 

SQ 18 A; 5 R; 8 N; 12 D; 0 B; 7 C; 9 Q; 17 E; 0 Z; 23 G; 3 H; 

SQ 10 I; 17 L; 12 K; 4 M; 5 F; 8 P; 15 S; 11 T; 7 W; 10 Y; 14 V; 
CC Retrieved by shears on Wed 21 Sep 94 11:58:38-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
PEEVTRPNYL 
||| || 

GGMPASSYLEIMYMGGLESESDYPYVGVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNA 
70 80 90 100 110 X 120 X 130 

VALQYY8SGVLKPTFEESPDTELNHAVLTVGYDKEGDM 
140 150 160 170 



8. US-08-249-182-5 (1-10) 

R31519 Cysteine protease generic sequence. 

ID R31519 standard; Protein; 215 AA. 

AC R31519; 

DT 20-MAY-1993 (first entry) 

DE Cysteine protease generic sequence. 

KW Generic; cysteine protease; parasite; helminth; immune system; 

KW hypersensitivity reaction; antibody; antigen; graft; tissue; 

KW footpad; swelling; booster; sheep; red blood cell; immunisation; 

KW organ; transplant; rejection; autoimmune; disease. 

OS Paragonimus westermani. 

FH Key Location/Qualifiers 

FT Misc_difference 15 

FT /label= Ala, Pro 

FT Misc_difference 21 

FT /label= Ser, Glu 

FT Misc_difference 58 

FT /label= Arg, Met 

FT Misc_difference 59 

FT /label= Val, Ala 

FT Misc_difference 61 

FT /label= Gln, Glu 

FT Misc_difference 69 

FT /label= Ala, Ser 

FT Misc_difference 77 

FT /label= Tyr, Asp 

PN EP-524834-A. 

PD 27-JAN-1993. 

PF 24-JUL-1992; 306803. 

PR 25-JUL-1991; JP-208546. 

PR 12-FEB-1992; JP-057189. 

PA (HAMA/) HAMAJIMA F. 

PA (TSUR/) TSURU S. 

PA (YAMA/) YAMAKAMI K. 

PA (YAMA/) YAMAMOTO M. 

PI Hamajima F, Tsuru S, Yamakami K, Yamamoto M; 

DR WPI; 93-028881/04. 

DR N-PSDB; Q35444. 

PT Immunosuppressive cysteine protease obtained from larvae of 

PT parasitic helminths - used for suppression of graft rejection in 



CC determined in a similar fashion. To determine the sequence of the 

CC 1C3/86 kappa light chain at the 5' end, a mixed N-terminal sequence 

CC was determined from the intact 1C3/86 Ig and together with the 

CC sequence from a gamma heavy chain clone used to determine the 

CC N-terminus of the variable region of the kappa light chain. A 

CC coding sequence for this amino acid sequence was compiled and PCR 

CC amplified using the redundant forward (sense) primer N960 and the 

CC reverse (antisense) primer N852, which was based on the kappa 

CC constant region beginning at nucleotide 337 (see Q53430). The 

CC sequences derived from the PCR and gt10 library enabled the 

CC compilation of the sequence in Q53430. 

SQ Sequence 219 AA; 

SQ 10 A; 9 R; 11 N; 12 D; 0 B; 5 C; 10 Q; 10 E; 0 Z; 12 G; 2 H; 

SQ 8 I; 13 L; 15 K; 3 M; 8 F; 10 P; 35 S; 20 T; 4 W; 10 Y; 12 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:59:45-PDT using FindSeq 

Initial Score = 5 Optimized Score = 5 Significance = 3.33 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X X 
PEEVTRPNYL 
|| ||| 

DIVMSQSPSSLAVSVAEKVSNSCKSSQSLFNSRTRKNYLTWYQQKPGQSPKPLIYWASTRESGVPDRFTGSG 
10 20 30 40 50 60 70 

SGTDFTLTISSVQAEDL 
80 



10. US-08-249-182-5 (1-10) 

R04495 HIV fusion protein PBlmn 

ID R04495 standard; protein; 229 AA. 

AC R04495; 

DT 20-SEP-1990 (first entry) 

DE HIV fusion protein PBlmn 

KW HIV; fusion protein; PBlmn; therapy; AIDS; principal neutralising domain; 

KW antibodies; diagnosis; prophylaxis. 

OS Synthetic. 

FH Key Location/Qualifiers 

FT Region 36..203 

FT /label= HIV portion of PBlmn 

PN WO9003984-A. 

PD 19-APR-1990. 

PF 29-SEP-1989; U04302. 

PR 19-SEP-1989; US-407663, US-252949; WO-U04302. 

PA (REPL-) Repligen Corp. 

PI Rusche JR, Putney SD, Javaherian K, Farley J, Grimaila R, Lynn D, 

PI Petro-Breyer J. 

DR WPI; 90-147824/19. 

DR N-PSDB; Q04276. 

PT Principal neutralising domain of HIV variants - used for producing 

PT peptide(s) and antibodies for diagnosis; prophylaxis; and/or therapy 

PT of HIV infection. 

PS Disclosure; 108pp; English. 

CC The protein can be expressed in simian cells, and synthesis of HIV 

CC proteins can be detected immunologically. The recombinant protein 

CC product comprises a principal neutralising domain. The neutralising 

CC domain is bounded by cysteine residues which occur at positions 

CC 296 and 331; the segments between the residues form a loop. 

CC See also R04427-R04506 and Q04273-Q04279. 

SQ Sequence 229 AA; 

SQ 9 A; 17 R; 17 N; 12 D; 0 B; 6 C; 9 Q; 13 E; 0 Z; 16 G; 4 H; 

SQ 24 I; 13 L; 17 K; 4 M; 11 F; 10 P; 13 S; 19 T; 3 W; 4 Y; 8 V; 



PS Disclosure; Page 47-52; 67pp; English. 

CC The sequences given in R26982-3 contain part of the exotoxin A (ETA) 

CC sequence corresponding to positions 252-613 of the full exotoxin A 

CC sequence. These sequences are encoded by Fv(FRP5)-ETA fusion genes. 

CC The ETA sequence was used as a marker gene so that E. coli transformed 

CC with the fusion gene could be identified. The fusion genes were 

CC expressed in E. coli and the antibodies were extracted. These 

CC recombinant antibodies can be used for the qualitative and 

CC quantitative determination of c-erbB-2. This can be used for 

CC monitoring or in-vivo localisation of tumours overexpressing c-erbB-2. 

SQ Sequence 637 AA; 

SQ 69 A; 36 R; 15 N; 34 D; 0 B; 8 C; 37 Q; 36 E; 0 Z; 72 G; 9 H; 

SQ 22 I; 54 L; 21 K; 5 M; 24 F; 36 P; 48 S; 40 T; 11 W; 24 Y; 36 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:22-PDT using FindSeq 

Initial Score = 7 Optimized Score = 7 Significance = 5.02 
Residue Identity = 43% Matches = 7 Mismatches = 9 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 

||||| || 

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTSNHTVERLLQAHRQLEERGYVFVGYHGTFLEAASSI 
410 420 430 440 450 X 460 470 

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR 
480 490 500 510 



15. US-08-249-182-9 (1-16) 
R34018 BW 835 VH. 

ID R34018 standard; Protein; 115 AA. 

AC R34018; 

DT 02-AUG-1993 (first entry) 

DE BW 835 VH. 

KW Monoclonal antibody; MAb; hybridoma; lung; adenocarcinoma; 

KW mammary; ovary; prostate; polymorphic epithelial mucin; PEM. 

OS Synthetic. 

PN DE4133791-A. 

PD 15-APR-1993. 

PF 11-OCT-1991; 133791. 

PR 11-OCT-1991; DE-133791. 

PA (BEHW ) BEHRINGWERKE AG. 

PI Bosslet K, Pfleiderer P, Seemann G; 

DR WPI; 93-127068/16. 

DR N-PSDB; Q40046. 

PT New monoclonal antibody BW835 specific for tumour antigens - 

PT useful for diagnosis and treatment of tumours affecting the 

PT breasts, ovaries, prostate and lungs 

PS Disclosure; Fig 1a; 24pp; German. 

CC Monoclonal antibody BW 835 is produced by hybridoma cell line BW 835. 

CC The antibody strongly reacts with lung adenocarcinomas and human 

CC mammary-, ovary- and prostate carcinomas. It additionally reacts 

CC with polymorphic epithelial mucin (PEM) but does not react with 

CC normal human tissue. 

SQ Sequence 115 AA; 

SQ 7 A; 7 R; 6 N; 4 D; 0 B; 2 C; 5 Q; 6 E; 0 Z; 8 G; 1 H; 

SQ 4 I; 8 L; 5 K; 4 M; 4 F; 2 P; 12 S; 8 T; 4 W; 10 Y; 8 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:53-PDT using FindSeq 

Initial Score = 6 Optimized Score = 6 Significance = 4.18 
Residue Identity = 37% Matches = 6 Mismatches = 10 

Gaps = 0 Conservative Substitutions = 0 



PT for diagnosing and treating tumours expressing C-erbB-2 e.g. breast 

PT or ovarian tumours 

PS Disclosure; Page 53-58; 67pp; English. 

CC The sequences given in R26982-3 contain part of the exotoxin A (ETA) 

CC sequence corresponding to positions 252-613 of the full exotoxin A 

CC sequence. These sequences are encoded by Fv(FRP5)-ETA fusion genes. 

CC The ETA sequence was used as a marker gene so that E. coli transformed 

CC with the fusion gene could be identified. The fusion genes were 

CC expressed in E. coli and the antibodies were extracted. These 

CC recombinant antibodies can be used for the qualitative and 

CC quantitative determination of c-erbB-2. This can be used for 

CC monitoring or in-vivo localisation of tumours overexpressing c-erbB-2. 

SQ Sequence 637 AA; 

SQ 68 A; 35 R; 12 N; 38 D; 0 B; 8 C; 35 Q; 36 E; 0 Z; 72 G; 10 H; 

SQ 26 I; 59 L; 21 K; 5 M; 16 F; 35 P; 53 S; 36 T; 11 W; 28 Y; 33 V; 

CC Retrieved by shears on Wed 21 Sep 94 11:58:22-PDT using FindSeq 

Initial Score = 7 Optimized Score = 7 Significance = 5.02 
Residue Identity = 43% Matches = 7 Mismatches = 9 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 
||||| || 

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLSAHRQLEERGYVFVGYHGTFLEAAQSI 
410 420 430 440 450 X 460 470 

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR 
480 490 500 510 



14. US-08-249-182-9 (1-16) 

R26982 (FRP5)-ETA fusion protein. 

ID R26982 standard; Protein; 637 AA. 

AC R26982; 

DT 11-FEB-1993 (first entry) 

DE (FRP5)-ETA fusion protein. 

KW Monoclonal antibody; light chain; heavy chain; tumour; c-erbB-2; 

KW variable region; ETA. 

OS Pseudomonas aeruginosa PAK. 

FH Key Location/Qualifiers 

FT Peptide 1..21 

FT /label= ompA_signal_peptide 

FT Peptide 22..29 

FT /label= FLAG_peptide_and_enterokinase_cleavage_site 

FT Domain 33..151 

FT /label= FRP5_heavy_chain_variable_domain 

FT Peptide 152..166 

FT /label= Linker 

FT Domain 167..274 

FT /label= FRP5_light_chain_variable_domain 

FT Protein 276..397 

FT /label= ETA_252-613 

PN EP-502812-A. 

PD 09-SEP-1992. 

PF 27-JAN-1992; 810056. 

PR 05-FEB-1991; EP-810079. 

PA (CIBA ) CIBA GEIGY AG. 

PI Groner B, Hardman N, Harwerth I, Hynes NE, Wels WS, 

PI Zwickl M; 

DR WPI; 92-302096/37. 

DR N-PSDB; Q28262. 

PT Recombinant antibodies directed to growth factor receptor C-erbB-2 - 

PT for diagnosing and treating tumours expressing C-erbB-2 e.g. breast 



PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 8 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 37 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



13. US-08-249-182-9 (1-16) 

R26983 (FRP5)-ETA fusion protein.

ID R26983 standard; Protein; 637 AA.
AC R26983;
DT 11-FEB-1993 (first entry)
DE (FRP5)-ETA fusion protein.
KW Monoclonal antibody; light chain; heavy chain; tumour; c-erbB-2;
KW variable region; ETA.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Peptide 1..21
FT /label= ompA_signal_peptide
FT Peptide 22..29
FT /label= FLAG_peptide_and_enterokinase_cleavage_site
FT Domain 33..152
FT /label= FRP5_heavy_chain_variable_domain
FT Peptide 153..167
FT /label= Linker
FT Domain 168..274
FT /label= FRP5_light_chain_variable_domain
FT Protein 276..637
FT /label= ETA_252-613
PN EP-502812-A.
PD 09-SEP-1992.
PF 27-JAN-1992; 810056.
PR 05-FEB-1991; EP-810079.
PA (CIBA ) CIBA GEIGY AG.
PI Groner B, Hardman N, Harwerth I, Hynes NE, Wels WS;
PI Zwickl M;
DR WPI; 92-302096/37.
DR N-PSDB; Q28263.



CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 14 K; 6 M; 14 F; 38 P; 37 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



12. US-08-249-182-9 (1-16) 

R40102 Pseudomonas exotoxin for site-specific mutation wi

ID R40102 standard; Protein; 613 AA.
AC R40102;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin for site-specific mutation with unpaired CYS.
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 20
FT /note= "unpaired cysteine residue may replace Lys"
FT Misc_difference 25
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 88
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 96
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 158
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 182
FT /note= "unpaired cysteine residue may replace Arg"
FT Misc_difference 188
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 192
FT /note= "unpaired cysteine residue may replace Ser"
FT Misc_difference 223
FT /note= "unpaired cysteine residue may replace Lys"
FT Misc_difference 245
FT /note= "unpaired cysteine residue may replace Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine



PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



11. US-08-249-182-9 (1-16) 

R40104 Pseudomonas exotoxin (K20C).

ID R40104 standard; Protein; 613 AA.
AC R40104;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (K20C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 20
FT /note= "unpaired cysteine residue replaces Lys"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their



FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



10. US-08-249-182-9 (1-16) 

R40105 Pseudomonas exotoxin (S25C).

ID R40105 standard; Protein; 613 AA.
AC R40105;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S25C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 25
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine



DE Pseudomonas exotoxin (S96C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 96
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490



9. US-08-249-182-9 (1-16)

R40106 Pseudomonas exotoxin (S88C).

ID R40106 standard; Protein; 613 AA.
AC R40106;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S88C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 88



460 470 480 490 



7. US-08-249-182-9 (1-16) 

R40108 Pseudomonas exotoxin (S158C).

ID R40108 standard; Protein; 613 AA.
AC R40108;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S158C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 158
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



8. US-08-249-182-9 (1-16) 

R40107 Pseudomonas exotoxin (S96C).

ID R40107 standard; Protein; 613 AA.
AC R40107;



Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490 



6. US-08-249-182-9 (1-16) 

R40109 Pseudomonas exotoxin (R182C).

ID R40109 standard; Protein; 613 AA.
AC R40109;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (R182C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 182
FT /note= "unpaired cysteine residue replaces Arg"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 43 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 37 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450 



CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490
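
The database entries in this listing use a flat-file layout in which the first two characters of each line give the field type (ID, AC, DT, DE, KW, OS, FT, PN and so on). The sketch below shows one way such an entry could be grouped by line code in Python. The function name parse_entry is hypothetical, and the sample lines are a hand-keyed excerpt of one exotoxin entry, so treat it as an illustration and not as the format's official parser.

```python
from collections import defaultdict

def parse_entry(lines):
    """Group the text of each line under its two-letter line code."""
    fields = defaultdict(list)
    for line in lines:
        code, _, text = line.partition(" ")
        # Only accept codes made of exactly two capital letters.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(text.strip())
    return dict(fields)

# Hand-keyed excerpt of one entry (assumed clean text, not raw scanner output).
entry = [
    "ID R40110 standard; Protein; 613 AA.",
    "AC R40110;",
    "DT 27-JAN-1994 (first entry)",
    "DE Pseudomonas exotoxin (S188C).",
    "KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;",
    "KW steric unpaired cysteine; s.u.c.",
    "OS Pseudomonas aeruginosa.",
]

fields = parse_entry(entry)
# Keywords are separated by semicolons and may continue over several KW lines.
keywords = [kw.strip() for line in fields["KW"] for kw in line.split(";") if kw.strip()]
print(fields["AC"])
print(keywords)
```

Keeping every field as a list preserves multi-line fields such as KW, PT and CC in their original order.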



5. US-08-249-182-9 (1-16)

R40110 Pseudomonas exotoxin (S188C).

ID R40110 standard; Protein; 613 AA.
AC R40110;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S188C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 188
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02



PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 14 K; 6 M; 14 F; 38 P; 37 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490



4. US-08-249-182-9 (1-16) 

R40111 Pseudomonas exotoxin (S192C).

ID R40111 standard; Protein; 613 AA.
AC R40111;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S192C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 192
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which



PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting
PS Claim 3; Page 20-23; 30pp; English.
CC The new mutated toxin has an unpaired cysteine residue in
CC or near the cytotoxin's receptor-binding site, and retains the
CC same receptor-binding ability and cytotoxicity as the native
CC cytotoxins provided they are not conjugated with a binding mol.
CC The toxins are cross-linked through the free SH group of their
CC unpaired cysteine residues to binding mols. (including monoclonal
CC antibodies, fragments and other ligands) to form immunotoxins, and
CC these immunotoxins do not bind to the cell surface receptors which
CC are bound by the native cytotoxin. However, when the cross-linker
CC is cleaved and the binding mol. is released, the cytotoxin regains
CC its receptor-binding ability and its cytotoxicity.
SQ Sequence 613 AA;
SQ 68 A; 44 R; 21 N; 36 D; 0 B; 9 C; 27 Q; 43 E; 0 Z; 56 G; 15 H;
SQ 26 I; 66 L; 15 K; 6 M; 14 F; 38 P; 36 S; 27 T; 11 W; 18 Y; 37 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:20-PDT using FindSeq

Initial Score = 7 Optimized Score = 7 Significance = 5.02
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
380 390 400 410 420 430 440 X 450

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
460 470 480 490



3. US-08-249-182-9 (1-16) 

R40112 Pseudomonas exotoxin (K223C).

ID R40112 standard; Protein; 613 AA.
AC R40112;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (K223C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 223
FT /note= "unpaired cysteine residue replaces Lys"
PN WO9315113-A.
PD 05-AUG-1993.
PF 15-JAN-1993; U00358.
PR 24-JAN-1992; US-825396.
PA (TANO-) TANOX BIOSYSTEMS INC.
PI Chang TW;
DR WPI; 93-258616/32.
PT Site-specifically mutated cytotoxin(s) with an unpaired cysteine
PT - such that conjugation of a binding mol. to the Cys blocks
PT receptor binding used as immuno:toxins for highly specific
PT targetting



40. R05506 Acyltransferase penicillin bi 357 5 5 3.35 



1. US-08-249-182-9 (1-16)

R37451 Autotaxin peptide ATX 101.

ID R37451 standard; peptide; 16 AA.
AC R37451;
DT 22-JUL-1993 (first entry)
DE Autotaxin peptide ATX 101.
KW Cell motility stimulating; cancer metastasis; antibody; detection;
KW immunostains; disease outcome prediction; therapy choice;
KW cancer therapy; crosslinked toxins.
OS Synthetic.
PN US7822043-A.
PD 01-JAN-1993.
PF 17-JAN-1992; 822043.
PR 17-JAN-1992; US-822043.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.
PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;
DR WPI; 93-085861/10.
PT Motility stimulating protein named autotaxin - useful in cancer
PT diagnosis and therapy
PS Example; Page 33; 36pp; English.
CC The sequence is that of autotaxin peptide ATX 101. It may be used to
CC raise anti-autotaxin antibodies which can be used to diagnose cancer
CC metastasis and in immunostains of patient samples to detect the
CC presence of autotaxin. The level of autotaxin in tissue or body
CC fluids can be used to predict disease outcomes and/or choice of
CC therapy which may also include autotaxin inhibitors. Autotaxin
CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer
CC therapy.
SQ Sequence 16 AA;
SQ 0 A; 0 R; 1 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 2 G; 1 H;
SQ 0 I; 0 L; 1 K; 0 M; 2 F; 1 P; 1 S; 2 T; 0 W; 1 Y; 3 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 15 Optimized Score = 15 Significance = 11.71
Residue Identity = 93% Matches = 15 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||| ||||||||||||
VNSHQTVFVGYGPTFK
X 10 X
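
The "Residue Identity", "Matches" and "Mismatches" figures reported for each ungapped alignment can be reproduced by comparing the two aligned strings position by position. The sketch below is a hedged illustration in Python. The function name ungapped_identity is hypothetical. The database peptide is assumed to differ from the query only at position 4 (H in place of M), which is what the entry's composition line (1 H, 0 M) and the single reported mismatch suggest. Truncating the percentage to an integer is also an assumption, inferred from 15/16 being printed as 93% and 7/16 as 43%.

```python
def ungapped_identity(query, subject):
    """Return (matches, mismatches, truncated percent identity) for two
    equal-length, gap-free aligned sequences."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(q == s for q, s in zip(query, subject))
    mismatches = len(query) - matches
    percent = 100 * matches // len(query)  # integer truncation (assumed)
    return matches, mismatches, percent

query = "VNSMQTVFVGYGPTFK"

# Hit 1: autotaxin peptide, assumed to carry H at position 4.
peptide_hit = ungapped_identity(query, "VNSHQTVFVGYGPTFK")

# Hits 2-14: the 16-residue exotoxin window lying under the query,
# read off the alignment so that VFVGY sits under VFVGY.
exotoxin_hit = ungapped_identity(query, "LEERGYVFVGYHGTFL")

print(peptide_hit)
print(exotoxin_hit)
```

With a unitary similarity matrix and no gaps, the number of matches equals the reported initial score for these alignments.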



2. US-08-249-182-9 (1-16) 

R40113 Pseudomonas exotoxin (S245C).

ID R40113 standard; Protein; 613 AA.
AC R40113;
DT 27-JAN-1994 (first entry)
DE Pseudomonas exotoxin (S245C).
KW Pseudomonas exotoxin; PE; diphtheria toxin; DT; immunotoxin;
KW target site; cytotoxin; unpaired cysteine; receptor; binding site;
KW monoclonal antibody; ligand; cell surface; mutation;
KW steric unpaired cysteine; s.u.c.
OS Pseudomonas aeruginosa.
FH Key Location/Qualifiers
FT Misc_difference 245
FT /note= "unpaired cysteine residue replaces Ser"
PN WO9315113-A.
PD 05-AUG-1993.



Scores:   Mean    Median    Standard Deviation
          1       3         1.20

Times:    CPU            Total Elapsed
          00:00:08.92    00:00:08.00

Number of residues:               482836
Number of sequences searched:     5543
Number of scores above cutoff:    3295



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
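
The significance column appears to express each initial score as the number of standard deviations it lies above the mean score of the search, which is also how the "standard deviations above mean" banners in the hit list group the hits. A minimal Python sketch of that calculation follows. The function name significance is hypothetical, and the mean (1) and standard deviation (1.20) are the rounded values printed in the score summary, so the results only approximate the printed 11.71, 5.02 and 3.35.

```python
def significance(score, mean, sd):
    """Number of standard deviations by which a score exceeds the mean."""
    return (score - mean) / sd

MEAN = 1.0   # as printed, presumably rounded
SD = 1.20    # as printed

for score in (15, 7, 5):
    print(score, round(significance(score, MEAN, SD), 2))
```

The small gap between these values and the printed ones would be explained by the program using an unrounded mean slightly below 1. That is an inference from the numbers, not something the printout states.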

A 100% identical sequence to the query sequence was not found.



The list of best scores is: 

                                                    Init.   Opt.
Sequence Name Description                   Length  Score  Score   Sig.  Frame

            **** 11 standard deviations above mean ****
 1. R37451    Autotaxin peptide ATX 101.        16     15     15  11.71      0
            **** 5 standard deviations above mean ****
 2. R40113    Pseudomonas exotoxin (S245C).    613      7      7   5.02      0
 3. R40112    Pseudomonas exotoxin (K223C).    613      7      7   5.02      0
 4. R40111    Pseudomonas exotoxin (S192C).    613      7      7   5.02      0
 5. R40110    Pseudomonas exotoxin (S188C).    613      7      7   5.02      0
 6. R40109    Pseudomonas exotoxin (R182C).    613      7      7   5.02      0
 7. R40108    Pseudomonas exotoxin (S158C).    613      7      7   5.02      0
 8. R40107    Pseudomonas exotoxin (S96C).     613      7      7   5.02      0
 9. R40106    Pseudomonas exotoxin (S88C).     613      7      7   5.02      0
10. R40105    Pseudomonas exotoxin (S25C).     613      7      7   5.02      0
11. R40104    Pseudomonas exotoxin (K20C).     613      7      7   5.02      0
12. R40102    Pseudomonas exotoxin for site    613      7      7   5.02      0
13. R26983    (FRP5)-ETA fusion protein.       637      7      7   5.02      0
14. R26982    (FRP5)-ETA fusion protein.       637      7      7   5.02      0
            **** 4 standard deviations above mean ****
15. R34018    BM 835 VH.                       115                 4.18      0
16. R09423    Br-3 Heavy Chain V Region (no    143                 4.18      0
17. R28582    HCV amino acid sequence contg   2436                 4.18      0
18. R29527    HCV antigen T7N1-30.            2510                 4.18      0
19. R24440    Composite HCV HC-J1/CDC/CHI p   2894                 4.18      0
20. R20111    Non-A, non-B viral genome pro   3010                 4.18      0
21. R20091    Non-A, non-B viral genome pro   3010                 4.18      0
22. R31621    Hepatitis C virus (HCV) polyp   3011                 4.18      0
23. R22154    NANBV Hutch c59 isolate genom   3011                 4.18      0
            **** 3 standard deviations above mean ****
24. R33087    Human cytomegalovirus antibod     31      5      5   3.35      0
25. R33086    Human cytomegalovirus antibod     31      5      5   3.35      0
26. R33089    Human cytomegalovirus antibod     32      5      5   3.35      0
27. R33091    Human cytomegalovirus antibod     51      5      5   3.35      0
28. R25410    Heavy chain variable domain o    114      5      5   3.35      0
29. R40215    Sequence of mouse hybridoma c    119      5      5   3.35      0
30. R12358    Heavy chain variable region o    134      5      5   3.35      0
31. R12326    Heavy chain variable region o    136      5      5   3.35      0
32. R42212    CRABP-1 gene product.            137      5      5   3.35      0
33. R30484    VH region of Ab to pre-S2 ant    139      5      5   3.35      0
34. R32128    Anti-IL2R-alpha antibody H-21    183      5      5   3.35      0
35. R15326    IL-2 chimeric antibody heavy     183      5      5   3.35      0
36. R33252    HBsAg encoded by pGPD-1 (HBS).   226      5      5   3.35      0
37. R11496    RP142/HBsAg.                     250      5      5   3.35      0
38. R11495    RP135/HBsAg.                     251      5      6   3.35      0


Query sequence being compared: US-08-249-182-9 (1-16)
Number of sequences searched: 5543
Number of scores above cutoff: 3295



Results of the initial comparison of US-08-249-182-9 (1-16) with: 
File : /home/shears/loring/loring.pep



[Histogram: NUMBER OF SEQUENCES (log scale, 5 to 10000) versus SCORE (0 to 15) and STDEV (0 to 9)]



PARAMETERS 



Similarity matrix          Unitary     K-tuple               2
Mismatch penalty           1           Joining penalty      20
Gap penalty                1.00        Window size           5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40          Alignments to save   15
Optimized scores to save   0           Display context      50





II Ml 

GNLACLSLCHIETERAPSRAPTITLKKTPKPKTTKKPTKTTIHHRTSPETKLQPKNMTATP98GILSSTEHH 
180 190 200 210 220 230 240 

TN9STTQI 
250 



15. US-08-249-182-8 (1-11)
R29637 pCTD ORF 2.

ID R29637 standard; Protein; 354 AA.
AC R29637;
DT 05-FEB-1993 (first entry)
DE pCTD ORF 2.
KW CT; pCTD; epithelium; ocular mucosa; uro-genital mucosa; antigen;
KW monoclonal; polyclonal; antibody; vaccine.
OS Chlamydia trachomatis.
PN EP-499681-A.
PD 26-AUG-1992.
PF 17-APR-1991; 106110.
PR 07-FEB-1991; IT-000314.
PA (ISTS ) SCLAVO SPA.
PI Comanducci M, Giuliani MM, Ratti G, Tecce MF;
DR WPI; 92-285922/35.
DR N-PSDB; 827429.
PT pCTD plasmid from Chlamydia trachomatis and immunogenic proteins
PT - for diagnosing and vaccinating against Chlamydia infections
PT e.g. venereal lymphogranuloma
PS Claim 1; Page 8-16; 40pp; English.
CC The sequences given in R29636-43 are encoded by the plasmid isolated
CC from Chlamydia trachomatis (CT) serotype D, pCTD. This serotype
CC generally infects epithelial tissues, such as the ocular and
CC uro-genital mucous membranes, and shows a low virulence. Of the eight
CC proteins encoded by the plasmid, seven are encoded by the sense strand
CC and the eighth is encoded by the complementary strand. These proteins
CC can be used as antigens for the preparation of poly- and mono-clonal
CC antibodies to be used in diagnostics. The antigens can also be used
CC in the formulation of vaccines against infections due to CT.
SQ Sequence 354 AA;
SQ 20 A; 23 R; 21 N; 10 D; 0 B; 4 C; 14 Q; 29 E; 0 Z; 14 G; 7 H;
SQ 32 I; 31 L; 32 K; 5 M; 16 F; 14 P; 23 S; 20 T; 7 W; 16 Y; 16 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:22-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.30
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 
GGQPLWITATK
I II I II 

FGFEGM IPR I RFTKTEFLE A YG VKRYKTSRNKYEFSGKEAETALEAL YHLGH9PFL I VATRTRHTNGTQ I VD 
90 100 110 120 130 X 140 X 150 

RYQTLSPIIRIYEGHEALTDEENIDIDLTPFNSPPTRKH 
160 170 180 190 

> 0 < 

0| |0 IntelliGenetics

> 0 <

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4 



Results file us-08-249-182-9.res made by on Wed 21 Sep 94 12:04:15-PDT.



CC Retrieved by shears on Wed 21 Sep 94 11:57:33-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.30
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
GGQPLWITATK 
II I II I 

MSDQATTLRIKPLGDRILVKREEEDSTARGGIILPDTAKKK6DRAEVLVLGTGKRDKDGNLLPFEVTVGDTV 
10 20 30 40 50 60 70 

LIDKYAGQELTVDGEEYV 
80 90 



14. US-08-249-182-8 (1-11) 

R21326 Sequence of protein G. 

ID R21326 standard; Protein; 257 AA.
AC R21326;
DT 17-MAY-1992 (first entry)
DE Sequence of protein G.
KW Subunit vaccine; recombinant virus; antibody; probe;
KW passive immunisation.
OS Bovine respiratory syncytium virus isolate 391-2.
FH Key Location/Qualifiers
FT Domain 1..37
FT /label= cytoplasmic
FT Domain 38..65
FT /label= transmembrane
FT Domain 66..257
FT /label= extracellular
PN WO9201471-A.
PD 06-FEB-1992.
PF 23-JUL-1991; U05194.
PR 24-JUL-1990; US-557267.
PA (UABR-) UAB RES FOUND.
PI Wertz GW, Lerch R;
DR WPI; 92-064708/08.
DR N-PSDB; Q2U50.
PT DNA encoding bovine respiratory syncytium virus - G, F and N
PT proteins, used to produce recombinant proteins and antibodies,
PT for vaccine and diagnosis of BRS viral infection
PS Claim 22; Fig 3; 124pp; English.
CC The inventors claim recombinant DNA mols. which comprise nucleic
CC acid sequences encoding bovine respiratory syncytial (BRS) virus
CC proteins G, F and N respectively, and BRS virus G, F and N proteins
CC and fragments prepd. by culturing transformed microorganisms and
CC cells. G-protein DNA is pref. contained in plasmid pRLG414-76-191
CC (ATCC No 40841), F-protein DNA in pRLF2012-76-1902 (ATCC No 40842),
CC N-protein in pRLNB3-76 (ATCC No 40843). For expression, a
CC regulatory sequence is also included. The resulting mols. are
CC contained in rVG642 (ATCC No VR2276), and rVF-464 (ATCC VR2277) for
CC G-protein and F-protein respectively.
SQ Sequence 257 AA;
SQ 12 A; 7 R; 16 N; 2 D; 0 B; 5 C; 17 Q; 8 E; 0 Z; 8 G; 14 H;
SQ 18 I; 21 L; 23 K; 2 M; 6 F; 20 P; 24 S; 42 T; 1 W; 6 Y; 5 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:50-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



X X 



KW GroEL.
OS Chlamydia trachomatis serovar A.
PN US7531317-A.
PD 09-JUL-1991.
PF 31-MAY-1990; 143560.
PR 31-MAY-1990; US-531317.
PA (USSH ) NAT INST OF HEALTH.
DR WPI; 91-245693/33.
DR N-PSDB; 013137.
PT DNA encoding HypA and HypB Chlamydia proteins - used to develop
PT prods. for detection of and vaccines against Chlamydia infection.
PS Disclosure; Fig 7; 51pp; English.
CC The sequence was deduced from the first of two ORFs found in the
CC hyp operon of clone pTA571CC prepd. from C. trachomatis genomic DNA.
CC It is the HypA hypersensitivity protein, analogous to the E. coli
CC GroES protein. It can be used to raise antibodies and to
CC prepare vaccines for the treatment of Chlamydial infections.
CC See also R13334-R13337.
SQ Sequence 102 AA;
SQ 7 A; 4 R; 0 N; 8 D; 0 B; 0 C; 8 Q; 10 E; 0 Z; 9 G; 0 H;
SQ 8 I; 10 L; 10 K; 2 M; 1 F; 3 P; 4 S; 6 T; 0 W; 2 Y; 10 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:33-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.30
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
GGQPLWITATK
II I II I 

MSD0ATTLKIKPLGDRILVKREEEASTARGGIILPDTAKKK9DRAEVLALGTGKKDDKG8QLPFEV8VGDIV 
10 20 30 40 50 60 70 

LIDKYSGQELTVEGEEYV 
80 90 



13. US-08-249-182-8 (1-11) 
R13334 HypA protein. 

ID R13334 standard; Protein; 102 AA.
AC R13334;
DT 22-OCT-1991 (first entry)
DE HypA protein.
KW Antibodies; heat shock; hypersensitive; allergen; HSP60; GroES.
OS Chlamydia psittaci GPIC.
PN US7531317-A.
PD 09-JUL-1991.
PF 31-MAY-1990; 143560.
PR 31-MAY-1990; US-531317.
PA (USSH ) NAT INST OF HEALTH.
DR WPI; 91-245693/33.
DR N-PSDB; 913136.
PT DNA encoding HypA and HypB Chlamydia proteins - used to develop
PT prods. for detection of and vaccines against Chlamydia infection.
PS Disclosure; Fig 5; 51pp; English.
CC The sequence was deduced from the first of two ORFs found in clone
CC pGP57, prepd. from C. psittaci genomic DNA and contg. the Hyp
CC operon. It is the hypA protein, of approx. 12 kD, analogous to
CC the GroES heat shock protein of E. coli. The recombinant protein
CC can be used to raise antibodies and in the preparation of
CC vaccines for the treatment of Chlamydial infections.
CC See also R13335-R13337.
SQ Sequence 102 AA;
SQ 6 A; 6 R; 1 N; 10 D; 0 B; 0 C; 4 Q; 10 E; 0 Z; 9 G; 0 H;



CC antibodies by immunoassay methods. 

SQ Sequence 70 AA;
SQ 6 A; 4 R; 1 N; 2 D; 0 B; 2 C; 1 Q; 4 E; 0 Z; 4 G; 2 H;
SQ 1 I; 7 L; 5 K; 4 M; 1 F; 1 P; 2 S; 6 T; 7 W; 4 Y; 6 V;
CC Retrieved by shears on Wed 21 Sep 94 11:56:53-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.30
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

I III II 

MRVKEKY8HLWRWGWRUGTPILLGMLI1ICSATEKLWVRVYYGVPLHKEATTTLFCASDAKAYDTEVHNVMA 
10 20 30 40 50 60 70



11. US-08-249-182-8 (1-11) 

P91962 Polypeptide encoded by cDNA 50F1. 



ID P91962 standard; polypeptide; 74 AA. 

AC P91962; 

DT 21-FEB-1990 (first entry) 

DE Polypeptide encoded by cDNA S0F1. 

KW Human colonic mucosa; colonic cancer; antibody.
OS Homo sapiens.
PN EP-337498-A.
PD 18-OCT-1989.
PF 17-APR-1989; 106875.
PR 15-APR-1988; US-182185.
PA (MONT-) Montefiore Medical Center.
PI Augenlicht LH;
DR WPI; 89-302259/42.
DR N-PSDB; N91510.
PT Monitoring stage of malignant disease, esp. colonic cancer - by measuring
PT relative abundance of specific RNA, also new therapeutic RNA, polypeptide
PT and antibody.
PS Disclosure; Fig. 7; 20pp; English.
CC The polypeptide is expressed by cDNA S0F1. It can be used for treating and
CC preventing colonic cancer of human colonic mucosa. Mono- or polyclonal
CC antibodies raised against the polypeptide can be used for its purificn.
CC and for immunoassay of the polypeptide.
SQ Sequence 74 AA;
SQ 1 A; 8 R; 4 N; 1 D; 0 B; 3 C; 2 Q; 2 E; 0 Z; 10 G; 6 H;
SQ 7 I; 8 L; 1 K; 1 M; 4 F; 1 P; 6 S; 6 T; 0 W; 0 Y; 3 V;
CC Retrieved by shears on Wed 21 Sep 94 11:56:51-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 



X 10 
GGQPLWITATK 

II III 

GGGGGGGGHTS I TAHHSL I ENNRN8MRFKHCC I T I VTGSLFTLPTSLRVLRVSLH I DRRHLRLN I F 
X 10 X 20 30 40 50 60



12. US-08-249-182-8 (1-11) 
R13336 HypA protein.

ID R13336 standard; Protein; 102 AA.
AC R13336;
DT 22-OCT-1991 (first entry)

DE HypA protein. 



Initial Score = 5 Optimized Score = 5 Significance = 3.30 
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
II I II 

ZRDNHRSELYKYKVIKIEPLGI APTKAKRRVVQREKRX 
10 X 20 X 30 



10. US-08-249-182-8 (1-11) 

P94662 Protein sequence for the amino terminal portion of

ID P94662 standard; protein; 70 AA.
AC P94662;
DT 28-JUN-1990 (first entry)
DE Protein sequence for the amino terminal portion of the HTLV-III
DE envelope gp 160 and gp 120
KW Glycoproteins gp120 and gp160; HTLV-III antibodies; immunoassay;
KW HTLV-III.
OS HTLV-III.
FH Key Location/Qualifiers
FT Cleavage 90..91
FT /note="site at which leader sequence cleaved from gp 160"
FT Region 34
FT /note="residue determined by radiolabel sequence
FT analysis"
FT Region 36
FT /note="as above"
FT Region 38
FT /note="as above"
FT Region 42
FT /note="as above"
FT Region 44
FT /note="as above"
FT Region 52
FT /note="as above"
FT Region 54
FT /note="as above"
FT Region 65
FT /note="as above"
FT Region 68
FT /note="as above"
PN CA1247082-A.
PD 20-DEC-1988.
PF 12-NOV-1985; 495112.
PR 07-NOV-1985; US-795974, US-670361.
PA (HARD) Harvard College.
PI Essex ME;
DR WPI; 89-061499/09.
DR N-PSDB; N94662.
PT Human T-cell lymphotrophic virus type III -
PT useful for detection of human T-cell lymphotrophic virus type
PT III antibodies.
PS Disclosure; ? 27pp; English.
CC New polypeptides are immunologically cross-reactive with glycoproteins
CC gp 120 or gp 160 present in cells infected with HTLV-III, where gp 120
CC has a m.wt. of ca. 120,000 and gp 160 has a m.wt. of ca. 160,000
CC (90,000 in unglycosylated form). Also claimed are a method of assaying
CC a biological specimen for HTLV-III antibodies, comprising incubating the
CC specimen with the polypeptides, and a kit for this use. The polypeptides
CC comprise gp 120, gp 160 and their unglycosylated forms opt. labelled or
CC bound to an insoluble phase, or anti-idiotypic antibodies having an
CC antigenic determinant cross-reactive with gp120. Gp 160 is a precursor



vi. n t tint ni»<wti uvi i 



PI Avrameas A, Guillet JG, Moraillon A, Strosberg AD;
DR WPI; 92-217024/26.
PT Envelope protein fragments of feline immunodeficiency virus and
PT derived antibodies - useful for diagnosis, therapy and in
PT construction of model for HIV
PS Claim 8; Page 30; 47pp; French.
CC The peptide is derived from Petaluma strain FIV. It is useful as an
CC immunological reagent for the detection, diagnosis and monitoring of
CC immune deficiency in cats. It is recognised by at least one antibody
CC produced during infection and/or inoculation by FIV Petaluma strain.
CC For therapeutic use, the antibodies are conjugated with an oxidative
CC enzyme.
SQ Sequence 28 AA;
SQ 0 A; 0 R; 2 N; 0 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 3 G; 0 H;
SQ 2 I; 5 L; 4 K; 1 M; 0 F; 0 P; 2 S; 2 T; 0 W; 2 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:17-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 

II I II 
LGGKINISLCLTGGKHLYNKVTKQLSYC 
10 X 20 X 



9. US-08-249-182-8 (1-11)

P91359 Amino acids 482-517 of HIV glycoprotein gp120

ID P91359 standard; peptide; 38 AA.
AC P91359;
DT 12-APR-1990 (first entry)
DE Amino acids 482-517 of HIV glycoprotein gp120
KW Human immunodeficiency virus; HIV; glycoprotein; gp120;
KW immunoassay reagent; HIV antibodies; HIV vaccine.
OS Human immunodeficiency virus (HIV).
PN EP-317804-A.
PD 31-MAY-1989.
PF 03-NOV-1988; 118299.
PR 24-NOV-1987; US-124801.
PA (ABBO) Abbott Laboratories.
PI Sarin VK, Knigge KM;
DR WPI; 89-158923/22.
PT New HIV peptide(s)
PT - useful as immunoassay reagents, for prodn. of
PT antibodies and in vaccines
PS Claim 1(b); 14pp; English.
CC It is called Peptide II. It is encoded in the region between
CC bp 7197-7305. Z = -H, a blocking group, an amino acid, a peptide, protein
CC or any linker. X = -OH, -NH2, -NR(1)R(2) (wherein R corresponds to an
CC alkyl group), a peptide, protein or any linker. It may be produced by
CC a number of methods including solid phase synthesis and recombinant DNA
CC methodology. It provides a non-infectious and pure source of HIV antigens
CC for use as immunoassay reagents for detection of HIV antibodies in
CC biological samples, esp. human saliva, for prodn. of poly- and
CC monoclonal HIV antibodies, and in HIV vaccines. It can be used as a
CC source of HIV antigens alone or in combination or may be linked to
CC larger carrier molecules.
SQ Sequence 38 AA;
SQ 2 A; 6 R; 1 N; 1 D; 0 B; 0 C; 1 Q; 3 E; 0 Z; 1 G; 0 H;
SQ 3 I; 2 L; 6 K; 0 M; 0 F; 2 P; 1 S; 1 T; 1 W; 2 Y; 3 V;
SQ 2 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:56:49-PDT using FindSeq



7. US-08-249-182-8 (1-11) 

P98460 Sequence of C. trachomatis serovar J major outer m 



ID P98460 standard; Protein; 14 AA.
AC P98460;
DT 06-MAR-1992 (first entry)
DE Sequence of C. trachomatis serovar J major outer membrane protein (MOMP)
DE variable domain (VD) J-VDIII encoded by base pairs 742-783
KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97095.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 17; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and
CC methodologies applicable in the development of new diagnostic tests
CC for serotyping.
SQ Sequence 14 AA;
SQ 4 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:43-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 



X X 
GGQPLWITATK 

II III 
AEFPLDITAGTEAA 
X 10 



8. US-08-249-182-8 (1-11) 

R24868 Sequence of peptide fragment V which corresp. to r 

ID R24868 standard; Protein; 28 AA. 
AC R24868; 

DT 03-JAN-1992 (first entry) 

DE Sequence of peptide fragment V which corresp. to residues 292-320 of 
DE the env protein. 

KW env protein; immunological reagent; diagnosis; therapy; antibody. 

OS Feline immunodeficiency virus Petaluma strain.
PN WO9209632-A.
PD 11-JUN-1992.
PF 20-NOV-1991; F00917.
PR 21-NOV-1990; FR-014519.



CC for serotyping. 

SQ Sequence 14 AA; 

SQ 4 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:43-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

II III 
AEFPLNITAGTEAA 
X 10 



6. US-08-249-182-8 (1-11) 

P98468 Sequence of C. trachomatis serovar L3 major outer 

ID P98468 standard; Protein; 14 AA.
AC P98468;
DT 06-MAR-1992 (first entry)
DE Sequence of C. trachomatis serovar L3 major outer membrane protein (MOMP)
DE variable domain (VD) L3-VDIII encoded by base pairs 742-783
KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97103.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 19; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and
CC methodologies applicable in the development of new diagnostic tests
CC for serotyping.
SQ Sequence 14 AA;
SQ 4 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:43-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

II III 
AEFPLDITAGTEAA 
X 10 
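The Matches, Mismatches and Residue Identity figures printed with each alignment can be checked by hand. The following is a minimal sketch, assuming an ungapped, position-by-position comparison of the query against the database sequence and assuming the percentage is truncated rather than rounded (6 of 11 is printed as 54%). The function name and the `offset` parameter are illustrative and are not part of FastDB.

```python
def ungapped_identity(query: str, subject: str, offset: int = 0):
    """Compare query with subject[offset:], residue by residue, with no gaps.

    Returns (matches, mismatches, identity_percent). Query positions that
    fall beyond the end of the subject are counted as mismatches.
    """
    window = subject[offset:offset + len(query)]
    matches = sum(1 for q, s in zip(query, window) if q == s)
    mismatches = len(query) - matches
    # Truncation is an assumption inferred from the printed percentages.
    identity_percent = int(100 * matches / len(query))
    return matches, mismatches, identity_percent


query = "GGQPLWITATK"       # US-08-249-182-8 (1-11)
subject = "AEFPLDITAGTEAA"  # 14-residue MOMP variable-domain sequence
print(ungapped_identity(query, subject))
```

For this pair the sketch counts the matching residues P, L, I, T and A, which is consistent with the five matches and six mismatches reported in the printout.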



PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97083.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 14; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and
CC methodologies applicable in the development of new diagnostic tests
CC for serotyping.
SQ Sequence 14 AA;
SQ 4 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:43-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

II III 
AEFPLDITAGTEAA 
X 10 



5. US-08-249-182-8 (1-11) 

P98444 Sequence of C. trachomatis serovar C major outer m 

ID P98444 standard; Protein; 14 AA. 

AC P98444;
DT 06-MAR-1992 (first entry)
DE Sequence of C. trachomatis serovar C major outer membrane protein (MOMP)
DE variable domain (VD) C-VDIII encoded by base pairs 742-783
KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97079.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 13; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and



P98452 Sequence of C. trachomatis serovar H major outer m

ID P98452 standard; Protein; 14 AA.
AC P98452;
DT 06-MAR-1992 (first entry)
DE Sequence of C. trachomatis serovar H major outer membrane protein (MOMP)
DE variable domain (VD) H-VDIII encoded by base pairs 742-783
KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay; ss.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97037.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 15; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and
CC methodologies applicable in the development of new diagnostic tests
CC for serotyping.
SQ Sequence 14 AA;
SQ 4 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:43-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X X 
GGQPLWITATK 

II III 
AEFPLDITAGTEAA 
X 10 



4. US-08-249-182-8 (1-11) 

P98448 Sequence of C. trachomatis serovar A major outer m 

ID P98448 standard; Protein; 14 AA. 
AC P98448; 

DT 06-MAR-1992 (first entry) 

DE Sequence of C. trachomatis serovar A major outer membrane protein (MOMP) 

DE variable domain (VD) A-VDIII encoded by base pairs 742-783 

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.



SQ 1 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 2 G; 0 H;
SQ 1 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 2 T; 1 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 11 Optimized Score = 11 Significance = 8.24
Residue Identity = 100% Matches = 11 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

|||||||||||

GGQPLWITATK 
X 10 
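The two SQ composition lines carried by every entry are per-residue counts listed in a fixed order, so they can be regenerated from a sequence and used to cross-check a record. The sketch below assumes the residue order seen in the entries themselves (A R N D B C Q E Z G H on the first line, I L K M F P S T W Y V on the second); the function name is illustrative.

```python
from collections import Counter

# Residue order as it appears on the two SQ composition lines of the entries.
FIRST_LINE = "ARNDBCQEZGH"
SECOND_LINE = "ILKMFPSTWYV"


def composition_lines(sequence: str):
    """Return the two composition strings for a one-letter protein sequence."""
    counts = Counter(sequence.upper())
    first = " ".join(f"{counts[r]} {r};" for r in FIRST_LINE)
    second = " ".join(f"{counts[r]} {r};" for r in SECOND_LINE)
    return first, second


for line in composition_lines("GGQPLWITATK"):
    print("SQ", line)
```

The counts on the two lines should add up to the length given on the `SQ Sequence` line (plus any residues reported separately as `Others`), which is a convenient way to spot a misread digit in a record.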



2. US-08-249-182-8 (1-11)

P98464 Sequence of C. trachomatis serovar K major outer m

ID P98464 standard; Protein; 14 AA.
AC P98464;
DT 06-MAR-1992 (first entry)
DE Sequence of C. trachomatis serovar K major outer membrane protein (MOMP)
DE variable domain (VD) K-VDIII encoded by base pairs 742-783
KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;
KW diagnosis; serotyping; non-immunologic assay.
OS Chlamydia trachomatis.
PN US7324664-A.
PD 29-AUG-1989.
PF 17-MAR-1989; 324664.
PR 17-MAR-1989; US-324664.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;
DR WPI; 89-339697/46.
DR N-PSDB; N97099.
PT Chlamydia trachomatis genes - used for determn. of nucleotide and
PT amino sequences of the variable domains of the major outer
PT membrane proteins
PS Disclosure; Fig 18; 49pp; English.
CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis
CC serovars and the amino acid sequences were deduced. The MOMP VDs
CC with the greatest total hydrophilicity and charge values were found
CC to be the location of antigenic determinants recognised by MOMP
CC specific monoclonal antibodies. The nucleotide, amino acid
CC sequences and hydrophilicity/charge value analyses will assist in
CC the selection of appropriate MOMP antigenic determinants to be used
CC in the construction of synthetic peptides, subunits or recombinant
CC chlamydial vaccines. This will allow the prodn. of reagents and
CC methodologies applicable in the development of new diagnostic tests
CC for serotyping.
SQ Sequence 14 AA;
SQ 3 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:44-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.30
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 

II III 
VEFPLDITAGTEAA 
X 10 





 8. R24868   Sequence of peptide fragment       28    5    5   3.30   0
 9. P91359   Amino acids 482-517 of HIV gl      38    5    5   3.30   0
10. P94662   Protein sequence for the amin      70    5    6   3.30   0
11. P91962   Polypeptide encoded by cDNA 5      74    5    5   3.30   0
12. R13336   HypA protein.                     102    5    6   3.30   0
13. R13334   HypA protein.                     102    5    6   3.30   0
14. R21326   Sequence of protein G.            257    5    5   3.30   0
15. R29637   pCTD ORF 2.                       354    5    6   3.30   0
16. R39684   VCAM-6D/ICAM-1.                   643    5    5   3.30   0
17. R39685   VCAM-6D/ICAM-2.                   644    5    5   3.30   0
18. R39686   VCAM-6D/ICAM4-1.                  647    5    5   3.30   0
19. R38549   VCAM-6D.                          647    5    5   3.30   0
20. R05795   HIV-1 env mutein lacking hype     700    5    5   3.30   0
21. R38550   VCAM/ICAM-1.                      735    5    5   3.30   0
22. R39682   VCAM/ICAM-2.                      736    5    5   3.30   0
23. R38548   VCAM-7D.                          739    5    5   3.30   0
24. R08U8    Vascular cell adhesion molecu     739    5    5   3.30   0
25. R39683   VCAM/ICAM-3.                      740    5    5   3.30   0
26. R29706   env gene decoded from viral D     863    5    5   3.30   0
27. R43950   Ge protein fragment.             1150    5    5   3.30   0
28. R12608   EGRF-R erbB-3 clone E3-16 pro    1343    5    5   3.30   0






**** 2 standard deviations above mean ****








29. P93053   HIV env protein analogue (j).      11    4    4   2.47   0
30. R42333   EBV VCA peptide.                   12    4    5   2.47   0
31. R42332   EBV VCA peptide.                   12    4    4   2.47   0
32. R42331   EBV VCA peptide.                   12    4    4   2.47   0
33. R42330   EBV VCA peptide.                   12    4    4   2.47   0
34. R41295   Peptide fragment F7.               12    4    5   2.47   0
35. P98436   Sequence of C. trachomatis se      14    4    4   2.47   0
36. P98428   Sequence of C. trachomatis se      14    4    4   2.47   0
37. P98420   Sequence of C. trachomatis se      14    4    4   2.47   0
38. P98412   Sequence of C. trachomatis se      14    4    4   2.47   0
39. P98440   Sequence of C. trachomatis se      14    4    4   2.47   0
40. P93291   Sequence of Chlamydia trachom      14    4    4   2.47   0



1 . US-0B-249-182-8 (1-11) 

R37450 Autoiaxin peptide ATX 100. 

ID R37450 standard; peptide? 11 AA. 

AC R37450; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 100. 

KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW imnunostains; disease outcome prediction; therapy choice; 

KW cancer therapy? crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992? US-822043. 

PA (USSH ) US DEPT HEALTH St HUMAN SERVICE. 

PI Krutzsch Hr Liotta LA> Schiffmann Ei Stracke H. 

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33? 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 100. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy uhich may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 
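
Each database record above is a run of lines that begin with a two-letter code (ID, AC, DT, DE, KW, and so on). A minimal sketch of grouping such lines by code follows. The function name parse_record is hypothetical, and the reading of the codes as EMBL-style field tags is an assumption drawn from the layout of this printout, not from FastDB or database documentation.

```python
from collections import defaultdict


def parse_record(lines):
    """Group record lines by their leading two-letter code.

    Assumes each field line looks like 'XX text', where XX is two
    uppercase letters. Lines that do not fit (blank lines, alignment
    rows, OCR debris) are skipped.
    """
    fields = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        code, _, rest = line.partition(" ")
        if len(code) != 2 or not code.isalpha() or not code.isupper():
            continue
        fields[code].append(rest.strip())
    return dict(fields)


record = parse_record([
    "ID R37450 standard; peptide; 11 AA.",
    "",
    "AC R37450;",
    "DE Autotaxin peptide ATX 100.",
    "KW Cell motility stimulating; cancer metastasis; antibody; detection;",
    "KW cancer therapy; crosslinked toxins.",
])
print(record["AC"])       # accession field as printed in the record
print(len(record["KW"]))  # number of keyword lines collected
```

Multi-line fields such as KW and CC are kept as lists so the original line breaks are not lost.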



[Histogram, lower portion: x-axis SCORE (0 to 11) with STDEV (0 to 7) marked beneath; plotted points not recoverable]



PARAMETERS 



Similarity matrix            Unitary
K-tuple                      2
Mismatch penalty             1
Joining penalty              20
Gap penalty                  1.00
Window size                  5
Gap size penalty             0.05
Cutoff score                 0
Randomization group          0
Initial scores to save       40
Alignments to save           15
Optimized scores to save     0
Display context              50




SEARCH STATISTICS 


Scores:
                     Mean    Median    Standard Deviation
                     1       3         1.21

Times:
                     CPU            Total Elapsed
                     00:00:08.92    00:00:08.00

Number of residues:              482836
Number of sequences searched:    5543
Number of scores above cutoff:   3132



The scores below are sorted by initial score.
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was found:

Init. Opt. 

Sequence Name Description Length Score Score Sig. Frame 



1. R37450 Autotaxin peptide ATX 100. 11 11 11 8.24 0 

The list of other best scores is: 

Init. Opt. 

Sequence Name Description Length Score Score Sig. Frame 



**** 3 standard deviations above mean 



 2. P98464  Sequence of C. trachomatis se     14     5     5   3.30   0
 3. P98452  Sequence of C. trachomatis se     14     5     5   3.30   0
 4. P98448  Sequence of C. trachomatis se     14     5     5   3.30   0
 5. P98444  Sequence of C. trachomatis se     14     5     5   3.30   0
 6. P98468  Sequence of C. trachomatis se     14     5     5   3.30   0



CC 1 env protein. This protein and fusion proteins comprising it are 

CC useful for detecting antibodies to HTLV-1 in body fluids, eg blood,

CC where they provide a more sensitive and selective assay than 

CC current viral lysate tests. 

CC See also Q11553-58.

SQ Sequence 106 AA;

SQ 5 A; 6 R; 7 N; 5 D; 0 B; 2 C; 9 Q; 5 E; 0 Z; 8 G; 3 H;

SQ 6 I; 17 L; 6 K; 1 M; 2 F; 5 P; 7 S; 3 T; 3 W; 1 Y; 5 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:21-PDT using FindSeq

Initial Score = 4 Optimized Score = 5 Significance = 2.68
Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0 

X 10 
VPPFENIELY 
II II I 

KNLLKIAQYAAQNRRGLDLLFWESGGLCKALQEQCRFPNITNSHVPILQERPPLENRVLTGHGLNWDLGSVD
 30 40 50 60 70 80 X 90

LQPSLIS
100 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4 

Results file us-08-249-182-8.res made by on Wed 21 Sep 94 12:12:21-PDT.

Query sequence being compared: US-08-249-182-8 (1-11)
Number of sequences searched: 5543
Number of scores above cutoff: 3132

Results of the initial comparison of US-08-249-182-8 (1-11) with:
File : /home/shears/loring/lorin*.pep

[Histogram of initial scores: y-axis NUMBER OF SEQUENCES, log scale with ticks at 50, 100, 500, 1000, 5000 and 10000; plotted points not recoverable]



PR 18-SEP-1980; GB-030208. 

PR 22-OCT-1980; GB-034130.
PR 27-NOV-1980; GB-038147.
PR 08-APR-1981; GB-011064.
PR 18-AUG-1981; GB-025150.
PA (NATR ) National Res Dev Corp.
PA (WELL ) Wellcome Foundation Ltd.
PI Boothroyd JC, Cross GAM, Highfield PE, Winther MD, Rowlands DJ,
PI Brown F, Harris TJR, Lowe PA;
DR WPI; 82-26702E/14.
DR N-PSDB; N20019.
PT DNA corresp. to (part of) foot and mouth disease virus RNA - useful
PT in prepn. of vaccines for producing antibodies against the virus
PS Example; Fig 6; 57pp; English.
CC The inventors claim a DNA molecule comprising a nucleotide sequence
CC corresp. to all or a portion of foot-and-mouth disease virus RNA
CC (FMDV). The DNA molecule is esp. for a precursor of FMDV capsid
CC protein. It esp. codes for FMDV protein p88 and VP1-VP4. It may code
CC for VP4, VP2, VP3 and VP1 contiguously. The inventors also claim a
CC vaccine for stimulating prodn. of antibodies against FMDV in a
CC mammal which comprises at least one of the above recombinant
CC proteins produced by a host cell transformed with the DNA.
SQ Sequence 78 AA;
SQ 2 A; 2 R; 2 N; 0 D; 0 B; 3 C; 5 Q; 0 E; 0 Z; 4 G; 4 H;
SQ 0 I; 8 L; 1 K; 2 M; 3 F; 8 P; 10 S; 15 T; 3 W; 2 Y; 4 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:15-PDT using FindSeq 

Initial Score = 4 Optimized Score = 4 Significance = 2.68 
Residue Identity = 40% Matches = 4 Mismatches = 6 

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
||||

GALQTYWTWPKHVPPFFVSTMGNRTSLRGQTTPVFWPSLMSPLPQNTCSTHTYQGLHSTTHSTLVLSTCTSC
10 X 20 X 30 40 50 60 70 



15. US-08-249-182-7 (1-10) 

R11715 HTLV-1 env. gp21 epitope encoded by ENV93 sequence

ID R11715 standard; Protein; 106 AA.

AC R11715; 

DT 27-JUN-1991 (first entry) 

DE HTLV-1 env. gp21 epitope encoded by ENV93 sequence. 

KW Human T-cell leukaemia virus; HTLV-1; fusion protein; antibodies.
OS Human T-cell leukaemia virus.
PN EP-424748-A.
PD 02-MAY-1991.
PF 12-OCT-1990; 119560.
PR 23-OCT-1989; US-425252.
PA (HOFF ) Hoffmann-La Roche AG.
PI Buonagurio DA, Longiaru M;
DR WPI; 91-126308/18.
DR N-PSDB; Q11552.
PT New nucleic acid encoding conserved epitope of HTLV gp.21 - and
PT hybrids with other epitopes and the derived polypeptide(s) useful as
PT immunoassay reagents for detecting specific HTLV antibodies in serum
PS Claim 3; page 12; 31pp; English.
CC This epitope, from the immunodominant conserved region of HTLV-1
CC envelope (env) glycoprotein (gp)21, is encoded by the ENV93 gene
CC construct. It constitutes residues 342-434 of the gp21 sequence.
CC This ENV93 gene construct may be used alone or as a vehicle for the
CC high level expression of other epitopes of HTLV-1 env as fusion
CC proteins. In the 2nd case the ENV93 sequence is fused to a sequence



13. US-08-249-182-7 (1-10) 

P20022 Sequence of a foot and mouth disease virus capsid 



ID P20022 standard; Protein; 67 AA.

AC P20022; 

DT 20-AUG-1992 (first entry) 

DE Sequence of a foot and mouth disease virus capsid protein 

DE encoded by a region of recombinant plasmid pFA61/t76 

KW Vaccine; antibody; capsid protein; immunogen; antigen;

KW foot and mouth disease. 

OS Foot and mouth disease virus. 

PN EP--48455-A. 

PD 31-MAR-1982. 

PF 17-SEP-1981. 

PR 18-SEP-1980; GB-030208.
PR 22-OCT-1980; GB-034130.
PR 27-NOV-1980; GB-038147.
PR 08-APR-1981; GB-011064.
PR 18-AUG-1981; GB-025150.
PA (NATR ) National Res Dev Corp.
PA (WELL ) Wellcome Foundation Ltd.
PI Boothroyd JC, Cross GAM, Highfield PE, Winther MD, Rowlands DJ,
PI Brown F, Harris TJR, Lowe PA;
DR WPI; 82-26702E/14.
DR N-PSDB; N20020.
PT DNA corresp. to (part of) foot and mouth disease virus RNA - useful
PT in prepn. of vaccines for producing antibodies against the virus

PS Example; Fig 7; 57pp; English. 

CC The inventors claim a DNA molecule comprising a nucleotide sequence 

CC corresp. to all or a portion of foot-and-mouth disease virus RNA 

CC (FMDV). The DNA molecule is esp. for a precursor of FMDV capsid 

CC protein. It esp. codes for FMDV protein p88 and VP1-VP4. It may code 

CC for VP4, VP2, VP3 and VP1 contiguously. The inventors also claim a
CC vaccine for stimulating prodn. of antibodies against FMDV in a
CC mammal which comprises at least one of the above recombinant
CC proteins produced by a host cell transformed with the DNA.
SQ Sequence 67 AA;
SQ 1 A; 9 R; 1 N; 0 D; 0 B; 3 C; 3 Q; 3 E; 0 Z; 3 G; 3 H;
SQ 0 I; 3 L; 2 K; 0 M; 2 F; 8 P; 12 S; 7 T; 2 W; 0 Y; 5 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:15-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
|| ||

TRSPVSFTCRPPPSVSELRFCWPGWSRSPRVSSGPHPKTLRESRNSSKHVTLTTSCHSQERRVAGST 
10 20 30 40 50 60 



14. US-08-249-182-7 (1-10) 

P20020 Sequence of a foot and mouth disease virus capsid 

ID P20020 standard; Protein; 78 AA. 

AC P20020; 

DT 20-AUG-1992 (first entry) 

DE Sequence of a foot and mouth disease virus capsid protein 

DE encoded by a region of recombinant plasmid pFA61/t76 

KW Vaccine; antibody; capsid protein; immunogen; antigen; 

KW foot and mouth disease.

OS Foot and mouth disease virus. 

PN EP--48455-A. 

PD 31-MAR-1982.



CC expressed by a vector produced by a novel form of fusion PCR which
CC enables fusion of heavy and light chains prior to vector ligation,
CC avoiding the cumbersome separate cloning of fragments. This linker
CC sequence moves the SpeI site, retains native IgG1 upper hinge region,
CC retains original lamB sequence.
SQ Sequence 28 AA;
SQ 2 A; 0 R; 0 N; 2 D; 0 B; 1 C; 0 Q; 2 E; 0 Z; 0 G; 1 H;
SQ 0 I; 2 L; 3 K; 0 M; 2 F; 5 P; 4 S; 3 T; 0 W; 1 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:26-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

|| ||
EPKSCDKTHTSPPAPAPELLKSSFYFDT 
10 20 



12. US-08-249-182-7 (1-10) 

R27562 Insert B to prevent steric hindrance & competition

ID R27562 standard; Protein; 34 AA.

AC R27562; 

DT 26-FEB-1993 (first entry) 

DE Insert B to prevent steric hindrance & competition with E. coli coat. 

KW Dicistronic expression vector; fusion PCR; antibody; cDNA library; 

KW ss.

OS Synthetic. 

PN WO9215678-A.

PD 17-SEP-1992. 

PF 27-FEB-1992; U01475. 

PR 01-MAR-1991; US-663442. 

PA (STRA-) STRATAGENE. 

PI Sorge JA; 

DR WPI; 92-331724/40.

PT Prodn. of dicistronic DNA library used to make antibodies, etc. -

PT includes forming 1st and 2nd PCR admixtures, subjecting them to

PT PCR thermo-cycles, sepg. double stranded DNA, hybridising, etc.

PS Disclosure; Page 100; 143pp; English. 

CC This peptide linker sequence is used to increase the distance of an 

CC expressed IgG polypeptide from the surface membrane of E. coli, which
CC results in decreased steric hindrance and competition of the preselected
CC polypeptide with the lipopolysaccharide coat of E. coli. The IgG is
CC expressed by a vector produced by a novel form of fusion PCR which
CC enables fusion of heavy and light chains prior to vector ligation
CC avoiding the cumbersome separate cloning of fragments. This linker
CC sequence moves the SpeI site, retains original IgG1 upper hinge region,
CC retains original lamB sequence.
SQ Sequence 34 AA;
SQ 2 A; 0 R; 0 N; 2 D; 0 B; 1 C; 0 Q; 2 E; 0 Z; 0 G; 2 H;
SQ 0 I; 2 L; 5 K; 0 M; 1 F; 6 P; 5 S; 5 T; 0 W; 1 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:26-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
|| ||

PKSCDKTHTEPKSTDKTHTSPPAPAPELLKSSFY 



10 20 30 
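
The SQ lines of a record list how many times each residue occurs, so they can be cross-checked against the sequence printed in the alignment. A minimal sketch, assuming the 34-residue Insert B sequence shown above has been read correctly from the scan (the helper name composition is hypothetical):

```python
from collections import Counter


def composition(sequence):
    """Count occurrences of each one-letter residue code in a sequence."""
    return Counter(sequence)


# Insert B (R27562) as printed in the alignment above.
insert_b = "PKSCDKTHTEPKSTDKTHTSPPAPAPELLKSSFY"
counts = composition(insert_b)

print(len(insert_b))  # total residues, to compare with "Sequence 34 AA"
for residue in "PKSTHDEAL":
    print(residue, counts[residue])
```

Counts that disagree with the SQ lines usually point to a misread character in either the sequence or the composition line.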



FT gp. which modifies the N-terminus; 

FT Z= one or more amino acids, OH, NH2, or a

FT linkage involving either of these 2 gps."

FT Modified_site 24

FT /note= "the C-terminal comprises Y-(X)-Z"

PN WO9318054-A.

PD 16-SEP-1993. 

PF 08-MAR-1993; E00517. 

PR 06-MAR-1992; EP-400598. 

PA (INNO-) INNOGENETICS NV SA.

PI De LEYS R;

DR WPI; 93-303397/38.

PT New biotinylated peptide(s) corresp. to immuno-dominant

PT epitope(s) - with increased antigenicity, useful in antibodies

PT detection and vaccines against hepatitis C, HIV and HTLV 

PS Claim 4; Page 90-98; 133pp; English. 

CC Peptide compsns. comprise at least one and pref. a combination of 

CC two, three, four or more biotinylated peptides chosen from the

CC sequences given in R41058-R41166. The peptides represent 

CC immunologically important regions of viral proteins and are 

CC prepd. by solid phase peptide synthesis. The compsns. are 

CC useful for the detection of antibodies to HCV, and/or HIV,
CC and/or HTLV-I or II.
SQ Sequence 24 AA;
SQ 1 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;
SQ 0 I; 4 L; 0 K; 0 M; 0 F; 3 P; 7 S; 1 T; 0 W; 2 Y; 3 V;
SQ 2 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:59:29-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

|| ||
XVLYSPNVSVPSSSSTLLYPSLAX 
10 20 



11. US-08-249-182-7 (1-10)

R27561 Insert A to prevent steric hindrance & competition

ID R27561 standard; Protein; 28 AA.

AC R27561; 

DT 26-FEB-1993 (first entry) 

DE Insert A to prevent steric hindrance & competition with E. coli coat. 

KW Dicistronic expression vector; fusion PCR; antibody; cDNA library;

KW ss. 

OS Synthetic. 

PN WO9215678-A.

PD 17-SEP-1992. 

PF 27-FEB-1992; U01475. 

PR 01-MAR-1991; US-663442. 

PA (STRA-) STRATAGENE. 

PI Sorge JA; 

DR WPI; 92-331724/40.

PT Prodn. of dicistronic DNA library used to make antibodies, etc. -

PT includes forming 1st and 2nd PCR admixtures, subjecting them to

PT PCR thermo-cycles, sepg. double stranded DNA, hybridising, etc.

PS Disclosure; Page 100; 143pp; English.

CC This peptide linker sequence is used to increase the distance of an

CC expressed IgG polypeptide from the surface membrane of E. coli, which

CC results in decreased steric hindrance and competition of the preselected 

CC polypeptide with the lipopolysaccharide coat of E. coli. The IgG is 



i * wiutt a v i/ti i>ii tot uunwgct t-uni*y» / iw orunu oui.ua 

PN EP-507573-A. 

PD 07-OCT-1992.

PF 02-APR-1992; 302882.

PR 03-APR-1991; JP-071253. 

PA (SANW ) SANWA KAGAKU KENKYUSHO CO. 

PI Kaio B, Kurono H, Hitani T, Oouaki H, Sato H, Sauai K,
PI Takahashi H;
DR WPI; 92-333790/41.
PT Orally and nasally active motilin compsns. - for promoting
PT motility of digestive tract
PS Claim 2; Page 7; 8pp; English.
CC The peptide is a motilin-like substance, pref. L-leucine-13-motilin-
CC homoserine, which may be administered orally or nasally, as distinct
CC from prior art compsns. which are only effective by injection.
CC Cpds. contg. this peptide may be readily absorbed to promote
CC motility of the digestive tract, and are used for treating
CC dyskinesia or for causing contraction of the muscles after surgical
CC operation. The compsn. may opt. contain a peptolytic enzyme
CC inhibitor to prevent or reduce attack by trypsin or chymotrypsin on
CC the peptide.
SQ Sequence 23 AA;
SQ 0 A; 2 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 3 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 2 K; 0 M; 2 F; 0 P; 1 S; 1 T; 0 W; 1 Y; 1 V;
SQ 6 Others;
CC Retrieved by shears on Wed 21 Sep 94 11:58:25-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

| | ||
FVXIFTYXELXRXQEKERXKGXS 
X 10 20 



10. US-08-249-182-7 (1-10) 

R41084 HTLV-I and HTLV-II peptide I-gp46-3. 

ID R41084 standard; peptide; 24 AA. 

AC R41084; 

DT 22-MAR-1994 (first entry)

DE HTLV-I and HTLV-II peptide I-gp46-3.

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine. 

OS Synthetic. 

FH Key Location/Qualifiers 

FT Modified_site 1
FT /note= "the N-terminal comprises (A)-(B)-(X)-Y; where
FT B= biotin;
FT X= biotinylation cpd. incorporated
FT during synthesis;
FT Y= bond or linking gp(s). which
FT minimises steric hindrance,
FT where Y is not a bond it is pref. 1-10
FT residues of (same or different) glycine,
FT beta-alanine, 4-aminobutyric acid,
FT 5-aminovaleric acid or 6-aminohexanoic acid;
FT parenthesis around B and X indicate opt. presence
FT at the specified positions but B or X must be
FT present in at least one of the positions shown,
FT B interacts with the peptide to give a cpd.

FT with greater diagnostic sensitivity; 



DE IgG1 light chain fragment #3.
KW Fd'; fragment; human; IgG1; variable region; Fab'; F(ab')2; T cell;
KW gene module; binding activity; antibody.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Cross_links 5
FT /note= "Cys involved in disulphide bridge with
FT heavy chain"
PN WO9222324-A.
PD 23-DEC-1992.
PF 15-JUN-1992; U04976.
PR 14-JUN-1991; US-714175.
PA (XOMA ) XOMA CORP.
PI Better MD, Carroll S, Horwitz AH;
DR WPI; 93-017909/02.
DR N-PSDB; Q34567-72.
PT Polynucleotide sequences encoding Fab' and F(ab')2 fragments -
PT used to produce, e.g. antibody-ricin A chain immunotoxin(s)
PS Disclosure; Fig 1b; 92pp; English.
CC The sequences given in R30876-78 represent Fd' fragments of human
CC IgG1. These polypeptides represent a variable region, which when
CC part of an Fab' or F(ab')2 are reactive with T cells. Six different
CC Fd' gene modules were constructed. These gene modules were used to
CC create Ig fragments which retain the full binding activity of the
CC whole antibody.
SQ Sequence 14 AA;
SQ 1 A; 0 R; 0 N; 0 D; 0 B; 1 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;
SQ 0 I; 2 L; 0 K; 0 M; 2 F; 5 P; 1 S; 1 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:34-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

|| ||
TSPPCPAPELLFFP 
X 10 



9. US-08-249-182-7 (1-10) 

R27059 Motilin-like peptide.
ID R27059 standard; peptide; 23 AA.
AC R27059;
DT 03-MAR-1993 (first entry)
DE Motilin-like peptide.
KW Absorb; orally; nasally; motility; digestive tract; dyskinesia.
OS Synthetic.
FH Key Location/Qualifiers
FT Misc_difference 3
FT /label= Pro, Gly, Asn, Ser
FT Misc_difference 8
FT /label= Gly, Pro, Asn, Ser
FT Misc_difference 11
FT /label= Gln, Glu, Asp
FT Misc_difference 13
FT /note= "any amino acid other than Met"
FT Misc_difference 19
FT /label= Asn, Glu, Asp
FT Misc_difference 22
FT /label= Gln, Lys, Arg
FT Modified_site 23
FT /note= "homoSer, its lactone or any polypeptide



Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 57% Matches = 4 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0

X X 10
VPPFENIELY

|| ||
PILQERPPLENR 
X 10 X 



7. US-08-249-182-7 (1-10) 

R30876 IgG1 light chain fragment #1.
ID R30876 standard; peptide; 14 AA.
AC R30876;
DT 10-MAY-1993 (first entry)
DE IgG1 light chain fragment #1.
KW Fd'; fragment; human; IgG1; variable region; Fab'; F(ab')2; T cell;
KW gene module; binding activity; antibody.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Cross_links 2
FT /note= "Cys involved in disulphide bridge with
FT heavy chain"
FT Cross_links 5
FT /note= "Cys involved in disulphide bridge with
FT heavy chain"
PN WO9222324-A.
PD 23-DEC-1992.
PF 15-JUN-1992; U04976.
PR 14-JUN-1991; US-714175.
PA (XOMA ) XOMA CORP.
PI Better MD, Carroll S, Horwitz AH;
DR WPI; 93-017909/02.
DR N-PSDB; Q34567-72.
PT Polynucleotide sequences encoding Fab' and F(ab')2 fragments -
PT used to produce, e.g. antibody-ricin A chain immunotoxin(s)
PS Disclosure; Fig 1b; 92pp; English.
CC The sequences given in R30876-78 represent Fd' fragments of human
CC IgG1. These polypeptides represent a variable region, which when
CC part of an Fab' or F(ab')2 are reactive with T cells. Six different
CC Fd' gene modules were constructed. These gene modules were used to
CC create Ig fragments which retain the full binding activity of the
CC whole antibody.
SQ Sequence 14 AA;
SQ 1 A; 0 R; 0 N; 0 D; 0 B; 2 C; 0 Q; 1 E; 0 Z; 2 G; 0 H;
SQ 0 I; 2 L; 0 K; 0 M; 0 F; 5 P; 0 S; 1 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:34-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 2.68
Residue Identity = 40% Matches = 4 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

|| ||
TCPPCPAPELLGGP 
X 10 



8. US-08-249-182-7 (1-10) 

R30878 IgG1 light chain fragment #3.

ID R30878 standard; peptide; 14 AA. 
AC R30878; 



PA (RENA/) Renard A J J.
PI Renard AJJ, Thiry MEG;
DR WPI; 90-140005/19.
DR N-PSDB; Q04292.
PT Immunogenic recombinant polypeptide -
PT from Fish haemorrhagic septicaemia virus for protective
PT vaccines, and new DNA sequences, vectors, transformed hosts and
PT antibodies
PS Claim 1; Page 66; 99pp; French.
CC Recombinant polypeptide sequence is included in all
CC recombinant proteins covered by the invention.
CC See also Q04308, Q04318-20, Q05332-42.
SQ Sequence 455 AA;
SQ 19 A; 19 R; 21 N; 29 D; 0 B; 12 C; 14 Q; 18 E; 0 Z; 25 G; 23 H;
SQ 31 I; 31 L; 23 K; 9 M; 17 F; 27 P; 34 S; 45 T; 9 W; 17 Y; 32 V;
CC Retrieved by shears on Wed 21 Sep 94 11:56:58-PDT using FindSeq

Initial Score = 5 Optimized Score = 6 Significance = 3.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
|| ||| |

HEUNTFFLVILIIIIKSTTPQITQRPPVENISTYHADWDTPLYTHPSNCRDDSFVPIRPAQLRCPHEFEDIN
10 20 X 30 X 40 50 60 70 

KGLVSVPTRIIH 
80 



6. US-08-249-182-7 (1-10) 

R34234 HTLV-I gp21 transmembrane glycoprotein fragment 9. 

ID R34234 standard; peptide; 12 AA. 

AC R34234; 

DT 04-AUG-1993 (first entry) 

DE HTLV-I gp21 transmembrane glycoprotein fragment 9. 

KW Human T-cell leukaemia virus; hydrophilic; conjugate; aggregate;
KW diagnosis; antibodies.
OS Synthetic.
PN WO9306843-A.
PD 15-APR-1993.
PF 08-OCT-1992; U08405.
PR 08-OCT-1991; US-771553.
PA (UYDU-) UNIV DUKE.
PI Haynes BF, Palker TJ;
DR WPI; 93-134125/16.
PT Antigenic determinant peptide(s) of HTLV envelope glycoprotein -
PT useful for detecting anti-HTLV-I and -II antibodies and as
PT vaccine against HTLV
PS Claim 4; Page 11; 50pp; English.
CC The sequence of peptide 9 corresponds to residues 411-422 from the
CC HTLV-I gp21 transmembrane glycoprotein. When covalently linked
CC to a carrier mol. the hydrophilic peptide can induce in a mammal the
CC prodn. of high titres of antibodies to gp46 envelope glycoprotein from
CC HTLV-I or -II. The peptide and carrier may be used in vaccines against
CC HTLV-I or -II infection. The peptide may be used in a diagnostic
CC assay to detect the presence and titre of anti-HTLV antibodies.
CC See also R34225-57.
SQ Sequence 12 AA;
SQ 0 A; 2 R; 1 N; 0 D; 0 B; 0 C; 1 Q; 2 E; 0 Z; 0 G; 0 H;
SQ 1 I; 2 L; 0 K; 0 M; 0 F; 3 P; 0 S; 0 T; 0 W; 0 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:00-PDT using FindSeq



Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

|| | ||
FVPIFTYGELQRLQEKERNKGQ
X 10 20 



4. US-08-249-182-7 (1-10) 

P80164 Biosynthetic multifunctional protein. 

ID P80164 standard; protein; 200 AA.

AC P80164; 

DT 16-NOV-1990 (first entry) 

DE Biosynthetic multifunctional protein. 

KW Biosynthetic multifunctional protein; biosynthetic antibody binding site; 

KW protein trailer; ricin. 

PN WO8809344-A.

PD 01-DEC-1988.

PF 19-MAY-1988; U01737.

PR 21-MAY-1987; US-052800.

PA (CREA-) Creative Biomolecules Inc.

PI Huston JS, Oppermann H;

DR WPI; 88-353923/49.

DR N-PSDB; N80190.

PT Recombinant multifunctional protein - having antibody binding site and a 

PT sequence for biological activity, ion sequestering or binding to a 

PT solid support. 

PS Disclosure; HSpp; English. 

CC The sequence is a biosynthetic multifunctional protein including a 

CC biosynthetic antibody binding site and a ricin protein trailer linked 

CC via a spacer sequence. 

SQ Sequence 200 AA;
SQ 18 A; 17 R; 14 N; 6 D; 0 B; 1 C; 9 Q; 13 E; 0 Z; 16 G; 4 H;
SQ 16 I; 17 L; 1 K; 2 M; 11 F; 8 P; 9 S; 15 T; 0 W; 12 Y; 11 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:03-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.57
Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
|||||

VVGYRAGNSAYFFHPDNQEDAEAITHLFTDVQNRYTFAFGGNYDRLEQLAGNLRENIELGNGPLEEAISALY
 90 100 110 120 130 X 140 X 150

YYSTGGTQLPTLARSFIICIQMISEAARFQYIEGEMRT
160 170 180 190 



5. US-08-249-182-7 (1-10) 

R04576 Polypeptide recognised by neutralising anti-HSV an 

ID R04576 standard; protein; 455 AA. 

AC R04576;

DT 24-SEP-1990 (first entry)

DE Polypeptide recognised by neutralising anti-HSV antibodies

KW Fish haemorrhagic septicaemia virus (HSV);

KW immunogenic recombinant polypeptide; ss

OS synthetic.

PN CA2000570-A.

PD 12-APR-1990.

PF 12-OCT-1989; .



FT /label= extracellular_domain
FT /note= "soluble, immunogenic form of IFN-R"
PN EP-563487-A.
PD 06-OCT-1993.
PF 31-MAR-1992; 400902.
PR 31-MAR-1992; EP-400902.
PA (EUBI-) LAB EURO BIOTECHNOLOGIE SA.
PI Benoit P, Maguire D, Meyer F, Plavec I, Tovey MG;
DR WPI; 93-312951/40.
DR P-PSDB; R42635.
PT Monoclonal antibody to human interferon type-I receptor - having
PT neutralising activity against human type I interferon, used for
PT therapy and diagnosis
PS Disclosure; Fig 3; 21pp; English.

CC Monoclonal antibodies produced against soluble forms of the human 

CC interferon alpha-beta receptor based on the full-length human IFN-R 

CC sequence are claimed. The antibodies are useful for treatment and 

CC prophylaxis of disorders involving cell proliferation and/or viral 

CC infection. 

SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 44 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 42 V;
CC Retrieved by shears on Wed 21 Sep 94 11:59:33-PDT using FindSeq

Initial Score = 6 Optimized Score = 6 Significance = 4.46
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
 || ||||

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKWDY
180 190 200 210 220 230 240 250 

TYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



3. US-08-249-182-7 (1-10)
P82050 13-Leu motilin

ID P82050 standard; peptide; 22 AA.
AC P82050;

DT 19-OCT-1990 (first entry)

DE 13-Leu motilin

KW Motilin; monoclonal antibodies.

OS synthetic. 

PN J63044896-A. 

PD 25-FEB-1988. 

PF 13-AUG-1986; 189811. 

PR 13-AUG-1986; JP-189811.

PA (KYOW) Kyowa Hakko Kogyo KK.

DR WPI; 88-04821/14. 

PT Monoclonal antibody recognising 13-leucine motilin - used to accelerate 

PT peristalsis in humans 

PS Claim 3; page 1; 5pp; Japanese.

CC By replacing M13 with leucine the activity is increased and the motilin
CC is useful in medicine. Monoclonal antibodies can be prepared from this
CC motilin deriv. which are useful for detecting its presence.
SQ Sequence 22 AA;

SQ 0 A; 2 R; 1 N; 0 D; 0 B; 0 C; 3 Q; 3 E; 0 Z; 2 G; 0 H;
SQ 1 I; 2 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:00-PDT using FindSeq

Initial Score = 5 Optimized Score = 5 Significance = 3.57 







nundn Rdppo innunuyiuoui in ii 




n 




coo 




36. A49833  autoantigen recognized by an     443     4     4   2.68   0
37. R43339  Completely humanised C4G1 Ig     449     4     4   2.68   0
38. R33311  Humanised tlaEil Version 1 (in   453     4     4   2.68   0
39. R30774  H52H4-160 murine anti-CD18 an    454     4     4   2.68   0
40. R42066  Human anti-HBs heavy chain.      459     4     4   2.68   0



1. US-08-249-182-7 (1-10) 

R37449 Autotaxin peptide ATX 48.
ID R37449 standard; peptide; 10 AA.
AC R37449;
DT 22-JUL-1993 (first entry)
DE Autotaxin peptide ATX 48.
KW Cell motility stimulating; cancer metastasis; antibody; detection;
KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043.
PR 17-JAN-1992; US-822043.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.
PI Krutzsch H, Liotta LA, Schiffmann E, Stracke H.
DR WPI; 93-085861/10.
PT Motility stimulating protein named autotaxin - useful in cancer
PT diagnosis and therapy
PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 48. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 10 AA;
SQ 0 A; 0 R; 2 N; 0 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;
SQ 1 I; 1 L; 0 K; 0 M; 1 F; 2 P; 1 S; 0 T; 0 W; 1 Y; 0 V;
CC Retrieved by shears on Wed 21 Sep 94 11:58:47-PDT using FindSeq

Initial Score = 8 Optimized Score = 8 Significance = 6.25
Residue Identity = 80% Matches = 8 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
 |||||| ||
SPPFENINLY 
X 10 
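
With the unitary similarity matrix and no gaps, the Matches, Mismatches and Residue Identity figures for this alignment can be reproduced by comparing the two sequences position by position. A minimal sketch, assuming Residue Identity is matches divided by matches plus mismatches (an inference from the figures printed in this report; the function name ungapped_identity is hypothetical):

```python
def ungapped_identity(query, target):
    """Return (matches, mismatches, percent identity) for two
    equal-length sequences aligned without gaps."""
    if len(query) != len(target):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(query, target))
    mismatches = len(query) - matches
    percent = 100.0 * matches / (matches + mismatches)
    return matches, mismatches, percent


# Query US-08-249-182-7 against R37449 (autotaxin peptide ATX 48).
print(ungapped_identity("VPPFENIELY", "SPPFENINLY"))
```

For longer database entries only the aligned window is compared, so the target passed in would be the matching segment rather than the whole sequence.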



2. US-08-249-182-7 (1-10)

R42635 Human interferon receptor. 

ID R42635 standard; Protein; 557 AA. 

AC R42635; 

DT 20-APR-1994 (first entry) 

DE Human interferon receptor. 

KW IFN-R; extracellular domain; monoclonal antibody; viral infection;

KW cell proliferation; allograft rejection; systemic lupus erythematosus;

KW psoriasis; multiple sclerosis; Behcet's Disease; aplastic anaemia;

KW immunodeficiency; measles virus; interferon-alpha-beta.

OS Homo sapiens. 

FH Key Location/Qualifiers 



Initial scores to save 40 Alignments to save 15 
Optimized scores to save 0 Display context 50 
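
The K-tuple parameter of 2 suggests that the initial comparison looks for short exact words shared by the query and each database sequence. The sketch below shows the general k-tuple idea only: it indexes the query's 2-letter words and counts hits per diagonal in a target. It is an illustration under that assumption, not FastDB's actual scoring, which also involves the joining penalty and window size listed in the parameters. The function name ktuple_diagonal_hits is hypothetical.

```python
from collections import Counter, defaultdict


def ktuple_diagonal_hits(query, target, k=2):
    """Count shared k-letter words per diagonal (target pos - query pos)."""
    # Index every k-letter word of the query by its start position.
    index = defaultdict(list)
    for i in range(len(query) - k + 1):
        index[query[i:i + k]].append(i)

    # Scan the target; each shared word adds a hit on its diagonal.
    hits = Counter()
    for j in range(len(target) - k + 1):
        for i in index.get(target[j:j + k], ()):
            hits[j - i] += 1
    return hits


hits = ktuple_diagonal_hits("VPPFENIELY", "SPPFENINLY")
print(hits.most_common(1))  # best diagonal and its hit count
```

A diagonal with many hits marks a region where an ungapped alignment is likely to score well, which is why word lookups of this kind are used as a fast first pass.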



SEARCH STATISTICS 



Scores:



Mean Median Standard Deviation 

1 1 1.12 



Times:



CPU 
00:00:08.91 



Total Elapsed 
00:00:08.00 



Number of residues: 482836 
Number of sequences searched: 5543
Number of scores above cutoff: 2620



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
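
The Sig. values in this report are consistent with the number of standard deviations by which an initial score exceeds the mean score. A minimal sketch, assuming that relationship and using the mean (1) and standard deviation (1.12) printed in the search statistics for this query; the formula is inferred from the printed figures rather than taken from FastDB documentation, and the function name significance is hypothetical:

```python
def significance(initial_score, mean, stdev):
    """Standard deviations by which a score exceeds the mean."""
    return (initial_score - mean) / stdev


MEAN, STDEV = 1.0, 1.12  # from SEARCH STATISTICS for US-08-249-182-7

for score in (8, 6, 5, 4):
    print(score, round(significance(score, MEAN, STDEV), 2))
```

Because the mean is printed as a rounded integer, results for other queries may differ from the printed Sig. column in the second decimal place.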

A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                    Init.  Opt.
Sequence Name  Description                   Length Score Score  Sig. Frame

     **** 6 standard deviations above mean ****
  1. R37449    Autotaxin peptide ATX 48.         10     8     8  6.25     0
     **** 4 standard deviations above mean ****
  2. R42635    Human interferon receptor.       557     6     6  4.46     0
     **** 3 standard deviations above mean ****
  3. P82050    13-Leu motilin                    22     5     5  3.57     0
  4. P80164    Biosynthetic multifunctional     200     5     5  3.57     0
  5. R04576    Polypeptide recognised by neu    455     5     6  3.57     0
     **** 2 standard deviations above mean ****
  6. R34234    HTLV-1 gp21 transmembrane gly     12           4  2.68     0
  7. R30876    IgG1 light chain fragment #1.     14           4  2.68     0
  8. R30878    IgG1 light chain fragment #3.     14           4  2.68     0
  9. R27059    Motilin-like peptide.             23           4  2.68     0
 10. R41084    HTLV-I and HTLV-II peptide I-     24           4  2.68     0
 11. R27561    Insert A to prevent steric hi     28           4  2.68     0
 12. R27562    Insert B to prevent steric hi     34           4  2.68     0
 13. P20022    Sequence of a foot and mouth      67           4  2.68     0
 14. P20020    Sequence of a foot and mouth      78           4  2.68     0
 15. R11715    HTLV-1 env. gp21 epitope enco    106           5  2.68     0
 16. A48491    twitching motility protein P     135           4  2.68     0
 17. R11721    ENV93/HTLV-1-IIIB' fusion pro    158           5  2.68     0
 18. R21796    Bet v I allergen of birch.       160           4  2.68     0
 19. R21791    Aln g I allergen of alder.       160           5  2.68     0
 20. R11717    ENV93/HTLV-1-II fusion protei    217           5  2.68     0
 21. R37573    Partial human skeletal muscle    223           4  2.68     0
 22. R11716    ENV93/HTLV-1-I fusion protein    245           5  2.68     0
 23. R11720    ENV93/HTLV-1-IIIA fusion prot    249           5  2.68     0
 24. R24186    Bovine RSV strain A 51908 11 p   256           4  2.68     0
 25. R13117    Cholera toxin A1 fragment.       258           4  2.68     0
 26. R11719    ENV93/HTLV-1-III fusion prote    315           5  2.68     0
 27. R33279    43 kD endoflagellum sheath pr    320           5  2.68     0
 28. P93704    Sequence of the 65 kd surface    325           4  2.68     0
 29. R37572    Rabbit skeletal muscle ADP-ri    327           4  2.68     0
 30. R20130    SEQ ID No. 5 of the constant     330           4  2.68     0
 31. P81026    C region of H chain (gamma1)     330           4  2.68     0
 32. R20129    SEQ ID No. 4 of the constant     337           4  2.68     0
 33. R11713    ENV93/HTLV-1-II+I fusion prot    344           5  2.68     0
 34. R11068    12D3 antigen sequence deduced    346           4  2.68     0
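
The "Sig." column appears to be a z-score of the initial score, which would agree with the "standard deviations above mean" group headings. The following is a minimal sketch under that assumption, Sig = (Init - mean) / stdev. The function names are illustrative, and the mean and standard deviation are back-calculated from two rows of the list above rather than taken from FastDB itself.

```python
def fit_mean_stdev(score_a, sig_a, score_b, sig_b):
    """Solve sig = (score - mean) / stdev from two (score, sig) pairs."""
    stdev = (score_a - score_b) / (sig_a - sig_b)
    mean = score_a - sig_a * stdev
    return mean, stdev


def significance(score, mean, stdev):
    """Number of standard deviations by which score exceeds the mean."""
    return (score - mean) / stdev


# Rows 1 and 2 of the list above: Init. 8 -> Sig. 6.25, Init. 6 -> Sig. 4.46
mean, stdev = fit_mean_stdev(8, 6.25, 6, 4.46)

# Compare the prediction for Init. 5 with the value printed for rows 3 to 5.
print(round(significance(5, mean, stdev), 2))
```

If the assumption holds, the predicted value for an initial score of 5 should agree with the 3.57 printed in the list to within rounding of the two-decimal inputs.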



Results file us-08-249-182-7.res made by on Wed 21 Sep 94 12:11:22-PDT.

Query sequence being compared:US-08-249-182-7 (1-10)
Number of sequences searched: S543
Number of scores above cutoff: 2620

Results of the initial comparison of US-08-249-182-7 (1-10) with:
File : /home/shears/loring/lorin*.pep

[Histogram: NUMBER OF SEQUENCES (log scale, 5 to 10000) versus SCORE (0 to 6) and STDEV (0 to 5)]



PARAMETERS

Similarity matrix      Unitary     K-tuple            2
Mismatch penalty       1           Joining penalty    20
Gap penalty            1.00        Window size        5
Gap size penalty       0.05
Cutoff score           0
Randomization group    0
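
With a unitary similarity matrix, each identical residue pair scores 1 and every other pair scores 0, so an ungapped score is simply a count of identities. The sketch below is a simplified illustration of that idea and is not the FastDB algorithm itself, which uses a k-tuple lookup and the penalties listed above. The name `best_ungapped` is hypothetical, and the target fragment is the tail of one of the gp41 peptides in this report.

```python
def unitary(a, b):
    """Unitary similarity matrix: 1 for identical residues, else 0."""
    return 1 if a == b else 0


def best_ungapped(query, target):
    """Slide query along target without gaps; return (score, offset).

    offset is the 0-based target index aligned with query[0]. It may be
    negative or run past the end, in which case only the overlap counts.
    """
    best = (0, 0)
    for offset in range(-len(query) + 1, len(target)):
        score = sum(
            unitary(q, target[i + offset])
            for i, q in enumerate(query)
            if 0 <= i + offset < len(target)
        )
        if score > best[0]:
            best = (score, offset)
    return best


# A 9-residue query against a short target fragment.
score, offset = best_ungapped("YDVPWNETI", "GKLICTTAVPWNAS")
print(score, offset)
```

An exhaustive scan like this is quadratic in sequence length; it is shown only to make the scoring rule concrete.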



i luiua arm iui uiaynuais ut mum nnvi ur ^ire mug uunuiuiuria 

CC It has application as a vaccine. See also P82715. 
SQ Sequence 34 AA; 

SQ 3 A; 2 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 1 E; 0 Z; 3 G; 0 H;

SQ 3 I; 5 L; 2 K; 0 M; 0 F; 1 P; 1 S; 2 T; 2 W; 1 Y; 2 V;
CC Retrieved by shears on Wed 21 Sep 94 11:57:06-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 57% Matches = 4 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0

                           X     X
                           YDVPWNETI
                             ||||
RILAVERYLKDQQLLGIWGCSGKLICTTAVPWNA
        10        20        30   X
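
The reported residue identity is consistent with Matches / (Matches + Mismatches), truncated to a whole percent. The sketch below works under that assumption; the function name is illustrative, and the truncation rule is inferred from the values printed in this report (for example 8 matches and 4 mismatches are reported as 66, not 67), not from FastDB documentation. All alignments shown here have Gaps = 0, so gaps are not considered.

```python
def residue_identity(matches, mismatches):
    """Percent identity over the aligned positions, truncated to an integer."""
    aligned = matches + mismatches
    return (100 * matches) // aligned


# (matches, mismatches) pairs as reported for alignments in this report
for matches, mismatches in [(4, 3), (4, 4), (4, 5), (8, 4), (12, 0)]:
    print(matches, mismatches, residue_identity(matches, mismatches))
```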



15. US-08-249-182-6 (1-9) 

R44497 Sequence of the HIV-1 epitope gp41. 



ID R44497 standard; Protein; 35 AA.

AC R44497;

DT 26-MAY-1994 (first entry)

DE Sequence of the HIV-1 epitope gp41.

KW Epitope; scFv; single chain antibody fragment; composite antibody;

KW human immunodeficiency virus; HIV-1; surface protein; gp41.

OS HIV-1.

PN WO9324630-A.

PD 09-DEC-1993.

PF 19-MAY-1993; AU0228.

PR 22-MAY-1992; AU-002551.

PA (AGEN-) AGEN LTD.

PI Hillyard CJ, Hudson PJ, Lilley GG;

DR WPI; 93-405821/50.

PT Bifunctional recombinant protein - contains particle and analyte

PT binding moieties, used in agglutination assays pref. on whole

PT blood

PS Example; Figure 7; 42pp; English.

CC Epitopes of the surface protein gp41 from HIV1 and HIV2 virus types

CC may be combined with epitopes from gp120 surface protein or p24

CC core protein or substituted for the M2-FLAG epitope in scFv

CC constructs or added to the scFv-M2 FLAG construct, thereby

CC producing various bifunctional reagents capable of binding

CC erythrocytes and serum antibodies which may be present in patient's

CC serum. The sequences of M2-FLAG, HIV1 and HIV2 epitopes are given

CC in R44496, R44497 and R44498 respectively.

SQ Sequence 35 AA;

SQ 4 A; 2 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 3 G; 0 H;

SQ 3 I; 5 L; 2 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 1 Y; 2 V;

CC Retrieved by shears on Wed 21 Sep 94 11:59:45-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 50% Matches = 4 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

                           X      X
                           YDVPWNETI
                             ||||
RILAVARYLKDQQLLGIWGCSGKLICTTAVPWNAS
        10        20        30    X

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences



OS Synthetic. 

PN WO9215678-A.

PD 17-SEP-1992.

PF 27-FEB-1992; U01475.

PR 01-MAR-1991; US-663442.

PA (STRA-) STRATAGENE.

PI Sorge JA;

DR WPI; 92-331724/40.

PT Prodn. of dicistronic DNA library used to make antibodies, etc. -

PT includes forming 1st and 2nd PCR admixtures, subjecting them to

PT PCR thermo-cycles, sepg. double stranded DNA, hybridising, etc.

PS Disclosure; Page 100; 143pp; English.

CC This peptide linker sequence is used to increase the distance of an

CC expressed IgG polypeptide from the surface membrane of E. coli, which

CC results in decreased steric hindrance and competition of the preselected

CC polypeptide with the lipopolysaccharide coat of E. coli. The IgG is

CC expressed by a vector produced by a novel form of fusion PCR which

CC enables fusion of heavy and light chains prior to vector ligation,

CC avoiding the cumbersome separate cloning of fragments. This linker

CC sequence moves the SpeI site, and retains the native IgG1 upper hinge

CC region, retains original lamB sequence.

SQ Sequence 29 AA;

SQ 0 A; 0 R; 0 N; 4 D; 0 B; 1 C; 0 Q; 1 E; 0 Z; 1 G; 1 H;

SQ 0 I; 0 L; 3 K; 0 M; 3 F; 2 P; 5 S; 3 T; 0 W; 4 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:58:26-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 44% Matches = 4 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

             X       X
             YDVPWNETI
             ||||
EPKSCDKTHTSYFYDVPDYGSKSSFYFDT
        10   X    20 X



14. US-08-249-182-6 (1-9) 

P82714 Peptide for detection of antibodies to HTLV-III. 

ID P82714 standard; peptide; 34 AA. 

AC P82714; 

DT 23-NOV-1990 (first entry)

DE Peptide for detection of antibodies to HTLV-III.

KW HTLV-III; AIDS; ARC.

OS synthetic.

FH Key Location/Qualifiers

FT Region 1..16

FT /label=active fragment

FT Region 17..35

FT /label=active fragment

PN ZA8704759-A.

PD 05-JAN-1988.

PF 01-JUL-1987; 004759.

PR 11-SEP-1985; US-774644.

PR 04-MAR-1986; US-837566.

PR 02-APR-1986; US-847102.

PR 10-FEB-1987; US-013014.

PA (UNBI-) Utd Biomedical Inc.

PI Wang CY, Wang JJG;

DR WPI; 88-147753/21.

PT Peptide compsn. - used in detection of antibodies of HTLV-III.

PS Claim 1; page 44; 48pp; English.

CC This peptide, or its analogues or active fragments, is used in a

CC compsn. for the detection of antibodies to HTLV-III in body



SQ 2 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:59:28-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 57% Matches = 4 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0

                  X     X
                  YDVPWNETI
                    ||||
XDQQLLGIWGCSGKHICTTNVPWNX
        10        20    X



12. US-08-249-182-6 (1-9) 

P80566 Peptide region of human immunodeficiency virus-1 g

ID P80566 standard; protein; 27 AA.

AC P80566;

DT 07-NOV-1990 (first entry)

DE Peptide region of human immunodeficiency virus-1 gp41B1.

KW HIV-1; peptide; HIV-1 gp41B1; antibody; immunogen.

PN EP-284587-A.

PD 28-SEP-1988.

PF 28-MAR-1988; 850105.

PR 17-MAR-1987; SE-701294.

PR 18-MAY-1987; US-051726.

PA (VIRO-) Virovahl SA.

PI Vahlne A, Svennerholm B, Rymo L, Jeansson S, Horal P;

DR WPI; 88-272997/39.

PT Synthetic peptide antigens for detection of HIV-1 infection -

PT also useful as immunogens in vaccine compsns.

PS Claim 5; page 24; 31pp; English.

CC The synthetic peptide corresponds to regions of immunologically important

CC proteins of HIV-1. The peptide provides a superior, sensitive and

CC selective assay for the presence of antibodies to HIV-1. It may also be

CC used as an immunogen to elicit the prodn. of anti-HIV-1 antibodies.

CC X is the H of the NH2 group of the peptide, or an additional amino acid

CC selected to facilitate coupling of the peptide to a carrier protein.

CC Y is absent or Cys, and Z is OH or NH2.

SQ Sequence 27 AA;

SQ 2 A; 0 R; 4 N; 0 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 1 I; 1 L; 1 K; 2 M; 0 F; 1 P; 3 S; 2 T; 4 W; 0 Y; 1 V;

SQ 3 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:57:03-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 44% Matches = 4 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

 X       X
 YDVPWNETI
   ||||
XTAVPWNASWSNKSLEQIWNNMTWMYZ
 X      10        20



13. US-08-249-182-6 (1-9) 

R27564 Insert D to prevent steric hindrance competition

ID R27564 standard; Protein; 29 AA.

AC R27564;

DT 26-FEB-1993 (first entry) 

DE Insert D to prevent steric hindrance & competition with E. coli coat. 

KW Dicistronic expression vector; fusion PCR; antibody; cDNA library; 



Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 44% Matches = 4 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

            X       X
            YDVPWNETI
              ||||
XIWGCSGKLICTTAVPWNASX
        10  X     20



11. US-08-249-182-6 (1-9) 

R41063 HIV-1 gp41 peptide (isolate ELI). 

ID R41063 standard; peptide; 25 AA. 
AC R41063; 

DT 22-MAR-1994 (first entry)

DE HIV-1 gp41 peptide (isolate ELI).

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine.

OS Synthetic.

FH Key Location/Qualifiers
FT Modified_site 1
FT /note= "the N-terminal comprises (A)-(B)-(X)-Y; where
FT B= biotin;
FT X= biotinylation cpd. incorporated
FT during synthesis;
FT Y= bond or linking gp(s). which
FT minimises steric hindrance,
FT where Y is not a bond it is pref. 1-10
FT residues of (same or different) glycine,
FT beta-alanine, 4-aminobutyric acid,
FT 5-aminovaleric acid or 6-aminohexanoic acid;
FT parenthesis around B and X indicate opt. presence
FT at the specified positions but B or X must be
FT present in at least one of the positions shown,
FT B interacts with the peptide to give a cpd.
FT with greater diagnostic sensitivity;
FT A (optional)= one or more amino acids, NH2 or
FT gp. which modifies the N-terminus;
FT Z= one or more amino acids, OH, NH2, or a
FT linkage involving either of these 2 gps."
FT Modified_site 25
FT /note= "the C-terminal comprises Y-(X)-Z"
PN WO9318054-A.
PD 16-SEP-1993.
PF 08-MAR-1993; E00517.
PR 06-MAR-1992; EP-400598.
PA (INNO-) INNOGENETICS NV SA.
PI De LEYS R;
DR WPI; 93-303397/38.
PT New biotinylated peptide(s) corresp. to immuno-dominant
PT epitope(s) - with increased antigenicity, useful in antibodies
PT detection and vaccines against hepatitis C, HIV and HTLV
PS Claim 4; Page 90-98; 133pp; English.
CC Peptide compsns. comprise at least one and pref. a combination of
CC two, three, four or more biotinylated peptides chosen from the
CC sequences given in R41058-R41166. The peptides represent
CC immunologically important regions of viral proteins and are
CC prepd. by solid phase peptide synthesis. The compsns. are
CC useful for the detection of antibodies to HCV, and/or HIV,
CC and/or HTLV-I or II.
SQ Sequence 25 AA;

SQ 0 A; 0 R; 2 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 3 G; 1 H;



Gaps = 0 Conservative Substitutions = 0

           X      X
           YDVPWNETI
             ||||
IWGCSGKLICTTAVPWNAS
        10 X      X



10. US-08-249-182-6 (1-9) 

R41059 HIV-1 gp41 peptide (isolate HTLV-IIIB). 

ID R41059 standard; peptide; 21 AA. 

AC R41059; 

DT 22-MAR-1994 (first entry)

DE HIV-1 gp41 peptide (isolate HTLV-IIIB).

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine.

OS Synthetic.

FH Key Location/Qualifiers

FT Modified_site 1

FT /note= "the N-terminal comprises (A)-(B)-(X)-Y; where

FT B= biotin;

FT X= biotinylation cpd. incorporated

FT during synthesis;

FT Y= bond or linking gp(s). which

FT minimises steric hindrance,

FT where Y is not a bond it is pref. 1-10

FT residues of (same or different) glycine,

FT beta-alanine, 4-aminobutyric acid,

FT 5-aminovaleric acid or 6-aminohexanoic acid;

FT parenthesis around B and X indicate opt. presence

FT at the specified positions but B or X must be

FT present in at least one of the positions shown,

FT B interacts with the peptide to give a cpd.

FT with greater diagnostic sensitivity;

FT A (optional)= one or more amino acids, NH2 or

FT gp. which modifies the N-terminus;

FT Z= one or more amino acids, OH, NH2, or a

FT linkage involving either of these 2 gps."

FT Modified_site 21

FT /note= "the C-terminal comprises Y-(X)-Z"

PN WO9318054-A.

PD 16-SEP-1993.

PF 08-MAR-1993; E00517.

PR 06-MAR-1992; EP-400598.

PA (INNO-) INNOGENETICS NV SA.

PI De LEYS R;

DR WPI; 93-303397/38.

PT New biotinylated peptide(s) corresp. to immuno-dominant

PT epitope(s) - with increased antigenicity, useful in antibodies

PT detection and vaccines against hepatitis C, HIV and HTLV

PS Claim 4; Page 90-98; 133pp; English.

CC Peptide compsns. comprise at least one and pref. a combination of

CC two, three, four or more biotinylated peptides chosen from the

CC sequences given in R41058-R41166. The peptides represent

CC immunologically important regions of viral proteins and are

CC prepd. by solid phase peptide synthesis. The compsns. are

CC useful for the detection of antibodies to HCV, and/or HIV,

CC and/or HTLV-I or II.

SQ Sequence 21 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 2 C; 0 Q; 0 E; 0 Z; 2 G; 0 H;

SQ 2 I; 1 L; 1 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 0 Y; 1 V;

SQ 2 Others;

CC Retrieved by shears on Wed 21 Sep 94 11:59:28-PDT using FindSeq



DR WPI; 91-224505/31. 

PT Peptide compsns. corresp. to envelope fragments of HTLV-1,2 - for

PT detecting antibodies to these viruses and diagnosing HIV and

PT adult T-cell leukaemia infections

PS Claim 19; Page 20; 27pp; English.

CC This peptide is one of 16 peptides useful for detecting antibodies to

CC HTLV or HIV viruses. The peptides correspond to partial sequences of

CC the HTLV virus designated gp21 and gp64, both part of gp61, which

CC defines the envelope protein of the HTLV-I or HTLV-II virus or

CC their analogues. The peptides can be amidated at the C-terminal.

CC This particular peptide is used for detecting antibodies to HIV-1 and

CC is included in a composition with at least one other peptide of the

CC invention which can detect HTLV-I and/or -II.

CC See R13184-R13193 and R13861-6.

SQ Sequence 19 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 2 C; 0 Q; 0 E; 0 Z; 2 G; 0 H;

SQ 2 I; 1 L; 1 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:31-PDT using FindSeq

Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 50% Matches = 4 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

           X      X
           YDVPWNETI
             ||||
IWGCSGKLICTTAVPWNAS
        10 X      X



9. US-08-249-182-6 (1-9) 

R06410 HTLV-1 corresponding peptide (VIII).

ID R06410 standard; protein; 19 AA. 

AC R06410; 

DT 21-DEC-1990 (first entry) 

DE HTLV-1 corresponding peptide (VIII). 

KW HTLV-1; HIV; antibodies; vaccines; polymers;

OS Synthetic. 

PN W09008162-A. 

PD 26-JUL-1990. 

PF 16-JAN-1990; U00260. 

PR 13-JAN-1989; US-297635. 

PA (UNBI-) UNITED BIOMED INC. 

PI Yang CY; 

DR WPI; 90-254015/33. 

PT Synthetic peptide(s) corresponding to HTLV-1 and op. HIV - used 

PT for detection of antibodies, in vaccines and for development of

PT antibodies

PS Claim 1 (VIII); Page 42; 52pp; English.

CC Peptides having specific immunoreactivity to antibodies to HTLV-1

CC and HIV comprise at least one peptide selected from those

CC represented in R06403-08; and this sequence on its own (or an analogue

CC of it in which amino acids may be added, deleted or substd.) or segments,

CC mixts., conjugates or polymers of the peptides represented in R06409-12.

CC The peptides are safe, sensitive and specific in the detection of

CC antibodies. This peptide corresponds to sequences of HIV-1 or

CC HIV-2.

SQ Sequence 19 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 2 C; 0 Q; 0 E; 0 Z; 2 G; 0 H;

SQ 2 I; 1 L; 1 K; 0 M; 0 F; 1 P; 2 S; 2 T; 2 W; 0 Y; 1 V;

CC Retrieved by shears on Wed 21 Sep 94 11:57:07-PDT using FindSeq



Initial Score = 4 Optimized Score = 4 Significance = 3.67
Residue Identity = 50% Matches = 4 Mismatches = 4

Query sequence being compared:US-08-249-182-10 (1-12)
Number of sequences searched: 42145
Number of scores above cutoff: 4700

Results of the initial comparison of US-08-249-182-10 (1-12) with:

Number of residues: 5287517
Number of sequences searched: 42145
Number of scores above cutoff: 4700

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was found:

                                                    Init.  Opt.
Sequence Name  Description                   Length Score Score  Sig. Frame

  1. R37452    Autotaxin peptide ATX 102.        12    12    12  8.70     0
The list of other best scores is: 

                                                    Init.  Opt.
Sequence Name  Description                   Length Score Score  Sig. Frame

     **** 5 standard deviations above mean ****
  2. P93825    Neuron protein encoded by pCD    223     8     8  5.54     0
     **** 3 standard deviations above mean ****
  3. R08086    Feline T-cell lymphotrophic l     57           6  3.95     0
  4. P82320    PAP-III isolated from biologi    145           6  3.95     0
  5. R22403    Partial sequence of N-lipocor    304           6  3.95     0
  6. R10689    Cephalosporin antibiotic bios    319           6  3.95     0
  7. P91362    Human lipocortin-III.            323           6  3.95     0
  8. P61523    Sequence of human lipocortin.    346     6     6  3.95     0
  9. R06560    Human lipocortin obtained fro    346           6  3.95     0
 10. P82063    Human lipocortin                 346           6  3.95     0
 11. P82062    Recombinant rat lipocortin       346           6  3.95     0
 12. P82318    Lipocortin I isolated from bi    346     6     6  3.95     0
 13. R22402    Human lipocortin.                363           6  3.95     0
 14. P60657    Sequence of human lipocortin.    363           6  3.95     0
 15. R41017    Insecticidal protein gene ORF    370           6  3.95     0
 16. R14526    41.9 kD toxin.                   370           6  3.95     0
 17. P90400    Modified human lipocortin        387           6  3.95     0
 18. R03725    Human placenta-derived coagul    672           6  3.95     0
 19. R03726    Human placenta-derived coagul    786           6  3.95     0
 20. R37309    Cardiac adenylyl cyclase.       1165           6  3.95     0
 21. R32882    Cardiac adenylyl cyclase type   1184           6  3.95     0
 22. P70647    Sequence of N-terminal apolip   2721           6  3.95     0
 23. R23963    AFP-1 (Ala 2460 Val).           2783           6  3.95     0
 24. R23962    AFP-1.                          2783           6  3.95     0
 25. R03180    Receptor fragment No376-390 c     15     5     6  3.16     0
 26. R10873    Mammalian atrial natriuretic      17     5     5  3.16     0
 27. R36481    DFII-4.5(36-60), a Dermatopha     25     5     5  3.16     0
 28. R36482    DFII-4.3(45-70), a Dermatopha     26     5     5  3.16     0
 29. R36483    DFII-15(51-77), a Dermatophag     27     5     5  3.16     0
 30. R36490    DFII-22(36-63), a Dermatophag     28     5     5  3.16     0
 31. R36489    DFII-21(33-60), a Dermatophag     28     5     5  3.16     0
 32. P81534    Human insulin acceptor protei     29     5     6  3.16     0
 33. P81533    Human insulin acceptor protei     37     5     6  3.16     0
 34. R32204    Apple fruit PPO pSR8.             58     5     5  3.16     0
 35. R37918    Cyn dI derived from clone 3 (    138     5     5  3.16     0
 36. R13904    Derf II allergen encoded by p    142     5     5  3.16     0
 37. R13903    Derf II allergen encoded by p    142     5     5  3.16     0
 38. R45140    Murine cytokine synthesis inh    157     5     5  3.16     0
 39. R10159    pcD(SR alpha)-F115 mouse cyto    178     5     5  3.16     0

1. US-08-249-182-10 (1-12)

R37452 Autotaxin peptide ATX 102. 

ID R37452 standard; peptide; 12 AA. 

AC R37452; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 102. 

KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043.

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 102. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 12 AA;

SQ 0 A; 1 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 1 H;

SQ 1 I; 2 L; 0 K; 0 M; 2 F; 0 P; 1 S; 1 T; 0 W; 0 Y; 0 V;

Initial Score = 12 Optimized Score = 12 Significance = 8.70
Residue Identity = 100% Matches = 12 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
||||||||||||
DIEHLTSLDFFR
X       10 X
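
The two SQ composition lines of a record can be checked against the sequence itself. The following is a minimal sketch, assuming the residue order used in the records of this report (A R N D B C Q E Z G H on the first line, I L K M F P S T W Y V on the second). The function name is illustrative, and residues outside these 22 codes (reported as "Others" in some records) are not handled.

```python
from collections import Counter

ROW_1 = "ARNDBCQEZGH"
ROW_2 = "ILKMFPSTWYV"


def composition_lines(sequence):
    """Return the two SQ composition lines for a protein sequence."""
    counts = Counter(sequence.upper())
    return [
        "SQ " + " ".join(f"{counts[residue]} {residue};" for residue in row)
        for row in (ROW_1, ROW_2)
    ]


# The 12-residue query of this search, US-08-249-182-10 (1-12)
for line in composition_lines("DIEHLTSLDFFR"):
    print(line)
```

Because OCR frequently confuses characters such as Q with 8 or 9 and W with U, comparing these counts with the printed SQ lines is a quick way to spot damaged sequence lines.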



2. US-08-249-182-10 (1-12)

P93825 Neuron protein encoded by pCDR31. 

ID P93825 standard; protein; 223 AA. 

AC P93825; 

DT 12-JUN-1990 (first entry) 

DE Neuron protein encoded by pCDR31. 

KW Neuron polypeptide; pCDR31; paraneoplastic cerebellar degeneration.

FH Key Location/Qualifiers

FT Region 3..206

FT /note= "region corresp. to tandem rpt region in N93188"

PN EP-297585-A.

PD 04-JAN-1989.

PF 30-JUN-1988; 110488.

PR 01-JUL-1987; US-068917.

PA (SLOK) Sloan-Kettering Inst.

PI Dropcho EJ, Chen YT, Posner JB, Old LJ;

DR WPI; 89-008858/02.

DR N-PSDB; N93188.



PT antibodies - used to prevent para-neoplastic cerebellar degeneration.

PS Claim 2; Fig 3; 17pp; English.

CC The neurone peptide is produced in hosts transformed with pCDR31.

CC The peptide and MAbs raised to it are used to prevent paraneoplastic

CC cerebellar degeneration. The N-terminal Met may be absent (Claim 1).

SQ Sequence 223 AA;

SQ 6 A; 6 R; 2 N; 54 D; 0 B; 1 C; 2 Q; 34 E; 0 Z; 0 G; 0 H;

SQ 6 I; 43 L; 5 K; 11 M; 15 F; 8 P; 3 S; 6 T; 9 W; 2 Y; 10 V;

Initial Score = 8 Optimized Score = 8 Significance = 5.54
Residue Identity = 66% Matches = 8 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                  ||  |  |||||
LEDVDFQEDPNYPEDLDCWEDVDFLEDWRLLEDMDFLEDMDFLEDVDLSEDIYMLEDLDFFRKMMIDMKTWI
150       160       170       180       190       200       210

WWKT
220
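
Each database record in this report is a series of lines that begin with a two-letter line code (ID, AC, DT, DE, KW, OS, PN, PD, PF, PR, PA, PI, DR, PT, PS, CC, SQ). The sketch below groups such lines by code. It assumes the code is followed by a single space, as in this printout; `parse_record` is a hypothetical helper and not part of FindSeq or FastDB.

```python
from collections import defaultdict


def parse_record(text):
    """Group record lines by their two-letter line code."""
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        code = line[:2]
        # Skip blank lines and lines without "XX " at the start
        # (sequence rows, alignment rulers, score lines).
        if len(line) < 4 or not (code.isalpha() and code.isupper()) or line[2] != " ":
            continue
        fields[code].append(line[3:].strip())
    return dict(fields)


record = """\
ID P93825 standard; protein; 223 AA.
AC P93825;
DT 12-JUN-1990 (first entry)
DE Neuron protein encoded by pCDR31.
"""
fields = parse_record(record)
print(fields["AC"], fields["DE"])
```

Repeated codes such as KW, PR, PT and CC accumulate in order, so a multi-line title or comment can be rejoined with `" ".join(fields["CC"])`.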



3. US-08-249-182-10 (1-12) 

R08086 Feline T-cell lymphotrophic lentivirus of clone R5



ID R08086 standard; protein; 57 AA.

AC R08086;

DT 26-FEB-1991 (first entry)

DE Feline T-cell lymphotrophic lentivirus of clone R5XCXL1.

KW Feline T-cell lymphotropic lentivirus; FIV; R5XCXU; antibodies;

KW vaccines.

OS Feline T-cell lymphotropic lentivirus 2428 (Petaluma).

PN WO9013573-A.

PD 15-NOV-1990.

PF 30-APR-1990; U02338.

PR 08-MAY-1989; US-348784.

PR 08-DEC-1989; US-447810.

PA (IDEX-) IDEXX CORP.

PI Anderson PR, Oconnor TP, Tonelli QJ;

DR WPI; 90-361429/48.

DR N-PSDB; Q06655.

PT Feline T-cell lympho-tropic lentivirus poly-peptide(s) - used for

PT specific detection of FIV antibodies, prodn. of antibodies and in

PT vaccines

PS Disclosure; Fig 5(c); 37pp; English.

CC The amino acid sequence shows homology with the envelope gene of

CC equine infectious anemia virus, a lentivirus, immunologically

CC closely related to FIV.

CC Strain R5X has been deposited ATCC 67939.

CC See also Q06653-55 and R08094-96.

SQ Sequence 57 AA;

SQ 1 A; 3 R; 5 N; 2 D; 0 B; 2 C; 2 Q; 2 E; 0 Z; 1 G; 1 H;

SQ 2 I; 6 L; 0 K; 0 M; 7 F; 3 P; 6 S; 3 T; 2 W; 0 Y; 7 V;

SQ 2 Others;



Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

                                        X       10 X
                                        DIEHLTSLDFFR
                                            ||||  ||
ATNSQSCENVFSSXLPNGHNTVWEQVVFPLSIFWDLFXRVRNCVLTSLFPFRDLVIF
        10        20        30        40        50 X



4. US-08-249-182-10 (1-12) 

P82320 PAP-III isolated from biological fluid, used as an 

ID P82320 standard; protein; 145 AA. 

AC P82320; 

DT 13-NOV-1990 (first entry)

DE PAP-III isolated from biological fluid, used as anticoagulant.

KW PAP-III; anticoagulant; anti-inflammatory agent; phospholipid;

KW phospholipase A2; disseminated intravascular coagulation;

KW deep vein thrombosis.

OS Homo sapiens.

PN WO8805659-A.

PD 11-AUG-1988.

PF 05-FEB-1988; U00340.

PR 06-FEB-1987; US-011782.

PR 05-JUN-1987; US-059355.

PA (ZYMO-) Zymogenetics Inc; (UNIW) Univ of Washington.

PI Fujikawa K, Irani MM, Carter BLA;

DR WPI; 88-235049/33.

PT Human proteins having anticoagulant and antiinflammatory activity -

PT isolated from biological fluid by anion-exchange chromatographic media.

PS Disclosure; p; English.

CC The biological fluid is a highly vascularised human tissue, e.g.

CC placenta. The product binds to phospholipid and inhibits phospholipase A2.

CC The protein can substitute heparin or other anticoagulants in

CC the treatment of disseminated intravascular coagulation, deep vein

CC thrombosis, or other disorders. It also has antiinflammatory

CC properties.

SQ Sequence 145 AA;

SQ 10 A; 6 R; 2 N; 14 D; 0 B; 1 C; 4 Q; 9 E; 0 Z; 12 G; 1 H;

SQ 12 I; 18 L; 13 K; 4 M; 2 F; 0 P; 12 S; 9 T; 0 W; 9 Y; 6 V;
SQ 1 Others;

Initial Score = 6 Optimized Score = 6 Significance = 3.95

Residue Identity = 50% Matches = 6 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

                                                  X       10 X
                                                  DIEHLTSLDFFR
                                                  ||   || ||
DDLKGDLSGHMVALVMKGAGTNEDALIEILTTHKDIXQAYYTVYKKSLGDDISGETSGDFRKALLVLAVSRS
     40        50        60        70        80        90    X  100

EIDLLDIRTEFKKRYGYSLYSAIKSDTSGDYEITLLKICG
  110       120       130       140



5. US-08-249-182-10 (1-12) 

R22403 Partial sequence of N-lipocortin encoded by lambda

ID R22403 standard; Protein; 304 AA.
AC R22403;

DT 13-MAY-1992 (first entry)

DE Partial sequence of N-lipocortin encoded by lambda-NLip6-lX.

KW Antiinflammatory; arthritis; allergy; dermatology; placenta;

KW phospholipase A2 inhibitor.

OS Homo sapiens.

PN US5081019-A. 

PD 14-JAN-1992. 

PF 02-MAY-1990; 519256. 

PR 10-JAN-1985; US-690146. 

PR 15-MAR-1985; US-712376. 

PR 14-AUG-1985; US-765877. 

PR 05-SEP-1985; US-772892. 



PR 22-FEB-1989; US-314316. 

PR 02-MAY-1990; US-519256. 

PA (BIOS ) BIOGEN INC. 

PI Wallner BP, Pepinsky RB;

DR WPI; 92-048295/06.

DR N-PSDB; Q23218.

PT Deoxyribonucleic acid encoding lipo corticoid polypeptide(s) -

PT for producing antiinflammatory agents for treating arthritic,

PT allergic, dermatologic, ophthalmic and collagen diseases

PS Disclosure; Fig 31; 56pp; English.

CC The sequence was deduced from DNA obtd. from a plasmid isolated from

CC a cDNA library prepd. from polyA+ mRNA from human macrophage cell

CC line U937 (see Q23218 for details). Based on sequence homology

CC with lipocortin it is estimated that ca. 200 bp are missing from the

CC 5' end of the DNA sequence. Recombinant N-lipocortin prepd. using

CC the DNA sequence has antiinflammatory activity and can be used for

CC the treatment of arthritic, allergic, dermatologic, ophthalmic and

CC collagen diseases.

CC See also R22402.

SQ Sequence 304 AA;

SQ 19 A; 21 R; 9 N; 28 D; 0 B; 3 C; 13 Q; 20 E; 0 Z; 17 G; 2 H;

SQ 22 I; 31 L; 30 K; 8 M; 6 F; 5 P; 21 S; 15 T; 1 W; 17 Y; 16 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
|| || ||

QYDASELKASMKGLGTDEDSLIEIICSRTNQELQEINRVYKEMYKTDLEKDIISDTSGDFRKLMVALAKGRR
80 90 100 110 120 X 130 X 140

AEDGSVIDYELIDQDARDLYDAGVKRKGTDVPKWISIMTE
150 160 170 180



6. US-08-249-182-10 (1-12) 

R10689 Cephalosporin antibiotic biosynthetic enzyme #3. 

ID R10689 standard; Protein; 319 AA. 

AC R10689; 

DT 27-MAR-1991 (first entry) 

DE Cephalosporin antibiotic biosynthetic enzyme #3. 

KW cephalosporin; antibiotic; 

KW S-(L-alpha-aminoadipyl)-L-cysteinyl-D-valine synthetase;

KW isopenicillin N synthetase; isopenicillin N epimerase; 

KW deacetoxycephalosporin C synthetase; beta-lactamase; 

KW deacetoxycephalosporin C hydroxylase. 

OS Lysobacter lactamgenus. 

PN J02291274-A. 

PD 03-DEC-1990. 

PF 10-JAN-1990; 003762. 

PR 01-FEB-1989; JP-024710. 

PR 10-JAN-1990; JP-003762. 

PA (TAKE ) TAKEDA CHEMICAL IND KK. 

DR WPI; 91-018854/03. 

DR N-PSDB; Q10190.

PT Prepn. of cephalosporin series antibiotics - comprises culturing 

PT transformant of microbe transformed by plasmid contg. new DNA 

PT fragment 

PS Disclosure; Fig 15; 67pp; Japanese.

CC This protein is encoded by ORF3 of the 23666bp sequence

CC isolated from L. lactamgenus and comprising the genes for the

CC cephalosporin biosynthetic enzymes listed in the KEYWORDS. Plasmids



CC microbes, such as bacteria or yeast.

CC See also Q10191-2.

SQ Sequence 319 AA;

SQ 27 A; 28 R; 9 N; 27 D; 0 B; 5 C; 10 Q; 19 E; 0 Z; 22 G; 4 H;

SQ 12 I; 23 L; 7 K; 7 M; 22 F; 19 P; 26 S; 16 T; 2 W; 9 Y; 25 V;



Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

            X       10 X
            DIEHLTSLDFFR
              ||   || ||
MTDSGiaiFDLDELEHGVRLDSFRKSLFERGVFYVREDDSIKTEHAKAHDAVHDLFENGSAESKNALRNLTP
        10  X     20   X    30        40        50        60        70



7. US-08-249-182-10 (1-12) 

P91362 Human lipocortin-III.



ID P91362 standard; protein; 323 AA. 

AC P91362; 

DT 10-MAR-1993 (revised)

DT 22-DEC-1989 (first entry)

DE Human lipocortin-III.

KW Human lipocortin-III; lambdaHLipo III-5; anti-inflammatory agent.

OS Homo sapiens.

PN EP-330396-A. 

PD 30-AUG-1989. 

PF 20-FEB-1989; 301603. 

PR 26-FEB-1988; US-160866. 

PA (BIOJ) Biogen, Inc.

PI Wallner BP, Pepinsky RB, Browning JL;

DR WPI; 89-250486/35.

PT Human lipocortin cpds. III, IV, V, and VI - used in treatment of

PT arthritic, allergic, dermatologic, ophthalmic and collagen disorders

PT involving inflammatory processes.

PS Claim 15; fig 3; 32pp; English.

CC Human lipocortin-III was isolated from a lambda gt11 human lung cDNA

CC library with rat lipocortin-III cDNA of lambda RLipo III-5 as probe.

CC Lipocortins are anti-inflammatory agents and can be used to treat

CC arthritic, allergic, dermatologic, ophthalmic, and collagen diseases.

CC See also N90598, N90599, and P91363.

SQ Sequence 323 AA;

SQ 23 A; 19 R; 7 N; 30 D; 0 B; 3 C; 11 Q; 22 E; 0 Z; 22 G; 7 H;

SQ 24 I; 36 L; 26 K; 6 M; 11 F; 6 P; 23 S; 21 T; 2 W; 12 Y; 12 V;



Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
|| || ||

VFDAKQLKKSMKGAGTNEDALIEILTTRTSRQHKDISQAYYTVYKKSLGDDITSETSGDFRKALLTLADGRR
100 110 120 130 140 X 150 X 160

DESLKLDEHLAKQDAQILYKAGENRWGTDEDKFTEILCLR
170 180 190 200



8. US-08-249-182-10 (1-12) 



ID P61523 standard; Protein; 346 AA. 

AC P61523; 

DT 08-AUG-1991 (first entry) 

DE Sequence of human lipocortin.

KW Anti-inflammatory agent; steroid mediation; arthritis therapy;

KW allergy.

OS Homo sapiens.

PN WO8604094-A.

PD 17-JUL-1986. 

PF 10-JAN-1986; U00027. 

PR 10-JAN-1985; US-690146. 

PR 15-HAR-1985! US-712376. 

PR 14-AUG-19855 US-765877. 

PR 05-SEP-1985! US-772892. 

PR 10-JAN-1986; US-929199. 

PA (BIOJ ) BIOGEN NV. 

PI Hallner BP, Pepinsky BR» Garuin JL, Schindler DG, Huang KS; 

DR UPI; 86-196888/30. 

OR N-PSDB; N60556. 

PT Neu hunan lipocortin-like polypeptide (s) - are obtd. by 

PT reconbinant DNA techniques and are antiinflammatory agents 

PT without usual side effects 

PS Claim 18; Page 79; 113pp; English. 

CC A hunan cDNA library of Escherichia coli cells contg. hunan 

CC Macrophage cDNA sequences inserted into a phage cloning vector uas 

CC screened using antisense oligonucleotide DNA probes corresp. to 

CC those regions of rat phospholipase A2 inhibitor protein having 

CC nininal nucleotide degeneracy (N60561-N60564) to obtain a sequence 

CC coding for hunan lipocortin (LO-like polypeptide. 

S8 Sequence 346 AA; 

so 32 A; 19 R; 13 N; 26 D; o B; 4 c; 12 a; 27 e; 0 Z; 20 G; 5 h; 

S8 21 I; 34 L; 32 K; 9 M; 11 F; 8 P; 19 S; 22 T; 1 u; 11 V; 20 V; 



Initial Score = 6 Optinized Score = 6 Significance = 3.95 
Residue Identity = 507. Hatches = 6 Mi snatches = 6 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

DIEHLTSLDFFR 
II II II 

EFDADELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLGKGDR 
120 130 140 150 160 170 180 



SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
190 200 210 220 
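
Each database record above is a sequence of lines that begin with a two-letter code (ID, AC, DT, PR, and so on), and some codes repeat. A minimal Python sketch of grouping such lines by code is given below, assuming the code is always followed by a space and the value. The helper name `parse_record` and the sample text are illustrative and not taken from any FastDB tool.

```python
from collections import defaultdict

def parse_record(text):
    """Group the lines of one record by their leading two-letter code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        code, _, value = line.partition(" ")
        # Keep only lines that look like '<two capital letters> <value>'.
        if len(code) == 2 and code.isupper() and value.strip():
            fields[code].append(value.strip())
    return dict(fields)

sample = """ID P61523 standard; Protein; 346 AA.
AC P61523;
PR 10-JAN-1985; US-690146.
PR 15-MAR-1985; US-712376."""

record = parse_record(sample)
print(record["AC"])       # the accession line
print(len(record["PR"]))  # number of priority lines
```

Repeated codes such as PR are kept in their printed order, which matters for the priority dates.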



9. US-08-249-182-10 (1-12) 

R06560 Human lipocortin obtained from lambda LC.

ID R06560 standard; protein; 346 AA.
AC R06560;
DT 14-JAN-1991 (first entry)
DE Human lipocortin obtained from lambda LC.
KW Human lipocortin; lambda LC; inflammation reduction; arthritis;
KW phospholipase A2.
OS Homo sapiens.
PN US4950646-A.
PD 21-AUG-1990.
PF 10-JAN-1986; 929199.
PR 10-JAN-1985; US-690146.
PR 15-MAR-1985; US-712376.
PR 14-AUG-1985; US-765877.
PR 05-SEP-1985; US-772892.
PR [illegible in source]
PA (BIOJ ) BIOGEN NV.
PI Wallner BP, Pepinsky RB, Garwin JL, Schindler DG, Huang KS;
DR WPI; 90-274549/36.
DR N-PSDB; Q05809.
PT Pure fragment of human lipocortin - useful for reducing
PT inflammation or for treating arthritis, etc.
PS Disclosure; Fig 4; 51pp; English.
CC cDNA can be operatively linked to expression control sequences and used
CC in various mammalian or other eukaryotic or prokaryotic host cells to
CC produce the human lipocortin-like polypeptide. The peptide was
CC shown to inhibit exogenous phospholipase A2 in in vitro assays.
CC The 37 kD protein can be used for reducing inflammation or treating
CC arthritic, allergic, dermatologic, ophthalmic and collagen diseases
CC and other diseases involving inflammation processes.
CC See also Q05805-25 and R07926-37.
SQ Sequence 346 AA;
SQ 34 A; 19 R; 13 N; 25 D; 0 B; 4 C; 13 Q; 26 E; 0 Z; 20 G; 6 H;
SQ 21 I; 34 L; 32 K; 9 M; 11 F; 8 P; 18 S; 22 T; 1 W; 11 Y; 19 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
11 II II 

8FHADELRAAHKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLAKGDR 
120 130 140 150 160 170 180 

SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
190 200 210 220 



10. US-08-249-182-10 (1-12) 

P82063 Human lipocortin

ID P82063 standard; protein; 346 AA.
AC P82063;
DT 22-OCT-1990 (first entry)
DE Human lipocortin
KW recombinant rat lipocortin; rat abdominal dropsy; human lipocortin; ss.
OS Homo sapiens.
PN J63276497-A.
PD 14-NOV-1988.
PF 08-MAY-1987; JP-112145.
PR 08-MAY-1987; J6-JP-112145.
PA (YAMA) Yamanouchi Pharm KK.
DR WPI; 88-365616/51.
PT Recombinant rat lipocortin -
PT obtd using gene derived from cells in rat abdominal dropsy
PS Disclosure; Japanese.
CC Human lipocortin has strong homology to rat lipocortin, differing
CC at only 38 positions out of 346. (See P82063 for rat sequence).
CC To obtain the rat sequence mRNA was isolated from rat abdominal
CC dropsy cells and cDNA synthesised from it. Probes were synthesised
CC according to the partial amino acid sequence of rat lipocortin
CC (see N82038 and N82039). These were used to isolate plasmids contg
CC the desired sequence of lipocortin DNA.
SQ Sequence 346 AA;
SQ 34 A; 19 R; 11 N; 26 D; 0 B; 5 C; 14 Q; 25 E; 0 Z; 20 G; 5 H;
SQ 21 I; 34 L; 33 K; 9 M; 10 F; 8 P; 19 S; 21 T; 0 W; 13 Y; 19 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



X 10 X 

DIEHLTSLDFFR 
II II II 

QFDADELRAANKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLAKGDR
120 130 140 150 160 170 180 

SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
190 200 210 220 



11. US-08-249-182-10 (1-12) 

P82062 Recombinant rat lipocortin

ID P82062 standard; protein; 346 AA.
AC P82062;
DT 22-OCT-1990 (first entry)
DE Recombinant rat lipocortin
KW recombinant rat lipocortin; rat abdominal dropsy; ss.
OS Rattus.
PN J63276497-A.
PD 14-NOV-1988.
PF 08-MAY-1987; JP-112145.
PR 08-MAY-1987; J6-JP-112145.
PA (YAMA) Yamanouchi Pharm KK.
DR WPI; 88-365616/51.
DR N-PSDB; N82025.
PT Recombinant rat lipocortin -
PT obtd using gene derived from cells in rat abdominal dropsy
PS Disclosure; Japanese.
CC mRNA was isolated from rat abdominal dropsy cells and cDNA
CC synthesised from it. Probes were synthesised according to the
CC partial amino acid sequence of rat lipocortin (see N82038 and
CC N82039). These were used to isolate plasmids contg the desired
CC sequence of lipocortin DNA.
CC See also P82063.
SQ Sequence 346 AA;
SQ 33 A; 19 R; 11 N; 24 D; 0 B; 6 C; 16 Q; 26 E; 0 Z; 19 G; 5 H;
SQ 21 I; 33 L; 31 K; 10 M; 9 F; 9 P; 16 S; 26 T; 0 W; 14 Y; 18 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 

II II II 

QFDADELRAAHKGLGTDEDTLIEILTTRSNQ9IREITRVYREELKRDLAKDITSDTSGDFRNALLALAKGDR 
120 130 140 150 160 170 180 



CEDHSVNODLADTDARALYEAGERRKGTDVNVFNTILTTR 
190 200 210 220 



12. US-08-249-182-10 (1-12) 

P82318 Lipocortin I isolated from biological fluid, used

ID P82318 standard; protein; 346 AA.
AC P82318;
DT 13-NOV-1990 (first entry)
DE Lipocortin I isolated from biological fluid, used as anticoagulant.
KW Lipocortin I; anticoagulant; anti-inflammatory agent; phospholipid;
KW phospholipase A2; disseminated intravascular coagulation;
KW deep vein thrombosis.
OS Homo sapiens.
PN WO8805659-A.
PD [illegible in source]
PF 05-FEB-1988; U00340.
PR 06-FEB-1987; US-011782.
PR 05-JUN-1987; US-059355.
PA (ZYMO-) ZymoGenetics Inc; (UNIW) Univ of Washington.
PI Fujikawa K, Irani HH, Carter BLA;
DR WPI; 88-235049/33.
PT Human proteins having anticoagulant and antiinflammatory activity -
PT isolated from biological fluid by anion-exchange chromatographic media.
PS Disclosure; p; English.
CC The biological fluid is a highly vascularised human tissue, e.g.
CC placenta. The product binds to phospholipid and inhibits phospholipase A2.
CC The protein can substitute heparin or other anticoagulants in
CC the treatment of disseminated intravascular coagulation, deep vein
CC thrombosis, or other disorders. It also has antiinflammatory
CC properties.
SQ Sequence 346 AA;
SQ 33 A; 19 R; 14 N; 26 D; 0 B; 4 C; 13 Q; 26 E; 0 Z; 20 G; 5 H;
SQ 21 I; 34 L; 32 K; 9 M; 11 F; 8 P; 19 S; 22 T; 1 W; 11 Y; 18 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



X 10 X 

DIEHLTSLDFFR 
II II II 

QFDADELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLAKGDR 
120 130 140 150 160 170 180 



SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
190 200 210 220 



13. US-08-249-182-10 (1-12) 

R22402 Human lipocortin.

ID R22402 standard; Protein; 363 AA.
AC R22402;
DT 13-MAY-1992 (first entry)
DE Human lipocortin.
KW Antiinflammatory; arthritis; allergy; dermatology; recombinant.
KW Phospholipase A2 inhibitor.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Peptide 1..19
FT /label= signal_sequence
FT Protein 20..363
FT /label= lipocortin
PN US5081019-A.
PD 14-JAN-1992.
PF 02-MAY-1990; 519256.
PR 10-JAN-1985; US-690146.
PR 15-MAR-1985; US-712376.
PR 14-AUG-1985; US-765877.
PR 05-SEP-1985; US-772892.
PR 06-MAR-1986; US-837019.
PR 22-FEB-1989; US-314316.
PR 02-MAY-1990; US-519256.
PA (BIOJ ) BIOGEN INC.
PI Wallner BP, Pepinsky RB;
DR WPI; 92-048295/06.
DR N-PSDB; Q23216.
PT Deoxyribonucleic acid encoding lipocorticoid polypeptide(s) -
PT for producing antiinflammatory agents for treating arthritic,
PT allergic, dermatologic, ophthalmic and collagen diseases
PS Disclosure; [figure and page count illegible in source]; English.
CC The sequence deduced from DNA obtd. from 14 overlapping clones
CC isolated from a cDNA library prepd. from polyA+ mRNA from human
CC macrophage cell line U937 (see Q23216 for details). Recombinant
CC lipocortin prepd. using the DNA sequence has antiinflammatory
CC activity and can be used for the treatment of arthritic, allergic,
CC dermatologic, ophthalmic and collagen diseases.
CC See also R22403.
SQ Sequence 363 AA;
SQ 33 A; 20 R; 13 N; 28 D; 0 B; 4 C; 14 Q; 27 E; 0 Z; 21 G; 5 H;
SQ 21 I; 36 L; 34 K; 9 M; 14 F; 8 P; 22 S; 23 T; 1 W; 11 Y; 19 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II II II 

QFDADELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLAKGDR
140 150 160 170 180 X 190 X 200 

SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
210 220 230 240 



14. US-08-249-182-10 (1-12) 

P60657 Sequence of human lipocortin. 

ID P60657 standard; Protein; 363 AA.
AC P60657;
DT 08-AUG-1991 (first entry)
DE Sequence of human lipocortin.
KW Anti-inflammatory agent; steroid mediation; arthritis therapy;
KW allergy.
OS Homo sapiens.
PN WO8604094-A.
PD 17-JUL-1986.
PF 10-JAN-1986; U00027.
PR 10-JAN-1985; US-690146.
PR 15-MAR-1985; US-712376.
PR 14-AUG-1985; US-765877.
PR 05-SEP-1985; US-772892.
PR 10-JAN-1986; US-929199.
PA (BIOJ ) BIOGEN NV.
PI Wallner BP, Pepinsky BR, Garwin JL, Schindler DG, Huang KS;
DR WPI; 86-196888/30.
DR N-PSDB; N60555.
PT New human lipocortin-like polypeptide(s) - are obtd. by
PT recombinant DNA techniques and are antiinflammatory agents
PT without usual side effects
PS Claim 18; page 78-79 and Fig 4; 113pp; English.
CC A human cDNA library of Escherichia coli cells contg. human
CC macrophage cDNA sequences inserted into a phage cloning vector was
CC screened using antisense oligonucleotide DNA probes corresp. to
CC those regions of rat phospholipase A2 inhibitor protein having
CC minimal nucleotide degeneracy (N60561-N60564) to obtain a sequence
CC coding for human lipocortin (LC)-like polypeptide.
SQ Sequence 363 AA;
SQ 33 A; 20 R; 13 N; 28 D; 0 B; 4 C; 14 Q; 27 E; 0 Z; 21 G; 5 H;
SQ 21 I; 36 L; 34 K; 9 M; 14 F; 8 P; 22 S; 23 T; 1 W; 11 Y; 19 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



DIEHLTSLDFFR 
II II II 

QFDADELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNALLSLAKGDR
140 150 160 170 180 X 190 X 200 



SEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTR 
210 220 230 240 



15. US-08-249-182-10 (1-12)

R41017 Insecticidal protein gene ORF-2 prod.

ID R41017 standard; Protein; 370 AA.
AC R41017;
DT 25-MAR-1994 (first entry)
DE Insecticidal protein gene ORF-2 prod.
KW Caulobacter; plasmid; insecticidal protein; Bacillus thuringiensis;
KW Bacillus sphaericus; larva; mosquitoe; Culex; Anopheles; Psorophoa;
KW Mansonia; Aedes.
OS Bacillus sphaericus strain 2297.
PN J052U866-A.
PD 24-AUG-1993.
PF 05-JUN-1991; 140963.
PR 06-JUN-1990; JP-148444.
PA (SUM-) SILNARAN SO TANABAL.
DR WPI; 93-298916/38.
DR N-PSDB; Q48714.
PT Expression of insecticidal protein - by transforming Caulobacter
PT with plasmid contg. gene coding for insecticidal protein
PS Disclosure; Page 10-14; 27pp; Japanese.
CC Caulobacter transformed with a plasmid contg. a gene encoding
CC insecticidal protein derived from Bacillus thuringiensis or
CC Bacillus sphaericus will proliferate in aq. environment.
CC They may be consumed by larvae of mosquitoes and are lethal to
CC Culex, Anopheles, Psorophoa, Mansonia and Aedes.
SQ Sequence 370 AA;
SQ 13 A; 14 R; 27 N; 19 D; 0 B; 3 C; 13 Q; 22 E; 0 Z; 22 G; 6 H;
SQ 33 I; 22 L; 15 K; 8 M; 21 F; 22 P; 32 S; 40 T; 3 W; 21 Y; 14 V;

Initial Score = 6 Optimized Score = 6 Significance = 3.95
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II I III 

SAPNGDIWTEICSRENM8YFIFFPTDDGRVIIANRHNGSVFTGEATSVVSDIYTGSPL8FFREFKRTMSTVY 
40 50 60 70 80 X 90 100 



LAI 8NPESATDVRALEPNSHELPSRL YFTNN I ENNSN I LI 
110 120 130 140 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_10p.res made by on Thu 22 Sep 94 10:31:46-PDT.

Query sequence being compared: US-08-249-182-10 (1-12)
Number of sequences searched: 70848
Number of scores above cutoff: 4472

Data bank: PIR 41, all entries



[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) against SCORE (0 to 12) and STDEV (-1 to 9)]



PARAMETERS

Similarity matrix      Unitary    K-tuple                  2
Mismatch penalty             1    Joining penalty         20
Gap penalty               1.00    Window size              5
Gap size penalty          0.05
Cutoff score                 0

Initial scores to save      40    Alignments to save      15
Optimized scores to save     0    Display context         50

SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
              2         3                  1.03

Times:     CPU            Total Elapsed
           00:01:27.03    00:01:37.00

Number of residues: 20816057
Number of sequences searched: 70848
Number of scores above cutoff: 4472

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score.
Significance is calculated based on initial score. 
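
The report does not print the formula, but the "standard deviations above mean" bands suggest that significance is the initial score expressed in standard deviations from the mean score. A minimal Python sketch under that assumption follows. The function name `significance` is illustrative, and because the mean (2) and standard deviation (1.03) are printed in rounded form, the results differ from the report's figures in the second decimal place.

```python
def significance(score, mean, std_dev):
    """Initial score expressed in standard deviations above the mean (assumed formula)."""
    return (score - mean) / std_dev

# Rounded search statistics as printed in the report.
MEAN, STD_DEV = 2, 1.03

for initial_score in (12, 8, 7, 6):
    # The report lists 9.72, 5.83, 4.86 and 3.89 for these initial scores.
    print(initial_score, round(significance(initial_score, MEAN, STD_DEV), 2))
```

Under this reading, an initial score of 6 falls in the "3 standard deviations above mean" band and a score of 7 in the "4 standard deviations" band, which agrees with the grouping in the score list.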



A 100% similar sequence to the query sequence was found:

                                                    Init.   Opt.
     Sequence Name Description             Length  Score  Score   Sig.  Frame

  1. A42329   autotaxin - human (fragments)    114     12     12   9.72      0

The list of other best scores is:

                                                    Init.   Opt.
     Sequence Name Description             Length  Score  Score   Sig.  Frame

               **** 5 standard deviations above mean ****
  2. A29770   cerebellar degeneration-relat    223      8      8   5.83      0
               **** 4 standard deviations above mean ****
  3. WMBEA1   ribonucleoside-diphosphate re    321      7      7   4.86      0
  4. S07743   cytochrome b - Paramecium tet    391      7      7   4.86      0
  5. S18819   catalase (EC 1.11.1.6) - maiz    491      7      8   4.86      0
  6. A36062   catalase (EC 1.11.1.6) - maiz    491      7      8   4.86      0
  7. S10770   catalase (EC 1.11.1.6) - upla    492      7      8   4.86      0
  8. S10395   catalase (EC 1.11.1.6) chain     492      7      8   4.86      0
  9. S07124   catalase (EC 1.11.1.6) - swee    492      7      8   4.86      0
 10. CSSY     catalase (EC 1.11.1.6) - soyb    492      7      8   4.86      0
 11. CSPM     catalase (EC 1.11.1.6) - gard    494      7      8   4.86      0
 12. S35908   ATP-dependent Clp proteinase     763      7      7   4.86      0
               **** 3 standard deviations above mean ****
 13. C20554   hemocyanin LpIIa - Atlantic h     20      6      6   3.89      0
 14. A20554   hemocyanin chain LpI - Atlant     24      6      6   3.89      0
 15. F20554   hemocyanin LpIV - Atlantic ho     26      6      6   3.89      0
 16. S31019   gene 74 protein - Mycobacteri     72      6      6   3.89      0
 17. S21559   DNA gyrase chain B - Lyme dis     84      6      6   3.89      0
 18. JN0145   hypothetical 13.6K protein (d    117      6      7   3.89      0
 19. S33081   G4L protein - variola virus      124      6      6   3.89      0
 20. A36844   H4L protein - variola virus (    124      6      6   3.89      0
 21. 142511   G4L protein - vaccinia virus     124      6      6   3.89      0
 22. D35252   probable hemD protein - Bacil    130      6      6   3.89      0
 23. S27507   hypothetical protein 1 - Baci    145      6      6   3.89      0
 24. S41300   gene ninC protein - Bacteroid    163      6      6   3.89      0
 25. S05361   hypothetical protein B (tdcR     184      6      6   3.89      0
 26. S39221   hypothetical protein YBR0712     211      6      6   3.89      0
 28. S27789   hypothetical protein - Caenor    239      6      6   3.89      0
 29. BVECAY   appY protein - Escherichia co    243      6      6   3.89      0
 30. BVECH5   115 polypeptide - Escherichia    249      6      6   3.89      0
 31. A45921   chorismate mutase (EC 5.4.99.    256      6      6   3.89      0
 32. B42728   uroporphyrinogen-III synthase    262      6      6   3.89      0
 33. A29633   arabinose operon regulatory p    281      6      6   3.89      0
 34. RGEBAT   arabinose operon regulatory p    281      6      6   3.89      0
 35. JT0961   glutathione synthase (EC 6.3.    285      6      6   3.89      0
 36. S32480   hypothetical protein B (L41 3    286      6      6   3.89      0
 37. 513800   centrosomin A - mouse            289      6      6   3.89      0
 38. S40580   arabinose operon regulatory p    292      6      6   3.89      0
 39. RGECA    arabinose operon regulatory p    292      6      6   3.89      0
 40. YXUNTP   thymidylate synthase (EC 2.1.    297      6      6   3.89      0
0 



1. US-08-249-182-10 (1-12) 

A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments
TITLE autotaxin - human (fragments)
ORGANISM #formal_name Homo sapiens #common_name man
DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change
 08-May-1993
ACCESSIONS A42329
REFERENCE A42329
 #authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
 Cioce, V.; Schiffmann, E.; Liotta, L.A.
 #journal J. Biol. Chem. (1992) 267:2524-2529
 #title Identification, purification, and partial sequence analysis
 of autotaxin, a novel motility-stimulating protein.
 #cross-references MUID:92129337
 #accession A42329
 ##status preliminary
 ##molecule_type protein
 ##residues 1-114 ##label STR
 ##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
 NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
 NCBIP:78509; NCBIP:78508; NCBIP:78503
 ##note sequence extracted from NCBI backbone
SUMMARY #length 114 #checksum 7335
SEQUENCE

Initial Score = 12 Optimized Score = 12 Significance = 9.72
Residue Identity = 100% Matches = 12 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
||||||||||||

TEFLSNYLTNVDDITLVPGTLGRDIEHLTSLDFFRVNSM8TVFVGYGPTFKGGSPLHITATKSPPFENINLY 
10 20 X 30 X 40 50 60 70 



YDVPWNETIPEEV 
80 
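
The Matches and Residue Identity figures in these alignments can be reproduced by sliding the 12-residue query along the database sequence without gaps and counting identical positions. The Python sketch below does only that. It is not FastDB's scoring algorithm, the function names are illustrative, and truncating the percentage to a whole number is an assumption that happens to agree with the printed values (8 of 12 is shown as 66%).

```python
def best_ungapped_window(query, target):
    """Return (matches, offset) for the ungapped window of target most identical to query."""
    best = (0, 0)
    for offset in range(len(target) - len(query) + 1):
        window = target[offset:offset + len(query)]
        matches = sum(q == t for q, t in zip(query, window))
        if matches > best[0]:
            best = (matches, offset)
    return best

def identity_percent(matches, aligned_length):
    """Percent identity truncated to a whole number (assumed rounding rule)."""
    return 100 * matches // aligned_length

QUERY = "DIEHLTSLDFFR"

# Short excerpts of the aligned regions printed for A42329 and A29770.
autotaxin_excerpt = "GTLGRDIEHLTSLDFFRVNSM"
cdr_excerpt = "QEDIYWLEDLDFFRKMW"

for excerpt in (autotaxin_excerpt, cdr_excerpt):
    matches, offset = best_ungapped_window(QUERY, excerpt)
    print(matches, offset, identity_percent(matches, len(QUERY)))
```

For the autotaxin excerpt every position matches, and for the cerebellar degeneration-related protein excerpt eight positions match, in line with the Matches values reported for entries 1 and 2.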



2. US-08-249-182-10 (1-12) 

A29770 cerebellar degeneration-related protein - human

ENTRY A29770 #type complete
TITLE cerebellar degeneration-related protein - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 03-Nov-1987 #sequence_revision 03-Nov-1987 #text_change
 24-Jun-1993
ACCESSIONS A29770
 #authors Dropcho, E.J.; Chen, Y.T.; Posner, J.B.; Old, L.J.
 #journal Proc. Natl. Acad. Sci. U.S.A. (1987) 84:4552-4556
 #title Cloning of a brain protein identified by autoantibodies from
 a patient with paraneoplastic cerebellar degeneration.
 #cross-references MUID:87260846
 #accession A29770
 ##status preliminary
 ##molecule_type mRNA
 ##residues 1-223 ##label DRO
SUMMARY #length 223 #molecular-weight 27034 #checksum 9794
SEQUENCE

Initial Score = 8 Optimized Score = 8 Significance = 5.83
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
||  |  |||||

LEDVDFSEDPNYPEDLDCWEDVDFLEDWRLLEDMDFLEDMDFLEDVDLQEDIYWLEDLDFFRKMWIDHKTUI 
150 160 170 180 190 200 210 

UUKT 
220 



3. US-08-249-182-10 (1-12) 

WMBEA1 ribonucleoside-diphosphate reductase (EC 1.17.4.1)

ENTRY WMBEA1 #type complete
TITLE ribonucleoside-diphosphate reductase (EC 1.17.4.1) small
 chain - equine herpesvirus 1 (strain Ab4p)
ALTERNATE_NAMES ribonucleotide reductase small chain
ORGANISM #formal_name equine herpesvirus 1
 #note host Equus caballus (domestic horse)
DATE 30-Sep-1992 #sequence_revision 30-Sep-1992 #text_change
 04-Mar-1994
ACCESSIONS C36797
REFERENCE A36805
 #authors Telford, E.A.R.; Watson, M.S.; McBride, K.; Davison, A.J.
 #submission submitted to GenBank, March 1992
 #description The DNA sequence of equine herpesvirus-1.
 #accession C36797
 ##molecule_type DNA
 ##residues 1-321 ##label TEL
 ##cross-references GB:M86664
REFERENCE A41831
 #authors Telford, E.A.R.; Watson, M.S.; McBride, K.; Davison, A.J.
 #journal Virology (1992) 189:304-316
 #title The DNA sequence of equine herpesvirus-1.
 #cross-references MUID:92295566
 #contents annotation; possible protein-coding frames
 #note neither protein nor nucleotide sequence is given in this
 paper
GENETICS
 #gene 20
CLASSIFICATION #superfamily herpesvirus ribonucleoside-diphosphate reductase
 small chain
KEYWORDS early protein; oxidoreductase
SUMMARY #length 321 #molecular-weight 36017 #checksum 5528
SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 4.86
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
||||| ||

MSIENSKEAALTAELSLAGAFFYTPECPDIEHLRSLSVANRHLDTDLPISDDLKDVAKLTPAEREFYRFLFA 
10 20 30 40 50 60 70 

FLSAADDLVNLNLGDLSA 
80 90 



4. US-08-249-182-10 (1-12) 

S07743 cytochrome b - Paramecium tetraurelia mitochondrio

ENTRY S07743 #type complete
TITLE cytochrome b - Paramecium tetraurelia mitochondrion (SGC6)
ORGANISM #formal_name mitochondrion Paramecium tetraurelia
DATE 31-Mar-1990 #sequence_revision 31-Mar-1990 #text_change
 18-Jun-1993
ACCESSIONS S07743
REFERENCE S07725
 #authors Pritchard, A.E.; Seilhamer, J.J.; Mahalingam, R.; Sable,
 C.L.; Venuti, S.E.; Cummings, D.J.
 #journal Nucleic Acids Res. (1990) 18:173-180
 #title Nucleotide sequence of the mitochondrial genome of
 Paramecium.
 #cross-references MUID:90174913
 #accession S07743
 ##molecule_type DNA
 ##residues 1-391 ##label PRI
 ##cross-references EMBL:X15917
 ##note the translation of the nucleotide sequence is not given
 in this paper
GENETICS
 #gene cytB
 #genome mitochondrion
 #genetic_code SGC6
 #start_codon ATC
KEYWORDS mitochondrion
SUMMARY #length 391 #molecular-weight 46044 #checksum 3513
SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 4.86
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
||| |   |||

KNLRVSFHEVFSLFGFFTFMTIIVQLVSGTMLAFSSVPEPMLIPTVRDEEDIEDLYTDDFFHLHERGVDLIF 
10 20 30 40 50 60 70 80 

IFSYFHLLRKLYLNVFDLETEASWKSGVFSFLVFGVVVFF 
90 100 110 120 



5. US-08-249-182-10 (1-12) 

S18819 catalase (EC 1.11.1.6) - maize

ENTRY S18819 #type complete
TITLE catalase (EC 1.11.1.6) - maize
ORGANISM #formal_name Zea mays #common_name maize
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
 22-Nov-1993
ACCESSIONS S18819
REFERENCE S18819
 #journal Plant Physiol. (1991) 96:1379-1381
 #title Comparison of the cat2 complementary DNA sequences of a
 normal catalase activity line (W64A) and a high catalase
 activity line (R6-67) of maize.
 #accession S18819
 ##status preliminary
 ##residues 1-491 ##label GUA
 ##cross-references EMBL:X54819
SUMMARY #length 491 #molecular-weight 56465 #checksum 8415
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

DSSLTVGARGPILLEDYHCEKLANFDRERIPERVVHARGASAKGFFEVTHDITHLTCADFLRAPGV8TPVIV 
30 40 50 60 70 80 90 100 

RFSTVIHERGSPETLRDPRGFAVKFYTREGNWDLVGNNFP 
110 120 130 140 



6. US-08-249-182-10 (1-12) 

A36062 catalase (EC 1.11.1.6) - maize

ENTRY A36062 #type complete
TITLE catalase (EC 1.11.1.6) - maize
ORGANISM #formal_name Zea mays #common_name maize
DATE 16-Nov-1990 #sequence_revision 13-Jan-1993 #text_change
 23-Jun-1993
ACCESSIONS A36062
REFERENCE A36062
 #authors Bethards, L.A.; Skadsen, R.W.; Scandalios, J.G.
 #journal Proc. Natl. Acad. Sci. U.S.A. (1990) 87:6927
 #cross-references MUID:90370897
 #contents Erratum
 #accession A36062
 ##status preliminary
 ##molecule_type mRNA
 ##residues 1-491 ##label BET
 ##cross-references GB:J02976
 ##note the authors translated the codon AAC for residue 124 as
 Lys
CLASSIFICATION #superfamily catalase
KEYWORDS oxidoreductase
SUMMARY #length 491 #molecular-weight 56506 #checksum 8423
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

DSSLTVGARGPILLEDYHCEKLANFDRERIPERVVHARGASAKGFFEVTHDITHLTCADFLRAPGVQTPVIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVNFYTREGNUDLVGNNFP 
110 120 130 140 



7. US-08-249-182-10 (1-12)

S10770 catalase (EC 1.11.1.6) - upland cotton

ENTRY S10770 #type complete
TITLE catalase (EC 1.11.1.6) - upland cotton
ORGANISM #formal_name Gossypium hirsutum #common_name upland cotton
DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
 21-Nov-1993
ACCESSIONS S10770
REFERENCE S10770
 #authors Ni, W.; Turley, R.B.; Trelease, R.N.
 #journal Biochim. Biophys. Acta (1990) 1049:219-222
 #title Characterization of a cDNA encoding cottonseed catalase.
 #cross-references MUID:90304227
 #accession S10770
 ##status preliminary
 ##residues 1-492 ##label NIA
SUMMARY #length 492 #molecular-weight 56855 #checksum 5792
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

SSLTVGPRG8YLLEDYHLVEKLANFDRERIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGVSTPVIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVKFYTREGNFDLVGNNFP 
110 120 130 140 



8. US-08-249-182-10 (1-12) 

S10395 catalase (EC 1.11.1.6) chain 1 - upland cotton

ENTRY S10395 #type complete
TITLE catalase (EC 1.11.1.6) chain 1 - upland cotton
ORGANISM #formal_name Gossypium hirsutum #common_name upland cotton
DATE 31-Dec-1990 #sequence_revision 31-Dec-1990 #text_change
 28-Apr-1993
ACCESSIONS S10395
REFERENCE S10395
 #authors Weiting, N.; Turley, R.B.; Trelease, R.N.
 #submission submitted to the EMBL Data Library, March 1990
 #accession S10395
 ##molecule_type mRNA
 ##residues 1-492 ##label WEI
 ##cross-references EMBL:X52135
CLASSIFICATION #superfamily catalase
KEYWORDS oxidoreductase
SUMMARY #length 492 #molecular-weight 56859 #checksum 5891
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

SSLTVGPRGQYLLEDYHLVEKLANFDRERIPERVVHARGASAKGFFDVTHDISHLTCADFLRAPGVQTPVIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVKFYTREGNFDLVGNNFP 



9. US-08-249-182-10 (1-12) 

S07124 catalase (EC 1.11.1.6) - sweet potato

ENTRY S07124 #type complete
TITLE catalase (EC 1.11.1.6) - sweet potato
ORGANISM #formal_name Ipomoea batatas #common_name sweet potato
DATE 29-Jan-1993 #sequence_revision 29-Jan-1993 #text_change
 28-Apr-1993
ACCESSIONS S07124
REFERENCE S07124
 #authors Sakajo, S.; Nakamura, K.; Asahi, T.
 #journal Eur. J. Biochem. (1987) 165:437-442
 #title Molecular cloning and nucleotide sequence of full-length cDNA
 for sweet potato catalase mRNA.
 #cross-references MUID:87246622
 #accession S07124
 ##molecule_type mRNA
 ##residues 1-492 ##label SAK
 ##cross-references EMBL:X05549
CLASSIFICATION #superfamily catalase
KEYWORDS heme; heterotetramer; oxidoreductase
SUMMARY #length 492 #molecular-weight 56985 #checksum 8615
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

CALTVGSRGPILLEDYHLVEKI8NFTRERIPERVVHARGASAKGFFEVTHDITHLTCADFLRAPGV8TPLIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETIRDPRGFAVKMYTRGGNWDLVGNNFP 
110 120 130 140 



10. US-08-249-182-10 (1-12) 

CSSY catalase (EC 1.11.1.6) - soybean

ENTRY CSSY #type complete
TITLE catalase (EC 1.11.1.6) - soybean
ORGANISM #formal_name Glycine max #common_name soybean
DATE 31-Dec-1992 #sequence_revision 31-Dec-1992 #text_change
 30-Jun-1993
ACCESSIONS S20999
REFERENCE S20999
 #authors Allen, R.
 #submission submitted to the EMBL Data Library, May 1992
 #accession S20999
 ##molecule_type DNA
 ##residues 1-492 ##label ALL
 ##cross-references EMBL:Z12021
GENETICS
 #introns 5/3; 38/1; 389/3; 419/3; 442/2; 473/3
CLASSIFICATION #superfamily catalase
KEYWORDS heme; oxidoreductase
FEATURE
 65,104,138 #active_site His, Ser, Asn #status predicted\
 348 #binding_site heme iron (Tyr) (axial ligand) #status
 predicted
SUMMARY #length 492 #molecular-weight 56847 #checksum 9929

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

SSLTVGSRGPILLEDYHLVEKLANFDRERIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGVGTPLIV 
30 40 50 60 70 80 90 100 

RFSTVIHERGSPETLRDPRGFAVKFYTREGNFDLVGNNFP 
110 120 130 140 



11. US-08-249-182-10 (1-12) 

CSPM catalase (EC 1.11.1.6) - garden pea

ENTRY CSPM #type complete
TITLE catalase (EC 1.11.1.6) - garden pea
ORGANISM #formal_name Pisum sativum #common_name garden pea
DATE 31-Mar-1993 #sequence_revision 31-Mar-1993 #text_change
 30-Jun-1993
ACCESSIONS S18346; S15559
REFERENCE S18346
 #authors Isin, S.H.; Allen, R.D.
 #journal Plant Mol. Biol. (1991) 17:1263-1265
 #title Isolation and characterization of a pea catalase cDNA.
 #cross-references MUID:92032793
 #accession S18346
 ##molecule_type mRNA
 ##residues 1-494 ##label ISI
 ##cross-references EMBL:X60169
CLASSIFICATION #superfamily catalase
KEYWORDS heme; oxidoreductase; peroxisome; tetramer
FEATURE
 65,104,138 #active_site His, Ser, Asn #status predicted\
 348 #binding_site heme iron (Tyr) (axial ligand) #status
 predicted
SUMMARY #length 494 #molecular-weight 57344 #checksum 6516
SEQUENCE

Initial Score = 7 Optimized Score = 8 Significance = 4.86
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|| |||  || |

SSLTVGSRGPILLEDYHLVEKLA8FDRERIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGV8TPVIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVKFYTREGNYDLVGNNFP 
110 120 130 140 



12. US-08-249-182-10 (1-12) 

S35908 ATP-dependent Clp proteinase (EC 3.4.21.-) chain c

ENTRY S35908 #type complete
TITLE ATP-dependent Clp proteinase (EC 3.4.21.-) chain clpL -
 Lactococcus lactis subsp. lactis plasmid pUCL22
ORGANISM #formal_name Lactococcus lactis subsp. lactis
DATE 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change
 03-Feb-1994
REFERENCE S35907
 #authors Huang, D.C.; Huang, X.F.; Novel, G.; Novel, M.
 #journal Mol. Microbiol. (1993) 7:957-965
 #title Two genes present on a transposon-like structure in
 Lactococcus lactis are involved in a Clp-family proteolytic
 activity.
 #accession S35908
 ##molecule_type DNA
 ##residues 1-763 ##label HUA
 ##cross-references EMBL:X62333
GENETICS
 #gene clpL
 #genome plasmid
KEYWORDS ATP binding; hydrolase
SUMMARY #length 763 #molecular-weight 86018 #checksum 3552
SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 4.86
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X       10 X
DIEHLTSLDFFR
|||||  ||

KAADIKRVDRGTKflKIKKTHQKEKITATIDDVAflSVERLTGIPVSDMGANDIEHLKNLDKRLKVMVIGEDEA 
420 430 440 450 460 470 X 480 



VKMVAKAIRRNRAGFSEGDQPKGSFLFVGPTGVGKTELSQ 
490 500 510 520 



13. US-08-249-182-10 (1-12) 

C20554 hemocyanin LpIIa - Atlantic horseshoe crab (fragme

ENTRY C20554 #type fragment
TITLE hemocyanin LpIIa - Atlantic horseshoe crab (fragment)
ORGANISM #formal_name Limulus polyphemus #common_name Atlantic horseshoe crab
DATE 05-Jun-1987 #sequence_revision 05-Jun-1987 #text_change 18-Jun-1993
ACCESSIONS C20554
REFERENCE A90478
#authors Lamy, J.; Lamy, J.; Sizaret, P.Y.; Billiald, P.; Jolles, P.; Jolles, J.; Feldmann, R.J.; Bonaventura, J.
#journal Biochemistry (1983) 22:5573-5583
#title Quaternary structure of Limulus polyphemus hemocyanin.
#accession C20554
##molecule_type protein
##residues 1-20 ##label LAM
SUMMARY #length 20 #checksum 6229
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 3.89
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

X X
DIEHLTSLDFFR
||||||

TVKEKQSRLLPLFEHLTSLP 
10 X 20 
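
The match counts reported for this alignment can be re-derived by laying the 12-residue query over the database fragment without gaps. The following is a minimal Python sketch, not FastDB's own procedure: the function name `compare_ungapped`, the choice of offset, and the truncation of the percentage to a whole number are assumptions inferred from the printed figures (6 matches, 3 mismatches, 66% identity).

```python
def compare_ungapped(query: str, target: str, offset: int) -> tuple[int, int, int]:
    """Count matches and mismatches with query[0] placed at target[offset].

    Query positions that overhang either end of the target are skipped.
    Returns (matches, mismatches, percent_identity); the percentage is
    truncated to an integer, which is an assumption based on the printout.
    """
    matches = mismatches = 0
    for i, residue in enumerate(query):
        j = offset + i
        if 0 <= j < len(target):
            if residue == target[j]:
                matches += 1
            else:
                mismatches += 1
    compared = matches + mismatches
    identity = 100 * matches // compared if compared else 0
    return matches, mismatches, identity


query = "DIEHLTSLDFFR"
target = "TVKEKQSRLLPLFEHLTSLP"  # C20554 fragment as printed above
# Anchor the overlay on the shared run EHLTSL (hypothetical way to pick the offset).
offset = target.index("EHLTSL") - query.index("EHLTSL")
print(compare_ungapped(query, target, offset))  # (6, 3, 66)
```

Only nine query residues overlap the 20-residue fragment, which is why matches and mismatches sum to 9 rather than 12.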



14. US-08-249-182-10 (1-12) 

A20554 hemocyanin chain LpI - Atlantic horseshoe crab (fr

ENTRY A20554 #type fragment
TITLE hemocyanin chain LpI - Atlantic horseshoe crab (fragment)
ORGANISM #formal_name Limulus polyphemus #common_name Atlantic horseshoe crab
DATE 05-Jun-1987 #sequence_revision 05-Jun-1987 #text_change 18-Jun-1993
ACCESSIONS A20554
REFERENCE A90478
#authors Lamy, J.; Lamy, J.; Sizaret, P.Y.; Billiald, P.; Jolles, P.; Jolles, J.; Feldmann, R.J.; Bonaventura, J.
#journal Biochemistry (1983) 22:5573-5583
#title Quaternary structure of Limulus polyphemus hemocyanin.
#accession A20554
##molecule_type protein
##residues 1-24 ##label LAM
SUMMARY #length 24 #checksum 2682
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 3.89
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
||||||

TIKKKSASILALFEHLTSLPKQHI 
10 X 20 X 



15. US-08-249-182-10 (1-12) 

F20554 hemocyanin LpIV - Atlantic horseshoe crab (fragmen

ENTRY F20554 #type fragment
TITLE hemocyanin LpIV - Atlantic horseshoe crab (fragment)
ORGANISM #formal_name Limulus polyphemus #common_name Atlantic horseshoe crab
DATE 05-Jun-1987 #sequence_revision 05-Jun-1987 #text_change 18-Jun-1993
ACCESSIONS F20554
REFERENCE A90478
#authors Lamy, J.; Lamy, J.; Sizaret, P.Y.; Billiald, P.; Jolles, P.; Jolles, J.; Feldmann, R.J.; Bonaventura, J.
#journal Biochemistry (1983) 22:5573-5583
#title Quaternary structure of Limulus polyphemus hemocyanin.
#accession F20554
##molecule_type protein
##residues 1-26 ##label LAM
SUMMARY #length 26 #checksum 6863
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 3.89
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
||||||

TLKEKQDRILVLFEHLTSLTKHQLPE 
10 X 20 X 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_10s.res made by on Thu 22 Sep 94 10:55:09-PDT.

Query sequence being compared: US-08-249-182-10 (1-12)
Number of sequences searched: 36000
Number of scores above cutoff: 4217

Results of the initial comparison of US-08-249-182-10 (1-12) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) against SCORE, with a second horizontal axis in STDEV units]



PARAMETERS

Similarity matrix         Unitary    K-tuple                2
Mismatch penalty          1          Joining penalty       20
Gap penalty               1.00       Window size            5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0
Initial scores to save    40         Alignments to save    15
Optimized scores to save  0          Display context       50



SEARCH STATISTICS

Scores:     Mean           Median         Standard Deviation
            2              3              0.94

Times:      CPU            Total Elapsed
            00:00:51.89    00:00:57.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 4217

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
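
The printed significance values are consistent with the initial score expressed in standard deviations above the mean score of the search. The Python sketch below illustrates that reading under stated assumptions: the function name `significance` is hypothetical, and truncation (rather than rounding) to two decimals is inferred from the printed figures, not taken from FastDB documentation. The inputs are the mean (2) and standard deviation (0.94) reported under SEARCH STATISTICS for this search.

```python
import math


def significance(score: float, mean: float, stdev: float) -> float:
    """Return (score - mean) / stdev, truncated to two decimal places.

    Truncation is an assumption: (7 - 2) / 0.94 is about 5.319, and the
    report prints 5.31 rather than the rounded 5.32.
    """
    z = (score - mean) / stdev
    return math.floor(z * 100) / 100


# Mean and standard deviation as reported for the Swiss-Prot search.
print(significance(7, 2, 0.94))  # 5.31
print(significance(6, 2, 0.94))  # 4.25
```

Under this reading, an initial score of 7 falls in the "5 standard deviations above mean" band and a score of 6 in the "4 standard deviations" band.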



A 100% identical sequence to the query sequence was not found. 



The list of best scores is:

                                                    Init.  Opt.
Sequence Name   Description                  Length Score Score   Sig. Frame

**** 5 standard deviations above mean ****
 1. RIR2_HSVEB  RIBONUCLEOSIDE-DIPHOSPHATE RE   321     7     7   5.31   0
 2. CYB_PARTE   CYTOCHROME B (EC 1.10.2.2).     391     7     7   5.31   0
 3. CAT2_MAIZE  CATALASE ISOZYME 2 (EC 1.11.1   491     7     8   5.31   0
 4. CATA_SOYBN  CATALASE (EC 1.11.1.6).         492     7     8   5.31   0
 5. CATA_IPOBA  CATALASE (EC 1.11.1.6).         492     7     8   5.31   0
 6. CAT1_LYCES  CATALASE ISOZYME 1 (EC 1.11.1   492     7     8   5.31   0
 7. CAT1_GOSHI  CATALASE ISOZYME 1 (EC 1.11.1   492     7     8   5.31   0
 8. CATA_PEA    CATALASE (EC 1.11.1.6).         494     7     8   5.31   0
**** 4 standard deviations above mean ****
 9. VG74_BPML5  GENE 74 PROTEIN (GP74).          72     6         4.25   0
10. YSCA_BACSU  HYPOTHETICAL 13.6 KD PROTEIN    117     6         4.25   0
11. VG04_VARV   PROTEIN G4.                     124     6         4.25   0
12. VG04_VACCC  PROTEIN G4.                     124     6         4.25   0
13. YME1_BACSU  HYPOTHETICAL 17.2 KD PROTEIN    145     6         4.25   0
14. YKA8_CAEEL  HYPOTHETICAL 27.3 KD PROTEIN    239     6         4.25   0
15. APPY_ECOLI  APPY PROTEIN (M5 POLYPEPTIDE)   243     6         4.25   0
16. CHMU_YEAST  CHORISMATE MUTASE (EC 5.4.99.   256     6         4.25   0
17. HEM4_BACSU  PUTATIVE UROPORPHYRINOGEN-III   262     6         4.25   0
18. ARAC_SALTY  ARABINOSE OPERON REGULATORY P   281     6         4.25   0
19. ARAC_CITFR  ARABINOSE OPERON REGULATORY P   281     6         4.25   0
20. YL44_KLULA  HYPOTHETICAL 31.8 KD PROTEIN    286     6         4.25   0
21. CENA_MOUSE  CENTROSOMIN A.                  289     6         4.25   0
22. ARAC_ECOLI  ARABINOSE OPERON REGULATORY P   292     6         4.25   0
23. TYSY_PNECA  THYMIDYLATE SYNTHASE (EC 2.1.   297     6         4.25   0
25. OIF_BOVIN   OSTEOINDUCTIVE FACTOR PRECURS   299     6         4.25   0
26. ANX3_HUMAN  ANNEXIN III (LIPOCORTIN III)    323     6         4.25   0
27. ANX3_RAT    ANNEXIN III (LIPOCORTIN III)    324     6         4.25   0
28. VS09_ROTEL  GLYCOPROTEIN VP7 (SEROTYPE-SP   326     6         4.25   0
29. VG16_BPPZA  ENCAPSIDATION PROTEIN (LATE P   332     6         4.25   0
30. VG16_BPPH2  ENCAPSIDATION PROTEIN (LATE P   332     6         4.25   0
31. RIR2_HSV23  RIBONUCLEOSIDE-DIPHOSPHATE RE   337     6         4.25   0
32. ANX2_MOUSE  ANNEXIN II (LIPOCORTIN II) (C   338     6         4.25   0
33. ANX2_HUMAN  ANNEXIN II (LIPOCORTIN II) (C   338     6         4.25   0
34. ANX2_CHICK  ANNEXIN II (LIPOCORTIN II) (C   338     6         4.25   0
35. ANX2_BOVIN  ANNEXIN II (LIPOCORTIN II) (C   338     6         4.25   0
36. RIR2_HSV1K  RIBONUCLEOSIDE-DIPHOSPHATE RE   339     6         4.25   0
37. ANXB_XENLA  ANNEXIN II TYPE I (LIPOCORTIN   339     6         4.25   0
38. ANX2_XENLA  ANNEXIN II TYPE II (LIPOCORTI   339     6         4.25   0
39. RIR2_HSV11  RIBONUCLEOSIDE-DIPHOSPHATE RE   340     6         4.25   0
40. YEJE_ECOLI  HYPOTHETICAL 38.1 KD PROTEIN    341     6         4.25   0


1. US-08-249-182-10 (1-12) 

RIR2_HSVEB RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE SMALL CHAIN (

ID RIR2_HSVEB STANDARD; PRT; 321 AA.
AC P28847;
DT 01-DEC-1992 (REL. 24, CREATED)
DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE SMALL CHAIN (EC 1.17.4.1)
DE (RIBONUCLEOTIDE REDUCTASE).
GN 20.
OS EQUINE HERPESVIRUS TYPE 1 (STRAIN AB4P) (EHV-1).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE; ALPHAHERPESVIRINAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 92295566
RA TELFORD E.A.R., WATSON M.S., MCBRIDE K., DAVISON A.J.;
RL VIROLOGY 189:304-316(1992).
CC -!- FUNCTION: PROVIDES THE PRECURSORS NECESSARY FOR DNA SYNTHESIS.
CC -!- CATALYTIC ACTIVITY: 2'DEOXYRIBONUCLEOSIDE DIPHOSPHATE + OXIDIZED
CC THIOREDOXIN + H(2)O = RIBONUCLEOSIDE DIPHOSPHATE + REDUCED
CC THIOREDOXIN.
CC -!- PATHWAY: FIRST REACTION IN THE DNA REPLICATION PATHWAY.
CC -!- SUBUNIT: HETERODIMER OF A LARGE AND A SMALL CHAIN.
CC -!- COFACTOR: CONTAINS TWO IRON IONS.
CC -!- SIMILARITY: HIGH TO OTHER EUKARYOTIC, PROKARYOTIC, AND VIRAL
CC RIBONUCLEOSIDE DIPHOSPHATE REDUCTASE SMALL CHAINS.
DR EMBL; M86664; HEHSECOMG.
DR PIR; C36797; WMBEA1.
DR PROSITE; PS00368; RIBORED_SMALL.
KW OXIDOREDUCTASE; DNA REPLICATION; IRON; EARLY PROTEIN.
FT METAL 78 78 IRON 1 (BY SIMILARITY).
FT METAL 108 108 IRON 1 AND 2 (BY SIMILARITY).
FT METAL 111 111 IRON 1 (BY SIMILARITY).
FT METAL 171 171 IRON 2 (BY SIMILARITY).
FT METAL 205 205 IRON 2 (BY SIMILARITY).
FT METAL 208 208 IRON 2 (BY SIMILARITY).
FT ACT_SITE 115 115 BY SIMILARITY.
SQ SEQUENCE 321 AA; 36017 MW; 522774 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.31
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
||||| ||



no icnsncHRL i HCLSLflunrr 1 1 rcvruicnLnsLsvnnnHLU i uir isuviinuvhivl i rntncr mrLr h 
10 20 30 40 50 60 70 



FLSAADDLVNLNLGDLSA 
80 90 



2. US-08-249-182-10 (1-12) 

CYB_PARTE CYTOCHROME B (EC 1.10.2.2).

ID CYB_PARTE STANDARD; PRT; 391 AA.
AC P15585;
DT 01-APR-1990 (REL. 14, CREATED)
DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT 01-MAR-1992 (REL. 21, LAST ANNOTATION UPDATE)
DE CYTOCHROME B (EC 1.10.2.2).
GN COB OR CYTB.
OS PARAMECIUM TETRAURELIA.
OG MITOCHONDRION.
OC EUKARYOTA; PROTOZOA; CILIOPHORA; CILIATA; HOLOTRICHA; HYMENOSTOMATIDA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=STOCK 51;
RM 90174913
RA PRITCHARD A.E., SEILHAMER J.J., MAHALINGAM R., SABLE C.L.,
RA VENUTI S.E., CUMMINGS D.J.;
RL NUCLEIC ACIDS RES. 18:173-180(1990).
CC -!- FUNCTION: COMPONENT OF THE UBIQUINOL-CYTOCHROME C REDUCTASE
CC COMPLEX (COMPLEX III OR CYTOCHROME B-C1 COMPLEX), WHICH IS A
CC RESPIRATORY CHAIN THAT GENERATES AN ELECTROCHEMICAL POTENTIAL
CC COUPLED TO ATP SYNTHESIS.
CC -!- CATALYTIC ACTIVITY: QH(2) + 2 FERRICYTOCHROME C = Q +
CC 2 FERROCYTOCHROME C.
CC -!- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B-C1 ARE: CYTOCHROME B,
CC CYTOCHROME C1 AND THE RIESKE PROTEIN.
CC -!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY
CC BOUND TO THE PROTEIN.
DR EMBL; X15917; MIPAGEN.
DR PIR; S07743; S07743.
DR PROSITE; PS00192; CYTOCHROME_B_HEME.
DR PROSITE; PS00193; CYTOCHROME_B_QO.
KW ELECTRON TRANSPORT; MITOCHONDRION; RESPIRATORY CHAIN; TRANSMEMBRANE;
KW HEME.
FT METAL 72 72 IRON 1 (HEME B562 AXIAL LIGAND).
FT METAL 86 86 IRON 2 (HEME B566 AXIAL LIGAND).
FT METAL 173 173 IRON 2 (HEME B562 AXIAL LIGAND).
FT METAL 187 187 IRON 1 (HEME B566 AXIAL LIGAND).
SQ SEQUENCE 391 AA; 46044 MW; 895845 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.31
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
III I III 

KNLRVSFHEVFSLFGFFTFHT 1 1 VQLVSGTML AFSSVPEPML I PTVRDEED I EDL YTDDFFWLHERGVDL I F 
10 20 30 40 50 60 70 80 

IFSYFHLLRKLYLNVFDLETEASWKSGVFSFLVFQVVVFF 
90 100 110 120 



3. US-08-249-182-10 (1-12) 

CAT2_MAIZE CATALASE ISOZYME 2 (EC 1.11.1.6).

ID CAT2_MAIZE STANDARD; PRT; 491 AA.
AC P12365;
DT 01-OCT-1989 (REL. 12, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE CATALASE ISOZYME 2 (EC 1.11.1.6).
GN CAT2.
OS ZEA MAYS (MAIZE).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE;
OC CYPERALES; GRAMINEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=W64A;
RA GUAN L., RUZSA S., SKADSEN R.W., SCANDALIOS J.G.;
RL PLANT PHYSIOL. 96:1379-1381(1991).
RN [2]
RP PRELIMINARY SEQUENCE FROM N.A.
RC STRAIN=R6-67;
RM 88016183
RA BETHARDS L.A., SKADSEN R.W., SCANDALIOS J.G.;
RL PROC. NATL. ACAD. SCI. U.S.A. 84:6830-6834(1987).
RN [3]
RP REVISIONS TO C-TERMINAL SEQUENCE.
RM 90370897
RA BETHARDS L.A., SKADSEN R.W., SCANDALIOS J.G.;
RL PROC. NATL. ACAD. SCI. U.S.A. 87:6927-6927(1990).
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE AND OTHER ACTIVE OXYGEN SPECIES.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL OR CYTOPLASMIC.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; X54819; ZMCAT2R.
DR EMBL; J02976; ZMCAT2.
DR PIR; A36062; A36062.
DR PIR; S18819; S18819.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE; PEROXISOME;
KW MULTIGENE FAMILY.
FT ACT_SITE 64 64 BY SIMILARITY.
FT ACT_SITE 137 137 BY SIMILARITY.
FT BINDING 347 347 PROXIMAL HEME LIGAND
FT VARIANT 296 296 T -> R.
SQ SEQUENCE 491 AA; 56465 MW; 1188896 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
|| ||| || |

DSSLTVGARGPILLEDYHCEKLANFDRERIPERVVHARGASAKGFFEVTHDITHLTCADFLRAPGV8TPVIV
30 40 50 60 70 80 90 100

RFSTVIHERGSPETLRDPRGFAVKFYTREGNWDLVGNNFP
110 120 130 140

4. US-08-249-182-10 (1-12) 

CATA_SOYBN CATALASE (EC 1.11.1.6).

ID CATA_SOYBN STANDARD; PRT; 492 AA.
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE CATALASE (EC 1.11.1.6).
OS GLYCINE MAX (SOYBEAN).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; FABALES;
OC FABACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CV. DttPL 415;
RA ISIN S.H., ALLEN R.D.;
RL PLANT MOL. BIOL. 0:0-0(1992).
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; Z12021; GMCATALG.
DR PIR; S20999; CSSY.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE;
KW PEROXISOME.
FT ACT_SITE 65 65 BY SIMILARITY.
FT ACT_SITE 138 138 BY SIMILARITY.
FT BINDING 348 348 PROXIMAL HEME LIGAND (BY SIMILARITY).
SQ SEQUENCE 492 AA; 56847 MW; 1224278 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II III II I 

SSLTVGSRGPILLEDYHLVEKLANFDRERIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGV8TPLIV 
30 40 50 60 70 80 90 100 

RFSTV I HERGSPETLRDPRGFAVKFYTREGNFDLVGNNFP 
110 120 130 140



5. US-08-249-182-10 (1-12) 

CATA_IPOBA CATALASE (EC 1.11.1.6).

ID CATA_IPOBA STANDARD; PRT; 492 AA.
AC P07145;
DT 01-APR-1988 (REL. 07, CREATED)
DT 01-APR-1988 (REL. 07, LAST SEQUENCE UPDATE)
DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)
DE CATALASE (EC 1.11.1.6).
OS IPOMOEA BATATAS (SWEET POTATO) (BATATE).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;
OC SOLANALES; CONVOLVULACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CV. KOKEI NO. 14;
RM 87246622
RA SAKAJO S., NAKAMURA K., ASAHI T.;
RL EUR. J. BIOCHEM. 165:437-442(1987).
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; X05549; IBCATR.
DR PIR; S07124; S07124.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE;
KW PEROXISOME.
FT ACT_SITE 65 65 BY SIMILARITY.
FT ACT_SITE 138 138 BY SIMILARITY.
FT BINDING 348 348 PROXIMAL HEME LIGAND (BY SIMILARITY).
SQ SEQUENCE 492 AA; 56985 MW; 1209173 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X 10 X 

DIEHLTSLDFFR 
II III II I 

CALTVGSRGPI LLED YHL VEK I SNFTRER IPERVVHARGASAKGFFEVTHD I THLTCADFLRAPGVflTPLI V 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETIRDPRGFAVKMYTRGGNHDLVGNNFP 
110 120 130 140 



6. US-08-249-182-10 (1-12) 

CAT1_LYCES CATALASE ISOZYME 1 (EC 1.11.1.6).

ID CAT1_LYCES STANDARD; PRT; 492 AA.
AC P30264;
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE CATALASE ISOZYME 1 (EC 1.11.1.6).
GN CAT1.
OS LYCOPERSICON ESCULENTUM (TOMATO).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;
OC SOLANALES; SOLANACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RA DRORY A., WOODSON W.R.;
RL SUBMITTED (XXX-1992) TO EMBL/GENBANK/DDBJ DATA BANKS.
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; M93719; LECAT1A.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE;
KW PEROXISOME; MULTIGENE FAMILY.
FT ACT_SITE 65 65 BY SIMILARITY.
FT ACT_SITE 138 138 BY SIMILARITY.
FT BINDING 348 348 PROXIMAL HEME LIGAND (BY SIMILARITY).
SQ SEQUENCE 492 AA; 56505 MW; 1223422 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II III II I 

SSLTVGPRGPVLLEDYYLIEKLATFDREKIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGAGTPVIC 
30 40 50 60 70 80 90 100

RFSTVVHERGSPESIRDIRGFAVKFYTREGNFDLVGNNVP 
110 120 130 140 



7. US-08-249-182-10 (1-12) 

CAT1_GOSHI CATALASE ISOZYME 1 (EC 1.11.1.6).

ID CAT1_GOSHI STANDARD; PRT; 492 AA.
AC P17598;
DT 01-AUG-1990 (REL. 15, CREATED)
DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE CATALASE ISOZYME 1 (EC 1.11.1.6).
GN CAT1.
OS GOSSYPIUM HIRSUTUM (UPLAND COTTON).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;
OC MALVALES; MALVACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CV. DELTAPINE 62; TISSUE=COTYLEDON;
RM 90304227
RA NI W., TURLEY R.B., TRELEASE R.N.;
RL BIOCHIM. BIOPHYS. ACTA 1049:219-222(1990).
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL.
CC -!- THERE ARE AT LEAST FIVE ISOZYMES OF CATALASE IN COTTON.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; X52135; GHCAT1.
DR PIR; S10770; S10770.
DR PIR; S10395; S10395.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE;
KW PEROXISOME; MULTIGENE FAMILY.
FT ACT_SITE 65 65 BY SIMILARITY.
FT ACT_SITE 138 138 BY SIMILARITY.
FT BINDING 348 348 PROXIMAL HEME LIGAND (BY SIMILARITY).
SQ SEQUENCE 492 AA; 56859 MW; 1209561 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II III II I 

SSLTVGPRGQYLLEDYHLVEKLANFDRER I PERVVHARGASAKGFFDVTHD I SHLTCADFLRAPGV8TPVI V 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVKFYTREGNFDLVGNNFP 
110 120 130 140 



8. US-08-249-182-10 (1-12) 

CATA_PEA CATALASE (EC 1.11.1.6).

ID CATA_PEA STANDARD; PRT; 494 AA.
AC P25890;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE CATALASE (EC 1.11.1.6).
OS PISUM SATIVUM (GARDEN PEA).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; FABALES;
OC FABACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=LEAF;
RM 92032793
RA ISIN S.H., ALLEN R.D.;
RL PLANT MOL. BIOL. 17:1263-1265(1991).
CC -!- FUNCTION: OCCURS IN ALMOST ALL AEROBICALLY RESPIRING ORGANISMS AND
CC SERVES TO PROTECT CELLS FROM THE TOXIC EFFECTS OF HYDROGEN
CC PEROXIDE.
CC -!- CATALYTIC ACTIVITY: 2 H(2)O(2) = O(2) + 2 H(2)O.
CC -!- COFACTOR: HEME GROUP.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- SUBCELLULAR LOCATION: PEROXISOMAL.
CC -!- SIMILARITY: TO OTHER EUKARYOTIC CATALASES.
DR EMBL; X60169; PSCATAL.
DR PIR; S18346; CSPM.
DR PROSITE; PS00437; CATALASE_1.
DR PROSITE; PS00438; CATALASE_2.
KW OXIDOREDUCTASE; PEROXIDASE; IRON; HEME; HYDROGEN PEROXIDE;
KW PEROXISOME.
FT ACT_SITE 65 65 BY SIMILARITY.
FT ACT_SITE 138 138 BY SIMILARITY.
FT BINDING 348 348 PROXIMAL HEME LIGAND (BY SIMILARITY).
SQ SEQUENCE 494 AA; 57344 MW; 1224857 CN;

Initial Score = 7 Optimized Score = 8 Significance = 5.31
Residue Identity = 66% Matches = 8 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X 10 X 

DIEHLTSLDFFR 
II III II I 

SSLTVGSRGPILLEDYHLVEKLA8FDRERIPERVVHARGASAKGFFEVTHDISHLTCADFLRAPGV8TPVIV 
30 40 50 60 70 80 90 100 



RFSTVIHERGSPETLRDPRGFAVKFYTREGNYDLVGNNFP 
110 120 130 140 



9. US-08-249-182-10 (1-12) 

VG74_BPML5 GENE 74 PROTEIN (GP74).

ID VG74_BPML5 STANDARD; PRT; 72 AA.
AC Q05289;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GENE 74 PROTEIN (GP74).
GN 74.
OS MYCOBACTERIOPHAGE L5.
OC VIRIDAE; NOT YET CLASSIFIED.
RN [1]
RM 93211282
RA HATFULL G.F., SARKIS G.J.;
RL MOL. MICROBIOL. 7:395-405(1993).
DR EMBL; Z18946; MLCGA.
SQ SEQUENCE 72 AA; 8469 MW; 24914 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X

DIEHLTSLDFFR
||||||

MEFPFGGNTFQCLHLTSLDKVGLWFRWRSRDWQFGHAKPIPEGMDAASVANLWRQFIVGEISHDEYNRLLPN 
10 20 X 30 40 50 60 70 



10. US-08-249-182-10 (1-12) 

YSCA_BACSU HYPOTHETICAL 13.6 KD PROTEIN IN SECA 5'REGION (ORF

ID YSCA_BACSU STANDARD; PRT; 117 AA.
AC P28368;
DT 01-DEC-1992 (REL. 24, CREATED)
DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 13.6 KD PROTEIN IN SECA 5'REGION (ORF1).
OS BACILLUS SUBTILIS.
OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 91192600
RA SADAIE Y., TAKAMATSU H., YAMANE K.;
RL GENE 98:101-105(1991).
DR EMBL; D90218; BSSECA.
DR EMBL; D10279; BSSECA1.
DR PIR; JN0145; JN0145.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 117 AA; 13593 MW; 67264 CN;

Initial Score = 6 Optimized Score = 7 Significance = 4.25
Residue Identity = 58% Matches = 7 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
III III I 

YNA 1 DLATNKLERQI RKHKTKVNRKFREGGSPK YLL ANGLGSDTD I AV9DD IEEEESLD I VR9KRFNLKPMD 
10 20 30 40 50 X 60 X 70 

SEEAILQHNNLGHNFFVFTNAETNLTNVVYRRNDGKYGLI 
80 90 100 110 



11. US-08-249-182-10 (1-12) 
VG04_VARV PROTEIN G4.

ID VG04_VARV STANDARD; PRT; 124 AA.
AC P32994;
DT 01-OCT-1993 (REL. 27, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE PROTEIN G4.
GN G4L OR H4L.
OS VARIOLA VIRUS.
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC ORTHOPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=INDIA-1967 / ISOLATE IND3;
RA SHCHELKUNOV S.N., BLINOV V.M., RESENCHUK S.M., TOTMENIN A.V.,
RA SANDAKHCHIEV L.S.;
RL VIRUS RES. 30:239-258(1993).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=INDIA-1967 / ISOLATE IND3;
RM 93190624
RA SHCHELKUNOV S.N., BLINOV V.M., TOTMENIN A.V., MARENNIKOVA S.S.,
RA KOLYKHALOV A.A., FROLOV I.V., CHIZHIKOV V.E., GYTOROV V.V.,
RA GASHIKOV P.V., BELANOV E.F., BELAVIN P.A., RESENCHUK S.M.,
RA ANDZHAPARIDZE O.G., SANDAKHCHIEV L.S.;
RL VIRUS RES. 27:25-35(1993).
RN [3]
RP COMPLETE GENOME.
RC STRAIN=INDIA-1967 / ISOLATE IND3;
RA BLINOV V.M.;
RL SUBMITTED (NOV-1992) TO EMBL/GENBANK/DDBJ DATA BANKS.
DR EMBL; X67119; VVHINDQKH.
DR EMBL; X69198; VVCGAA.
DR PIR; A36844; A36844.
DR PIR; S33081; S33081.
SQ SEQUENCE 124 AA; 14014 MW; 86978 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II I I II 

MKNVL I I FGKPYCSI CENVSEAVEELKSE YD ILHVD I LSFFLKDGDSSHLGD VKRGTL I GNFAAHLSNY I VS 
10 20 30 40 X 50 60 70 

I FKYNP9TK9HAF VD I NKSL 
80 90 



12. US-08-249-182-10 (1-12) 
VG04_VACCC PROTEIN G4.

ID VG04_VACCC STANDARD; PRT; 124 AA.
AC P21025;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)
DE PROTEIN G4.
GN G4L.
OS VACCINIA VIRUS (STRAIN COPENHAGEN).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC ORTHOPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RM 91021027
RA GOEBEL S.J., JOHNSON G.P., PERKUS M.E., DAVIS S.W., WINSLOW J.P.,
RA PAOLETTI E.;
RL VIROLOGY 179:247-266(1990).
RN [2]
RP COMPLETE GENOME.
RA GOEBEL S.J., JOHNSON G.P., PERKUS M.E., DAVIS S.W., WINSLOW J.P.,
RA PAOLETTI E.;
RL VIROLOGY 179:517-563(1990).
DR EMBL; M35027; PXVACCG.
SQ SEQUENCE 124 AA; 13987 MW; 88525 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II I I II 

MKNVL I IFGKPYCSI CENVSDAVEELKSEYD I LHVD I LSFFLKDGDSSMLGDVKRGTL I GNFAAHLSNYIVS 
10 20 30 40 X 50 60 70 

I FK YNPQTK8N AFVD I NKSL 
80 90 



13. US-08-249-182-10 (1-12) 

YME1_BACSU HYPOTHETICAL 17.2 KD PROTEIN IN MEND 5'REGION (ORF

ID YME1_BACSU STANDARD; PRT; 145 AA.
AC P23972;
DT 01-MAR-1992 (REL. 21, CREATED)
DT 01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE)
DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 17.2 KD PROTEIN IN MEND 5'REGION (ORF1).
OS BACILLUS SUBTILIS.
OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RA ROWLAND B., HILL K., MUELLER J., DRISCOLL J., TABER H.;
RL SUBMITTED (OCT-1991) TO EMBL/GENBANK/DDBJ DATA BANKS.
DR EMBL; M74538; BSMENAQOP.
DR PIR; S27507; S27507.
KW HYPOTHETICAL PROTEIN; MENAQUINONE BIOSYNTHESIS.
SQ SEQUENCE 145 AA; 17159 MW; 108895 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II I I II 

MVTT V8RTFRKK VLH ALHKAKEVNHAvl I SYSRQ I ESLDPLSFFNYGAKKYTGNRFFHSDPESELT I VGLGK 
10 20 30 X 40 X 50 60 70 

EAVFQTNQKNSERYREVFEQWER 
80 90 



14. US-08-249-182-10 (1-12) 

YKA8_CAEEL HYPOTHETICAL 27.3 KD PROTEIN B0303.8 IN CHROMOSOME

ID YKA8_CAEEL STANDARD; PRT; 239 AA.
AC P34259;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 27.3 KD PROTEIN B0303.8 IN CHROMOSOME III.
GN B0303.8.
OS CAENORHABDITIS ELEGANS.
OC EUKARYOTA; METAZOA; ACOELOMATES; NEMATODA; SECERNENTEA; RHABDITIDA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BRISTOL N2;
RA SULSTON J., DU Z., THOMAS K., WILSON R., HILLIER L., STADEN R.,
RA HALLORAN N., GREEN P., THIERRY-MIEG J., QIU L., DEAR S., COULSON A.,
RA CRAXTON M., DURBIN R.K., BERKS M., METZSTEIN M., HAWKINS T.,
RA AINSCOUGH R., WATERSTON R.;
RL NATURE 356:37-41(1992).
DR EMBL; M77697; CEB0303.
DR PIR; S27789; S27789.
DR WORMPEP; B0303.8; CE00562.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 239 AA; 27341 MW; 323313 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0

X 10 X 

DIEHLTSLDFFR 
II II II 

SVRSTFYCENGYFGPICDRRSRTFAPKSDIQTSTPGYQTQVLKFDFKISDDI IIYSSLAFFVLLLI IFNCIL 
120 130 140 150 160 X 170 X 180 

CCYRPKKSQKYLDVSLGSPKVFSICGYSADKSGNTTEYLD 
190 200 210 220 



15. US-08-249-182-10 (1-12) 

APPY_ECOLI APPY PROTEIN (M5 POLYPEPTIDE).

ID APPY_ECOLI STANDARD; PRT; 243 AA.
AC P05052;
DT 13-AUG-1987 (REL. 05, CREATED)
DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT 01-NOV-1991 (REL. 20, LAST ANNOTATION UPDATE)
DE APPY PROTEIN (M5 POLYPEPTIDE).
GN APPY.
OS ESCHERICHIA COLI.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 89155479
RA ATLUNG T., NIELSEN A., HANSEN F.G.;
RL J. BACTERIOL. 171:1683-1691(1989).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=JM 83;
RM 87231084
RA KEMP E.H., MINTON N.P., MANN N.H.;
RL NUCLEIC ACIDS RES. 15:3924-3924(1987).
CC -!- FUNCTION: INDUCES THE SYNTHESIS OF ACID PHOSPHATASE (APPA) AND
CC SEVERAL OTHER POLYPEPTIDES DURING THE DECELERATION PHASE OF
CC GROWTH. IT ALSO ACTS AS A TRANSCRIPTIONAL REPRESSOR FOR ONE GROUP
CC OF PROTEINS THAT ARE SYNTHESIZED PREFERENTIALLY IN EXPONENTIAL
CC GROWTH AND FOR ONE GROUP SYNTHESIZED ONLY IN THE STATIONARY PHASE.
CC -!- SIMILARITY: BELONGS TO THE ARAC/XYLS FAMILY OF TRANSCRIPTIONAL
CC REGULATORS.
DR EMBL; M24530; ECAPPYAA.
DR EMBL; Y00138; ECM5.
DR PIR; A29260; BVECM5.
DR PIR; JS0110; BVECAY.
DR ECOGENE; EG10050; APPY.
DR PROSITE; PS00041; HTH_ARAC_FAMILY.
KW TRANSCRIPTION REGULATION; DNA-BINDING; REPRESSOR.
FT DNA_BIND 149 168 H-T-H MOTIF (BY SIMILARITY).
FT CONFLICT 25 25 F -> FKKNSLF (IN REF. 2).

Initial Score = 6 Optimized Score = 6 Significance = 4.25
Residue Identity = 50% Matches = 6 Mismatches = 6
Gaps = 0 Conservative Substitutions = 0



X 10 X 

DIEHLTSLDFFR 
II I I II 

MDYVCSVVF ICSSFDL I INRRVI SFI VSDK IRRELPVCPSKLRI VDI DKKTCLSFFIDVNNELPGKFTLDKN 
10 20 30 40 X 50 X 60 70 

GY I AEEEPPLSL VFSLFEG I K I ADSHSLHLKERLC 
80 90 100 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_11a.res made by on Thu 22 Sep 94 10:36:47-PDT.

Query sequence being compared: US-08-249-182-11 (1-23)
Number of sequences searched: 42145
Number of scores above cutoff: 4803

Results of the initial comparison of US-08-249-182-11 (1-23) with:
Data bank : A-GeneSeq 15, all entries
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STDEV 0 2 3 
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PARAMETERS 




Similarity aatrix 


Unitary 


K-tuple 


2 


Hi snatch penalty 


1 


Joining penalty 


20 


Gap penalty 


1.00 


Window size 


5 


Gap size penalty 


0.05 






Cutoff score 


0 






Randomization group 


0 






Initial scores to save 40 


Alignments to save 


15 


Optica zed scores to save 0 


Display context 


50 




SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
            1       3         1.52

Times:      CPU            Total Elapsed
            00:00:29.95    00:00:39.00

Number of residues:             5287517
Number of sequences searched:   42145
Number of scores above cutoff:  4803



Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
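
The significance values in the list of best scores are consistent with a z-score: the number of standard deviations by which an initial score exceeds the mean of all scores. A minimal Python sketch, assuming that definition and using the rounded mean (1) and standard deviation (1.52) printed under SEARCH STATISTICS; the function name is illustrative and is not part of FastDB:

```python
def significance(initial_score, mean, std_dev):
    """Return how many standard deviations a score lies above the mean."""
    return (initial_score - mean) / std_dev

# Rounded statistics printed in the A-GeneSeq report (assumed inputs).
MEAN, STD_DEV = 1, 1.52

for score in (22, 9, 8, 7):
    print(score, round(significance(score, MEAN, STD_DEV), 2))
```

Because the printed mean and standard deviation are themselves rounded, the results agree with the reported 13.81, 5.26, 4.60 and 3.95 only to within about 0.01.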



A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                   Init.  Opt.
Sequence Name Description                   Length Score Score   Sig. Frame

**** 13 standard deviations above mean ****

  1. R37453   Autotaxin peptide ATX 103.        23    22    22  13.81     0

**** 5 standard deviations above mean ****

  2. P82098   Protein 2 encoded by amylolyt     33     9     9   5.26     0

**** 4 standard deviations above mean ****

  3. R25462   glgA.                            477     8     8   4.60     0
  4. R06262   Calf acetylcholine receptor (    520     8     8   4.60     0
  5. R06261   Mouse acetylcholine receptor     520     8     8   4.60     0
  6. R33646   Rabbit pre-pro serum albumin.    608     8     8   4.60     0
  7. R11255   Murine IL-4 receptor.            810     8     8   4.60     0
  8. R04574   Derived amino acid sequence o    810     8     9   4.60     0
  9. R11993   Glutamate receptor 5-1.          920     8     8   4.60     0
 10. R32882   Cardiac adenylyl cyclase type   1184     8     8   4.60     0
 11. R40227   ACVS.                           3639     8     8   4.60     0
 12. R13896   ACV synthetase.                 3712     8     8   4.60     0






**** 3 standard deviations above mean ****

 13. P90430   Cyclophilin.                      42     7     8   3.95     0
 14. R32203   Apple fruit PPO pSR7.             68     7     7   3.95     0
 15. R25119   Non-A, Non-B Hepatitis Virus      72     7     8   3.95     0
 16. P40026   [illegible]                        ?     7     7   3.95     0
 17. R12603   [illegible]                        ?     7     7   3.95     0
 18. P60578   [illegible]                        ?     7     7   3.95     0
 19. P70277   [illegible]                        ?     7     7   3.95     0
 20. R12521   B cell differentiation factor      ?     7     9   3.95     0
 21. P90363   [illegible]                        ?     7     ?   3.95     0
 22. R05026   Beta subunit of rat high affi      ?     7     7   3.95     0
 23. R14770   Beta subunit of high affinity      ?     7     7   3.95     0
 24. R22997   [illegible]                        ?     7     7   3.95     0
 25. P71672   Human serine protease.             ?     7     7   3.95     0
 26. R06639   Orotidine 5' phosphate decarb      ?     7     7   3.95     0
 27. R06640   Orotidine 5' phosphate decarb      ?     7     7   3.95     0
 28. R13467   [illegible]                        ?     7     ?   3.95     0
 29. R03339   [illegible]                        ?     7     9   3.95     0
 30. R03340   [illegible]                        ?     7     ?   3.95     0
 31. P60326   Interleukin-1 gene product.      295     7     7   3.95     0
 32. R37346   PEP [illegible]                  313     7     7   3.95     0
 33. P93143   Sequence encoded by ORF2 of p    314     7     7   3.95     0
 34. R33279   43 kD endoflagellum sheath pr    320     7     9   3.95     0
 35. R25116   Non-A, Non-B Hepatitis Virus     336     7     9   3.95     0
 36. R27163   CD2 binding LFA-3-Ig fusion p    347     7     7   3.95     0
 37. R42424   Rat gustducin alpha subunit.     354     7     7   3.95     0
 38. R25698   Murine adrenergic beta-3 rece    388     7     7   3.95     0
 39. R07130   H20B receptor.                   392     7     7   3.95     0
 40. R32501   Beta-adrenergic receptor.        400     7     7   3.95     0



1. US-08-249-182-11 (1-23) 

R37453 Autotaxin peptide ATX 103. 

ID R37453 standard; peptide; 23 AA. 

AC R37453; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 103. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice; 

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M.

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 103. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 23 AA;

SQ 0 A; 1 R; 2 N; 2 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 2 G; 0 H;

SQ 1 I; 4 L; 0 K; 0 M; 1 F; 1 P; 1 S; 4 T; 0 W; 1 Y; 2 V;

Initial Score = 22 Optimized Score = 22 Significance = 13.81
Residue Identity = 95% Matches = 22 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X       10        20  X
TEFLSNYLTNVDDITLVPETLGR
|||||||||||||||||| ||||
TEFLSNYLTNVDDITLVPGTLGR
X       10        20  X
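
The two SQ composition lines of this record can be cross-checked against the database sequence shown in the alignment. A small Python sketch, assuming the one-letter codes are tallied in the order printed on the SQ lines (A R N D B C Q E Z G H, then I L K M F P S T W Y V); `composition` is a hypothetical helper, not a FastDB or GeneSeq function:

```python
from collections import Counter

# Column order of the two SQ composition lines in each record.
SQ_ORDER = "ARNDBCQEZGH" "ILKMFPSTWYV"

def composition(sequence):
    """Count each residue code, reporting zero for codes that are absent."""
    counts = Counter(sequence)
    return {code: counts.get(code, 0) for code in SQ_ORDER}

r37453 = "TEFLSNYLTNVDDITLVPGTLGR"  # database sequence from the alignment
comp = composition(r37453)
print(sum(comp.values()), comp["L"], comp["T"], comp["G"])
```

For R37453 this gives 23 residues in total, with 4 L, 4 T and 2 G, in line with the composition printed in the record.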



2. US-08-249-182-11 (1-23) 

P82098 Protein 2 encoded by amylolytic enzyme expression/

ID P82098 standard; protein; 33 AA. 

AC P82098; 

DT 29-OCT-1990 (first entry)

DE Protein 2 encoded by amylolytic enzyme expression/secretion sequence

KW amylolytic enzyme; protein secretion; ss.

OS synthetic. 

PN J63219381-A. 

PD 13-SEP-1988. 

PF 09-MAR-1987; 052072.

PR 09-MAR-1987; JP-052072.

PA (OJIP) OJI Paper KK; (OJIC-) OJI Corn Starch KK.

DR WPI; 88-297738/42. 

PT DNA contg regions relating to expression and secretion of protein - 

PT comprise promoter region relating to amylolytic enzyme gene,

PT ribosome region, region relating to secretion of protein expression

PT product

PS Claim 6; page 510; 14pp; Japanese.

CC DNA includes a promoter region for expression of the amylolytic

CC enzyme of an alkali Bacillus, the ribosome binding site and the

CC region responsible for secretion of the enzyme.

CC See also N82034, P82097 and N82033.

SQ Sequence 33 AA;

SQ 2 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 2 G; 0 H;

SQ 3 I; 6 L; 3 K; 2 M; 3 F; 1 P; 2 S; 3 T; 0 W; 0 Y; 3 V;

Initial Score = 9 Optimized Score = 9 Significance = 5.26
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 

       X       10        20  X
       TEFLSNYLTNVDDITLVPETLGR
         |||  |     ||  | ||
MKMRTGKKGFLSILLAFLLVITSIPFTLVDVEA
        10        20        30
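
With the unitary similarity matrix, an ungapped alignment score is simply the number of identical residue pairs. The sketch below is a brute-force illustration in Python, not FastDB's k-tuple search: it slides the query along the database sequence and keeps the offset with the most identities. The target string assumes that the two leading residues printed as H are M, as the SQ composition line of this record (2 M, 0 H) indicates.

```python
def best_ungapped(query, target):
    """Return (offset, matches) for the best gap-free placement of query in target."""
    best = (0, -1)
    for offset in range(len(target) - len(query) + 1):
        window = target[offset:offset + len(query)]
        # Unitary matrix: each identical pair scores 1, everything else 0.
        matches = sum(q == t for q, t in zip(query, window))
        if matches > best[1]:
            best = (offset, matches)
    return best

query = "TEFLSNYLTNVDDITLVPETLGR"
p82098 = "MKMRTGKKGFLSILLAFLLVITSIPFTLVDVEA"
print(best_ungapped(query, p82098))
```

The best placement starts the query at residue 8 of P82098 (zero-based offset 7) and yields 9 identities, the same count as the Matches value reported for this record.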



3. US-08-249-182-11 (1-23) 
R25462 glgA. 

ID R25462 standard; Protein; 477 AA. 

AC R25462; 

DT 15-JAN-1993 (first entry) 

DE glgA. 

KW Glycogen synthase gene; glgA; polymerase chain reaction; k 12 618;

KW plastid; specific gravity; free sugar content.

OS Escherichia coli.

PN WO9211382-A.

PD 09-JUL-1992.

PF 19-DEC-1991; U09654.

PR 21-DEC-1990; US-632383.

PR 16-JUL-1991; US-731226.

PR 24-JUL-1991; US-735065.

PA (CALJ ) CALGENE INC.

PI Shewmaker CK, Stalker DM;

DR WPI; 92-250101/30.

DR N-PSDB; Q25978.

PT Glycogen biosynthesis enzyme-encoding genes - for prodn. of

PT transgenic plants having modified starch content, e.g. decreased

PT amylase

PS Disclosure; Fig 2; 65pp; English.

CC The sequence given is encoded by the E. coli glycogen synthase gene

CC (glgA) which was generated by polymerase chain reaction from E. coli

CC strain k 12 618. The sequence encoding this polypeptide was used in

CC the construction of an oligonucleotide which also contained an

CC endogenous plant sequence. This construct is useful in studying and

CC manipulating the starch biosynthetic pathway. This enzyme can be

CC targeted to a plastid where starch synthesis occurs. Plants or plant

CC parts which synthesise and store starch may be obtained which have

CC increased or decreased starch content and modified starch related

CC properties, such as specific gravity, free sugar content and/or

CC novel and useful starches, e.g. potato starch with decreased amylose and

CC modified amylopectin.

SQ Sequence 477 AA;

SQ 51 A; 30 R; 11 N; 29 D; 0 B; 3 C; 21 Q; 20 E; 0 Z; 45 G; 16 H;

SQ 17 I; 60 L; 12 K; 10 M; 23 F; 23 P; 25 S; 18 T; 9 W; 20 Y; 34 V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II I II III 

GLEFNGQISFLKAGLYYADHITAVSPTYAREITEPflFAYGHEGLLflQRHREGRLSGVLNGVDEKIHSPETDL 
190 200 210 220 230 240 250 

X 
R 

LLASRYTRDTLEDKAEMKRQL8IAHGLKVDDKVPLFAVVSRLTS8KGLDLV 
260 270 280 290 300 310 



4. US-08-249-182-11 (1-23) 

R06262 Calf acetylcholine receptor (AchR) delta-subunit. 

ID R06262 standard; protein; 520 AA. 

AC R06262; 

DT 07-DEC-1990 (first entry) 

DE Calf acetylcholine receptor (AchR) delta-subunit. 

KW Nicotinic acetyl choline receptor; AchR; TE671; insecticides;

KW Muscle relaxants; anthelmintics.

OS Bos taurus.

PN CA2003459-A.

PD 23-MAY-1990.

PF 21-NOV-1989; 003459.

PR 23-NOV-1988; US-275422.

PA (SALK ) SALK INST FOR BIOL STUD.

PI Lindstrom JM, Schoepfer RD;

DR WPI; 90-231525/31.



Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG
II I I I III 

LTLGLLAALVVCALPGSWGLNEEflRLIQHLFNEKGYDKDLRPVARKEDKVDVALSLTLSNLISLKEVEETLT 
10 20 30 40 50 X 60 70 

X 
R 

TNVWIDHAWVDSRLQWDANDFGNITYLALPPDPtVULPEIVLENNNDGSFQI 
80 90 100 110 120 



6. US-08-249-182-11 (1-23) 

R33646 Rabbit pre-pro serum albumin.

ID R33646 standard; Protein; 608 AA. 

AC R33646;

DT 09-JUL-1993 (first entry)

DE Rabbit pre-pro serum albumin.

KW RSA; large scale prepn.; culture medium; medicine carrier.

OS Oryctolagus cuniculus. 

PN J05038287-A. 

PD 19-FEB-1993. 

PF 10-JUL-1991; 194984. 

PR 10-JUL-1991? JP-194984. 

PA (TOYJ ) TOSOH CORP.

DR WPI; 93-096421/12.

DR N-PSDB; Q38280.

PT Rabbit serum albumin derived from pre pro-serum albumin gene -

PT comprises pre pro-albumin genes expressed by defined

PT polypeptide and DNA sequences

PS Disclosure; Fig 2; 19pp; Japanese.

CC The sequence is that of the rabbit pre-pro serum albumin which

CC may be recombinantly produced on a large scale in e.g. Pichia pastoris.

CC The rabbit serum albumin (RSA) may be used for the prepn. of a

CC culture medium of animal cells and as a carrier for medicines.

SQ Sequence 608 AA;

SQ 56 A; 25 R; 12 N; 43 D; 0 B; 35 C; 14 Q; 56 E; 0 Z; 20 G; 23 H;

SQ 17 I; 65 L; 59 K; 2 M; 28 F; 29 P; 29 S; 28 T; 2 W; 25 Y; 40 V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG

II I II III 
EDYLSVVLNRLCVLHEKTPVSEKVTKCCSESLVDRRPCFSALGPDETYVPKEFNAETFTFHADICTLPETER 
480 490 500 510 520 X 530 540 

X 
R 

KIKKQTALVELVKHKPHATND9LKTVVGEFTALLDKCCSAEDKEACFAVEG 
X 550 560 570 580 590 



7. US-08-249-182-11 (1-23) 

R11255 Murine IL-4 receptor.



ID R11255 standard; Protein; 810 AA.
AC R11255;



PT the effects of agents which affect acetylcholine receptors in 

PT skeletal muscles. 

PS Disclosure; p; English. 

CC Receptors may be used in assay for materials which modify them.

CC They may be produced in substantially pure quantities for use in

CC experimentation, development of insecticides without effect on

CC hMNARs and treatment of parasitic infections. MAbs raised to the

CC peptides may be useful in detection of the structure of MNARs.

CC 24 unidentified residues are due to the poor quality of the

CC sequence reproduction.

SQ Sequence 520 AA;

SQ 29 A; 24 R; 27 N; 25 D; 0 B; 9 C; 17 Q; 33 E; 0 Z; 22 G; 9 H;

SQ 34 I; 61 L; 23 K; 9 M; 27 F; 33 P; 39 S; 22 T; 15 W; 15 Y; 47 V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II I I I III 

LTLGLLAALVVCALPGSUGLNEESRLIQHLFNEKGYDKDLRPVARKEDKVDVALSLTLSNLISLKEVEETLT 
10 20 30 40 50 X 60 70 

X 
R 

TNVWIDHAWVDSRLQHDANDFGNITVLALPPDRVHLPEIVLENNNDGSF8I 
80 90 100 110 120 



5. US-08-249-182-11 (1-23) 

R06261 Mouse acetylcholine receptor (AchR) delta-subunit. 

ID R06261 standard; protein; 520 AA. 

AC R06261; 

DT 07-DEC-1990 (first entry) 

DE Mouse acetylcholine receptor (AchR) delta-subunit. 

KW Nicotinic acetyl choline receptor; AchR; TE671; insecticides;

KW Muscle relaxants; anthelmintics.

OS Mus sp. 

PN CA2003459-A. 

PD 23-MAY-1990. 

PF 21-NOV-1989; 003459.

PR 23-NOV-1988; US-275422.

PA (SALK ) SALK INST FOR BIOL STUD.

PI Lindstrom JM, Schoepfer RD;

DR WPI; 90-231525/31.

PT Human muscle nicotinic acetylcholine receptor - used to assay

PT the effects of agents which affect acetylcholine receptors in

PT skeletal muscles.

PS Disclosure; p; English.

CC Receptors may be used in assay for materials which modify them.

CC They may be produced in substantially pure quantities for use in

CC experimentation, development of insecticides without effect on

CC hMNARs and treatment of parasitic infections. MAbs raised to the

CC peptides may be useful in detection of the structure of MNARs.

CC Unidentified residues are due to the poor quality of the 

CC sequence reproduction. 

SQ Sequence 520 AA; 

SQ 28 A; 25 R; 28 N; 25 D; 0 B; 9 C; 17 Q; 33 E; 0 Z; 23 G; 9 H;

SQ 32 I; 61 L; 22 K; 12 M; 27 F; 33 P; 39 S; 24 T; 10 W; 16 Y; 46 V;

SQ 1 Others;

Initial Score = 8 Optimized Score = 8 Significance = 4.60



DT [illegible]-1991 (first entry)

DE Murine IL-4 receptor. 

KW Interleukin-4; soluble; IgE.

OS Mus musculus.

FH Key Location/Qualifiers

FT Region 1..25

FT /label= signal sequence

FT Domain 26..233

FT /label= extracellular domain

FT Domain 234..257

FT /label= transmembrane region

FT Domain 258..810

FT /label= cytoplasmic domain 

PN EP-419091-A. 

PD 27-MAR-1991. 

PF 05-SEP-1990; 309700. 

PR 07-SEP-1989; US-404179. 

PR 20-MAR-1990; US-496449. 

PA (SCHE ) SCHERING CORP. 

PI Galizzi JP, Harada N, Miyajima A;

DR WPI; 91-088700/13. 

PT Nucleic acid encoding mammalian interleukin-4 receptor - used as 

PT antagonists of interleukin-4 in treating conditions associated 

PT with excess IgE prodn. including allergic conditions.

PS Disclosure; Fig 5; 18pp; English.

CC The sequence was deduced from cDNA clone 19 from a cDNA

CC library prepd. from MC/9 cells.

CC See also R11254 and Q11055.

SQ Sequence 810 AA;

SQ 50 A; 23 R; 29 N; 34 D; 0 B; 31 C; 39 Q; 53 E; 0 Z; 63 G; 18 H;

SQ 29 I; 81 L; 30 K; 15 M; 23 F; 82 P; 90 S; 38 T; 17 W; 18 Y; 47 V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II I I II II 

KAFSSLLSSNGIRGDTAAAGTDDGHGGYKPF8NPVPN8SPSSVPLFTFGLDSELSPSPLNSDPPKSPPECLG 
610 620 630 640 650 X 660 670 

X 
R 

LELGLKGGDWVKAPPPADQVPKPFGDDLGFGIVYSSLTCHLCGHLK9HHS8 
X 680 690 700 710 720 



8. US-08-249-182-11 (1-23) 

R04574 Derived amino acid sequence of coding region of mu 

ID R04574 standard; protein; 810 AA. 
AC R04574; 

DT 17-SEP-1990 (first entry) 

DE Derived amino acid sequence of coding region of murine IL-4 receptor 
KW mammalian interleukin-4 receptor; cytokine; antibody production; 
OS synthetic. 

FH Key Location/Qualifiers 

FT misc_feature 209.. 232 

FT /label= putative transmembrane region 

PN EP-367566-A. 

PD 09-MAY-1990. 

PF 31-OCT-1989; 311244.

PR 23-JUN-1989; US-370924. 

PA (IMMU-) Immunex Corp. 



DR WPI; 90-141470/19.

DR N-PSDB; Q04305.

PT Recombinant mammalian interleukin-4 receptor used in diagnosis,

PT assays and therapy and for prodn. of antibodies for diagnosis, therapy

PT and for prodn. of antibodies

PS Disclosure; p; English.

CC The interleukin-4 receptor can be used to regulate immune responses or

CC treat IgE-induced hypersensitivity.

CC See also Q04307.

SQ Sequence 810 AA;

SQ 49 A; 23 R; 28 N; 34 D; 0 B; 32 C; 39 Q; 52 E; 0 Z; 63 G; 18 H;

SQ 29 I; 80 L; 30 K; 15 M; 25 F; 83 P; 89 S; 39 T; 16 W; 18 Y; 48 V;

Initial Score = 8 Optimized Score = 9 Significance = 4.60
Residue Identity = 43% Matches = 10 Mismatches = 12

Gaps = 1 Conservative Substitutions = 0

X 10 20 

TEFLSNYLTNVDDITLVPETLG
II II II II II 

AFSSLLSSNAIRGDTAAAGTDDGHGGYKPFQNPVPNSSPSSVPLFTFGLDTE-LSPSPLNSDPPKSPPECLG 
610 620 630 640 650 X 660 670 

X 
R 

LELGLKGGDMVKAPPPADEVPKPFGDDLGFGIVYSSLTCHLCGHLKQHHS8 
X 680 690 700 710 720 
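
The Residue Identity figures in this report are consistent with matches divided by the total number of aligned columns (matches, mismatches and gaps), with the percentage truncated to a whole number. This relationship is inferred from the printed values rather than taken from FastDB documentation, so the Python sketch below should be read as an assumption:

```python
def residue_identity(matches, mismatches, gaps):
    """Percent identity over all aligned columns, truncated to an integer."""
    columns = matches + mismatches + gaps
    return 100 * matches // columns

# (matches, mismatches, gaps) as printed for R37453, R25462 and R04574.
for triple in [(22, 1, 0), (8, 15, 0), (10, 12, 1)]:
    print(triple, residue_identity(*triple))
```

These give 95, 34 and 43, the percentages printed for those three records.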



9. US-08-249-182-11 (1-23) 

R11993 Glutamate receptor 5-1.

ID R11993 standard; Protein; 920 AA. 

AC R11993; 

DT 31-JUL-1991 (first entry) 

DE Glutamate receptor 5-1. 

KW Glutamate receptor 5-1; probe; ligand; drug screening.

OS Rattus rattus.

FH Key Location/Qualifiers

FT Peptide 1..30

FT /label= sig_peptide

FT Protein 31..920

FT /label= mat_protein

FT Region 402..416

FT /label= insertion

PN WO9106648-A.

PD 16-MAY-1991.

PF 25-OCT-1990; U06153.

PR 27-OCT-1989; US-428116.

PA (SALK ) SALK INST FOR BIOL STUD.

PI Heinemann SF, Boulter JR, Hollmann M, Bettler B, Jensen JE;

DR WPI; 91-164197/22.

DR N-PSDB; Q11853.

PT Glutamate receptors - used to screen for functional ligands and

PT identify and isolate further receptors

PS Disclosure; Fig 10; 109pp; English.

CC GluR5-1 has a 15 amino acid insert (see features) compared to the

CC shorter variant GluR5-2 and is unique among the receptors GluR1-7.

CC It has a Mr of 100,000. The signal sequence cleavage site is after

CC a Pro, which is atypical.

CC The gene and protein can be used in drug screening, to

CC determine whether a substance is a functional ligand for the

CC receptor by monitoring ion channel activity.

CC See also Q11849-355.



SQ Sequence 920 AA;

SQ 53 A; 47 R; 45 N; 46 D; 0 B; 13 C; 28 Q; 54 E; 0 Z; 55 G; 14 H;
SQ 63 I; 99 L; 59 K; 28 M; 40 F; 37 P; 70 S; 59 T; 18 W; 38 Y; 54 V;



Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II! I I III 

NRDRSNNITDSLANRTLIVTTILEEPYVMYRKSDKPLYGNDRFEGYCLDLLKELSNILGFLYDVKLVPDGKY 
440 450 460 470 480 X 490 500 

X 
R 

GAQNDKGEWNGHVKELIDHRADLAVAPLTITYVREKVIDFSKPFMTLGISI 
X 510 520 530 540 550 



10. US-08-249-182-11 (1-23) 

R32882 Cardiac adenylyl cyclase type V. 

ID R32882 standard; Protein; 1184 AA. 

AC R32882;

DT 17-JUN-1993 (first entry)

DE Cardiac adenylyl cyclase type V.

KW CACV; therapy; diagnostic; cardiac function; cyclic AMP; cAMP; heart

KW failure.

OS Canis familiaris.

PN EP-529622-A.

PD 03-MAR-1993.

PF 27-AUG-1992; 114637.

PR 29-AUG-1991; US-751460.

PA (AMCY ) AMERICAN CYANAMID CO.

PI Ishikawa Y, Konski AF;

DR WPI; 93-068688/09.

DR N-PSDB; S37543.

PT Isolated nucleic acid mol. encoding Cardiac adenylyl cyclase type

PT V - useful for determining and modifying cardiac function

PS Claim 4; Page 15-27; 38pp; English.

CC Left ventricular tissue of canine heart was used as a source of mRNA.

CC A cDNA library was prepd. in lambda gt10 phage. A 970 bp Aat-HincII

CC fragment from type I adenylyl cyclase cDNA was used as probe. The

CC clones isolated were used to obtain cDNA encoding CACV. This probe

CC may also be used to screen a human cardiac cDNA library to obtain

CC the cDNA encoding human CACV. CACV, its analogues and antibodies

CC are useful in therapy or diagnostic assays, e.g. in modifying and

CC determining cardiac function. A decrease in CACV content of the

CC heart contributes to impaired cAMP prodn. and in heart failure. The

CC CACV can also be used to screen for cpds. which stimulate or inhibit

CC the activity of the cyclase.

SQ Sequence 1184 AA;

SQ 123A; 83 R; 52 N; 43 D; 0 B; 35 C; 43 Q; 64 E; 0 Z; 84 G; 30 H;

SQ 58 I; 122L; 47 K; 36 M; 54 F; 51 P; 71 S; 52 T; 11 W; 32 Y; 93 V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II III III 

VRKFLLTFREPDLEKKYSKQVDDRFGAYVACASLVFLFICFVQITIVPHSVFMLSFYLTCFLLLTLVVFVSV 

660 670 680 690 700 710 720 730



X 
R 



IYSCVKLFPGPLQSLSRKIVRSKTNSTLVGVFTITLVFLSAFVNMFMCNSE 
X 740 750 760 770 780 



11. US-08-249-182-11 (1-23) 
R40227 ACVS. 

ID R40227 standard; Protein; 3639 AA. 

AC R40227; 

DT Sl-FEB-1994 (first entry) 

DE ACVS. 

KW Delta-(L-alpha-aminoadipyl)-L-cystinyl-D-valine synthase; ACVS;

KW beta-lactam; antibiotic; transformed; cephalosporin; vector.

OS Acremonium chrysogenum.

PN J05192162-A.

PD 03-AUG-1993.

PF 25-JUL-1991; 186222.

PR 31-JUL-1990; JP-205677.

PA (TAKE ) TAKEDA CHEM IND LTD.

DR WPI; 93-277475/35.

DR N-PSDB; Q48231.

PT DNA coding delta-(L-alpha-amino-adipyl) L-cystinyl D-valine

PT synthase - for improved productivity of cephalosporin antibiotics

PS Claim 1; Page 14-27; 69pp; Japanese.

CC The sequence (Q48231) is of a vector which includes the

CC delta-(L-alpha-aminoadipyl)-L-cystinyl-D-valine synthase gene.

CC This sequence was transformed into a host cell to express the ACVS

CC product. The protein produced (R40227) was then used to manufacture

CC a beta-lactam antibiotic.

SQ Sequence 3639 AA;

SQ 258 A; 209R; 143N; 213D; 0 B; 39 C; 175Q; 232E; 0 Z; 232G; 126H;

SQ 188I; 405L; 135K; 58 M; 130F; 181P; 271S; 212T; 38 W; 127Y; 267V;

Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG

Hill I II 

T9KPSDLAYVIFTSGTTGKPKGVLVEHQSVVQLRNSLIERYFGETNGSHAVLFLSNYVFDFSLE8LCLSVLG 
2510 2520 2530 2540 2550 2560 2570 

X 
R 

GNKLIIPPEEGLTHEAFYDIGRREKLSYLSGTPSVL8QIELSRLPHLHHVT 
2580 2590 2600 2610 2620 



12. US-08-249-182-11 (1-23) 

R13896 ACV synthetase.

ID R13896 standard; Protein; 3712 AA.
AC R13896;

DT 22-NOV-1991 (first entry)
DE ACV synthetase.

KW Beta lactam antibiotics; penicillin.

OS Acremonium chrysogenum.

FH Key Location/Qualifiers

FT Domain 301..1068

FT /label= I

FT /function= activation of amino acid substrate

FT Region 374..423

FT /label= subdomain

FT Region 474..501

FT /label= subdomain

FT Region 655..699

FT /label= subdomain

FT Region 725..754

FT /label= subdomain

FT Domain 1392..2154

FT /label= II

FT /function= activation of amino acid substrate

FT Region 1470..1518

FT /label= subdomain

FT Region 1564..1590

FT /label= subdomain

FT Region 1745..1789

FT /label= subdomain

FT Region 1817..1846

FT /label= subdomain

FT Domain 2474..3295

FT /label= III

FT /function= activation of amino acid substrate

FT Region 2554..2603

FT /label= subdomain

FT Region 2647..2673

FT /label= subdomain

FT Region 2827..2871

FT /label= subdomain

FT Region 2899..2928

FT /label= subdomain

FT Domain 3560..3647

FT /label= IV

FT /function= thioesterase

PN EP-445868-A. 

PD 11-SEP-1991.

PF 27-FEB-1991; 200423.

PR 28-FEB-1990; EP-200475.

PR 28-FEB-1990; EP-200488.

PR 02-JUL-1990; EP-201768.

PR 03-OCT-1990; EP-202628.

PR 27-FEB-1991; EP-200423.

PA (KONN ) GIST-BROCADES NV.

PI Veenstra AE, Martin JF, Garcia BD, Guttierez S, Barredo JL;

PI Montenegro PE, Von Doehren H, Palissa H, Van Liempt H;

DR WPI; 91-268735/37.

DR N-PSDB; Q13608.

PT DNA encoding amino:adipyl-cysteinyl-valine synthetase - used for

PT prodn. of the enzyme or enhanced prodn. of new or known

PT beta-lactam antibiotic cpds.

PS Claim 1; Page 20; 54pp; English.

CC The DNA sequence was obtd. from five subclones isolated from a

CC gene library of A. chrysogenum C10 (ATCC 48). The protein

CC sequence was deduced from the DNA. Three distinct regions of

CC homology have been identified, domains I, II and III. Within

CC these domains several even more conserved elements can be

CC distinguished. Since the enzyme synthesises a tripeptide, which

CC most probably requires the activation of three amino acids, a

CC role for these domains in the activation reactions seems likely.

CC A fourth domain is thought to act as a thioesterase.

CC The gene can be used to express the synthetase enzyme which can

CC be used for the prodn. of new beta-lactam antibiotics.

CC See also R13896.

SQ Sequence 3712 AA;

SQ 262A; 214R; 144N; 219D; 0 B; 42 C; 180Q; 237E; 0 Z; 237G; 127H;

SQ 190I; 413L; 135K; 58 M; 131F; 184P; 282S; 217T; 39 W; 128Y; 273V;



Initial Score = 8 Optimized Score = 8 Significance = 4.60
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
IIIII I II 

TSKPSDLAYVIFTSGTTGKPKGVLVEHQSVVQLRNSLIERYFGETNGSHAVLFLSNYVFDFSLEflLCLSVLG 
2580 2590 2600 2610 2620 2630 2640 2650 



X 
R 



GNKLIIPPEEGLTHEAFYDIGRREKLSYLSGTPSVL89IELSRLPHLHHVT 
X 2660 2670 2680 2690 2700 



13. US-08-249-182-11 (1-23)
P90430 Cyclophilin. 



ID P90430 standard; peptide; 42 AA. 

AC P90430;

DT 19-OCT-1989 (first entry)

DE Cyclophilin.

KW Cyclosporin A-binding; human cytosolic protein; chronic

KW inflammatory disease; transplants; cyclophilin; rheumatoid arthritis;

KW systemic lupus erythematosus; psoriasis; asthma; multiple sclerosis;

KW myasthenia gravis; juvenile diabetes; autoimmune diseases.

OS Homo sapiens (human).

PN EP326067-A.

PD 02-AUG-1989.

PF 24-JAN-1989; 101164.

PR 26-JAN-1988; US-148473.

PA (DUPO) Du Pont De Nemours Co.

PI Ackerman N R; Galbraith W; Irr J D; Jaffee B D; Lischwe M A.

DR WPI; 89-222037/31.

PT Cyclophilin and new isoforms

PT - used as antiinflammatory and immunomodulatory agent, eg to treat

PT auto-immune diseases and prevent rejection.

PS Claim 5; page 10; 12pp; English.

CC Amino acids 1-42 of cyclosporin A-binding human cytosolic protein

CC (17 kD, purity 95%). Val-1 is acetylated, or preceded by acetylated

CC Met. This cyclophilin and its isoforms (see P90431-P90437) are used

CC as anti-inflammatory and immunomodulatory agents, eg to treat

CC autoimmune diseases and to prevent rejection (see further uses

CC in Keywords).

SQ Sequence 42 AA;

SQ 4 A; 2 R; 2 N; 4 D; 0 B; 0 C; 0 Q; 4 E; 0 Z; 2 G; 0 H;

SQ 1 I; 3 L; 2 K; 0 M; 5 F; 3 P; 2 S; 4 T; 0 W; 0 Y; 4 V;



Initial Score = 7 Optimized Score = 8 Significance = 3.95
Residue Identity = 44% Matches = 8 Mismatches = 10

Gaps = 0 Conservative Substitutions = 0 

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

I III I Ml 
VNPTVFFDITADDEPLGRVSFELFADKVPKTAENFRALSTGE 
X 10 20 30 40 



14. US-08-249-182-11 (1-23) 

R32203 Apple fruit PPO pSR7. 

ID R32203 standard; Protein; 68 AA.



AC R32203;

DT 09-JUN-1993 (first entry) 

DE Apple fruit PPO pSR7. 

KW Polyphenol oxidase; PPO; catalyst; browning; fruit; plastid; vacuole; 

KW transform; coffee; tea; black olives; grapevine; chloroplast; apple;

KW transit peptide; recombinant plasmid; PCR; broad bean; potato. 

OS Apple. 

PN WO9302195-A.

PD 04-FEB-1993. 

PF 16-JUL-1992; AU0356. 

PR 17-JUL-1991; AU-007248. 

PA (CSIR ) COMMONWEALTH SCI & IND RES ORG. 

PI Dry IB» Robinson SP; 

DR WPI; 93-058792/07. 

DR N-PSDB; Q36664.

PT DNA encoding polyphenol oxidase polypeptide or fragment - useful

PT for modifying the oxidase activity in fruit and vegetables to

PT decrease or enhance browning

PS Claim 8; Fig 3; 44pp; English.

CC The sequences given in R32201-06 and R33772 represent polyphenol 

CC oxidase (PPO) enzymes from various plants. PPO is thought to be the 

CC predominant catalyst in browning of fruit caused by injury or damage. 

CC PPO is localised in the plastids of plant cells whereas the phenolic 

CC substrates of the enzyme are stored in the plant cell vacuole. This 

CC compartmentation prevents the browning reaction from occurring unless

CC the plant cells are damaged and the enzyme and the substrate are 

CC mixed. The gene sequences encoding these proteins could be used to 

CC construct synthetic genes which may be used to transform plants to 

CC decrease expression of the enzyme gene. In some instances, e.g.

CC coffee, tea, black olives etc. it is desirable to increase the level

CC of PPO to produce desired levels of browning or changes in flavour

CC compounds. The grapevine PPO codes for an additional 103 amino acids

CC upstream of the N-terminus of the mature protein. This region has the

CC properties of a chloroplast transit peptide and is most likely 

CC responsible for targetting of the protein to be imported into the 

CC chloroplast and processed to produce mature PPO. Transformation of 

CC plants with this gene may therefore result in correct targetting and 

CC maturation of the grapevine PPO in other species and result in 

CC accumulation of active grapevine PPO enzyme in the plastids of these 

CC tissues. 

SQ Sequence 68 AA; 

SQ 4 A; 4 R; 5 N; 9 D; 0 B; 2 C; 0 Q; 4 E; 0 Z; 4 G; 2 H;

SQ 0 I; 6 L; 4 K; 2 M; 4 F; 2 P; 3 S; 3 T; 3 W; 3 Y; 4 V;



Initial Score = 7 Optimized Score = 7 Significance = 3.95
Residue Identity = 30% Matches = 7 Mismatches = 16

Gaps = 0 Conservative Substitutions = 0 

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 
III I III 

EDMGNFYSAGRDPLFYAHHCNVDRMWNVMKTLGGKRKDPTDTDWLDAEFLFYDENAELVSCKVRDSLN 
10 X 20 30 X 40 50 60 



15. US-08-249-182-11 (1-23)

R25119 Non-A, Non-B Hepatitis Virus antigen #10.

ID R25119 standard; Protein; 72 AA.
AC R25119;

DT 07-DEC-1992 (first entry)

DE Non-A, Non-B Hepatitis Virus antigen #10.

KW Antigen S29; NANBH; Hepatitis C; HCV; T064; T069; T06A; ELISA.
OS Non-A Non-B Hepatitis Virus.
PN WO9209634-A.
PD 11-JUN-1992.



PR 29-NOV-1990; JP-325434.

PR 29-NOV-1990; JP-325435.

PR 16-JAN-1991; JP-070231.

PR 19-APR-1991; JP-179074.

PR 07-JUN-1991; JP-232590.

PA (TORA ) TORAY IND INC.

PI Arina T, Ida N, Kazani J, Sato A;

DR WPI; 92-217026/26.

DR N-PSDB; Q25749.

PT New non-A non-B hepatitis virus antigen proteins - for highly

PT specific detection of hepatitis

PS Claim 1; Page 49; 80pp; Japanese.

CC This sequence is one of 12 claimed antigen sequences specific to

CC NANBH virus. The antigens can be used singly or in combination in

CC an ELISA diagnosis of hepatitis. See R24946 and R25110-R25121.

SQ Sequence 72 AA;

SQ 7 A; 4 R; 1 N; 5 D; 0 B; 1 C; 2 Q; 5 E; 0 Z; 4 G; 2 H;

SQ 2 I; 5 L; 0 K; 1 M; 7 F; 7 P; 5 S; 6 T; 1 W; 0 Y; 7 V;



Initial Score = 7 Optimized Score = 8 Significance = 3.95
Residue Identity = 39% Matches = 9 Mismatches = 12

Gaps = 2 Conservative Substitutions = 0 

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR

II I II II I I 
EFFSHVDGV9IHRFAPTPGPFFRDEVTFTVGLNSFVVGS9LPCDPEPDTEVLASMLTDPSHIT — AEAAARR 
10 20 30 40 50 60 70 



LA 



> 0 < 

0| |0 IntelliGenetics

> 0 < 



FastDB - Fast Pairuise Conparison of Sequences 
Release 5.4 



Results file u249_llp.res nade by on Thu 22 Sep 94 10:42:55-PDT. 



Query sequence being conpared:US-08-249-182-ll (1-23) 
Number of sequences searched: 70848 
Nunber of scores above cutoff: 3958 

Results of the initial conparison of US-08-249-182-11 (1-23) with: 
Data bank : PIR 41. all entries 

[Histogram of the initial comparison: NUMBER OF SEQUENCES (logarithmic axis, 5 to 100000) plotted against SCORE (0 to 22) and STDEV (-1 to 9). The plotted points are not recoverable from the scan.]



PARAMETERS

Similarity matrix        Unitary     K-tuple               2
Mismatch penalty         1           Joining penalty       20
Gap penalty              1.00        Window size           5
Gap size penalty         0.05
Cutoff score             0
Randomization group      0

Initial scores to save   40          Alignments to save    15
Optimized scores to save 0           Display context       50





SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           3       4         1.36

Times:     CPU            Total Elapsed
           00:01:32.88    00:01:42.00

Number of residues:             20816057
Number of sequences searched:   70848
Number of scores above cutoff:  3958

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.
Cut-off raised to 5.
Cut-off raised to 6.
Cut-off raised to 7.

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
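The printed significance values are consistent with the number of standard deviations by which an initial score exceeds the mean score of the search. The sketch below assumes that definition; it is not taken from FastDB documentation, and the function name `significance` is hypothetical. Because the printout rounds the mean to a whole number, recomputed values can differ from the printed ones in the second decimal place.

```python
def significance(score: float, mean: float, stdev: float) -> float:
    """Standard deviations above the mean (assumed meaning of the 'Sig.' column)."""
    return (score - mean) / stdev

# Search statistics printed for the PIR 41 run: mean 3, standard deviation 1.36.
for initial_score in (22, 10, 9, 8):
    print(initial_score, round(significance(initial_score, 3, 1.36), 2))
```

For the PIR run this gives values close to the printed 14.00, 5.16, 4.42 and 3.68.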



The list of best scores is:

                                                        Init.  Opt.
    Sequence Name  Description                   Length Score Score   Sig. Frame

               **** 13 standard deviations above mean ****
 1. A42329  autotaxin - human (fragments)           114    22    22  14.00   0
               **** 5 standard deviations above mean ****
 2. S33464  hypothetical protein - Arabid           341    10    10   5.16   0
               **** 4 standard deviations above mean ****
 3. A44464  oxaloacetate decarboxylase be           141     9    10   4.42   0
 4. S26196  imidazoleglycerol-phosphate d           208     9     9   4.42   0
 5. DAPSPC  biphenyl-2,3-diol 1,2-dioxyge           293     9     9   4.42   0
 6. C44465  sodium ion pump oxaloacetate            433     9    10   4.42   0
 7. A27705  maltohexaose-producing amylas           518     9     9   4.42   0
 8. S13163  65K antigen - Chinese hamster           556     9     9   4.42   0
 9. QQBEHA  BSLF1 protein - saimiriine he           835     9     9   4.42   0
10. A35088  phycobiliprotein L-CM - Calot          1080     9     9   4.42   0
11. S11455  botulinum neurotoxin type D -          1276     9     9   4.42   0
               **** 3 standard deviations above mean ****
12. S20764  Ig heavy chain V region - hum            63     8     8   3.68   0
13. S04891  hypothetical protein, 9.2K -             82     8     8   3.68   0
14. S01741  hypothetical protein XI - por            82     8     8   3.68   0
15. C60007  hypothetical protein C - porc            82     8     8   3.68   0
16. C36607  NS4 protein - porcine respira            82     8     8   3.68   0
17. S24280  hypothetical protein 4 - porc            82     8     8   3.68   0
18. S36265  Ig heavy chain V region (clon           118     8     8   3.68   0
19. S15611  hypothetical protein - slime            176     8     8   3.68   0
20. B25599  repB protein - Streptococcus            210     8     8   3.68   0
21. S05981  repB protein - Streptococcus            210     8     8   3.68   0
22. A40084  epidermal growth factor-relat           265     8     8   3.68   0
23. flOYV   transforming protein myb - av           265     8     8   3.68   0
24. S30311  sugar transport protein - yea           287     8     8   3.68   0
25. S36863  L-lactate dehydrogenase (EC 1           319     8     9   3.68   0
26. S35230  hypE protein - Bradyrhizobium           321     8     8   3.68   0
27. A43586  hypothetical protein 1 - Sain           345     8     9   3.68   0
28. S28129  gvpN protein - Halobacterium            347     8     8   3.68   0
29. S22387  cuticle-degrading proteinase            388     8     8   3.68   0
30. B48899  cephalosporinase, AmpC - Vers           388     8     8   3.68   0
31. S24562  beta-lactamase (EC 3.5.2.6) -           388     8     8   3.68   0
32. A36314  transforming protein (myb) -            388     8     8   3.68   0
33. A27891  RepA protein - Bacillus subti           396     8     8   3.68   0
34. PL0103  fibronectin receptor alpha ch           409     8     8   3.68   0
35. B25937  arsenical pump membrane prote           429     8     9   3.68   0
36. A47044  threonine dehydratase (EC 4.2           436     8    10   3.68   0
37. A47124  sporulation-specific 1,3-beta           445     8     8   3.68   0
38. A40639  exo-1,3-beta-glucanase - yeas           445     8     8   3.68   0
39. S36356  glucan 1,3-beta-glucosidase (           445     8     8   3.68   0
40. JN0118  exo-1,3-beta-glucanase (EC 3.           448     8     8   3.68   0



1. US-08-249-182-11 (1-23) 

A42329 autotaxin - human (fragments)

ENTRY           A42329 #type fragments
TITLE           autotaxin - human (fragments)
ORGANISM        #formal_name Homo sapiens #common_name man
DATE            04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change
                08-May-1993
ACCESSIONS      A42329
REFERENCE       A42329
   #authors     Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
                Cioce, V.; Schiffmann, E.; Liotta, L.A.
   #journal     J. Biol. Chem. (1992) 267:2524-2529
   #title       Identification, purification, and partial sequence analysis
                of autotaxin, a novel motility-stimulating protein.
   #cross-references MUID:92129337
   #accession   A42329
   ##status     preliminary
   ##molecule_type protein
   ##residues   1-114 ##label STR
   ##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
                NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
                NCBIP:78509; NCBIP:78508; NCBIP:78503
   ##note       sequence extracted from NCBI backbone
SUMMARY         #length 114 #checksum 7335
SEQUENCE

Initial Score = 22 Optimized Score = 22 Significance = 14.00
Residue Identity = 95% Matches = 22 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X 10 20 X

TEFLSNYLTNVDDITLVPETLGR
|||||||||||||||||| ||||

TEFLSNYLTNVDDITLVPGTLGRDIEHLTSLDFFRVNSM8TVFVGYGPTFKGGSPLHITATKSPPFENINLY 
X 10 20 X 30 40 50 60 70 
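The row of marks between the two sequences flags positions where query and database residues are identical. A minimal sketch of how such a row and the Matches/Mismatches counts can be derived for an ungapped alignment follows; the helper name `match_line` is hypothetical, and the target string is simply the first 23 residues of A42329 as printed in the alignment.

```python
def match_line(query: str, target: str) -> str:
    """'|' where aligned residues are identical, ' ' otherwise (ungapped alignment)."""
    return "".join("|" if q == t else " " for q, t in zip(query, target))

query = "TEFLSNYLTNVDDITLVPETLGR"   # US-08-249-182-11 (1-23)
target = "TEFLSNYLTNVDDITLVPGTLGR"  # A42329, residues 1-23 as printed
marks = match_line(query, target)
matches = marks.count("|")
mismatches = len(marks) - matches
print(marks)
print(matches, mismatches)  # 22 1
```

The single mismatch is at query position 19 (E in the query, G in A42329), which agrees with the printed Matches = 22 and Mismatches = 1.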



2. US-08-249-182-11 (1-23) 

S33464 hypothetical protein - Arabidopsis thaliana

ENTRY           S33464 #type complete
TITLE           hypothetical protein - Arabidopsis thaliana
ORGANISM        #formal_name Arabidopsis thaliana #common_name mouse-ear
                cress
DATE            22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
                22-Nov-1993
ACCESSIONS      S33464
REFERENCE       S33464
   #authors     Quigley, F.R.
   #submission  submitted to the EMBL Data Library, May 1993
   #accession   S33464
   ##status     preliminary
   ##residues   1-341 ##label QUI
   ##cross-references EMBL:X71915
SUMMARY         #length 341 #molecular-weight 37008 #checksum 2677
SEQUENCE

Initial Score = 10 Optimized Score = 10 Significance = 5.16
Residue Identity = 43% Matches = 10 Mismatches = 13
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
1)1 I III I II 

FYDPASKKMTEVETTGAKPSARSVFAHAVVGKYIIIFAGEVHPDLNGHYGPGTLSNEGYALDTETLVHEKLG 
230 240 250 260 270 280 290 



A 

R 



EEGAPAIPRGWTAYTAATVDGKNGLLMHGGKLPTNERTDDLYFYAVNSA 
X 300 310 320 330 340 



3. US-08-249-182-11 (1-23) 

A44464 oxaloacetate decarboxylase beta subunit (C terminus

ENTRY           A44464 #type fragment
TITLE           oxaloacetate decarboxylase beta subunit (C terminus) -
                Klebsiella pneumoniae (fragment)
ORGANISM        #formal_name Klebsiella pneumoniae
DATE            30-Apr-1993; #sequence_revision 30-Apr-1993; #text_change
                30-Apr-1993
ACCESSIONS      A44464
REFERENCE       A44464
   #authors     Woehlke, G.; Laussermair, E.; Schwarz, E.; Oesterhelt, D.;
                Reinke, H.; Beyreuther, K.; Dimroth, P.
   #journal     J. Biol. Chem. (1992) 267:22804-22805
   #title       Appendix. Sequence of the beta-subunit of oxaloacetate
                decarboxylase from Klebsiella pneumoniae: a correction of
                the C-terminal part.
   #cross-references MUID:93054592
   #accession   A44464
   ##status     preliminary
   ##residues   1-141 ##label WOE
   ##cross-references NCBIP:118089
   ##note       sequence extracted from NCBI backbone
SUMMARY         #length 141 #checksum 9344
SEQUENCE

Initial Score = 9 Optimized Score = 10 Significance = 4.42
Residue Identity = 44% Matches = 11 Mismatches = 12
Gaps = 2 Conservative Substitutions = 0
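Across the alignments in this printout, the Residue Identity figure agrees with the number of matches divided by the number of aligned positions (matches plus mismatches plus gaps), truncated to a whole percent. That relationship is inferred from the printed values, not from FastDB documentation, and the function name `residue_identity` is hypothetical.

```python
def residue_identity(matches: int, mismatches: int, gaps: int) -> int:
    """Percent identity over aligned positions, truncated to an integer (assumed rule)."""
    aligned = matches + mismatches + gaps
    return (100 * matches) // aligned

# Values taken from alignment summaries in this printout.
print(residue_identity(22, 1, 0))   # A42329: printed 95%
print(residue_identity(11, 12, 2))  # A44464: printed 44%
print(residue_identity(9, 14, 0))   # S26196: printed 39%
```

The A44464 case shows why gaps belong in the denominator: 11 of 23 residues would give 47%, whereas 11 of 25 aligned positions gives the printed 44%.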



X 10 20 X 

TEF--LSNYLTNVDDITLVPETLGR

III! II I I III 
GNLMRESGVVERLSDTVQNALINIVTIFLGLSVGAKLVADKFLSPSTLGILVLGVIAFCVGTAAGVLMAKLM 
10 20 X 30 40 50 60 70 



NVFSRHKINPLIGSAGVSAVPMAARVSN 
80 90 100 



4. US-08-249-182-11 (1-23) 

S26196 imidazoleglycerol-phosphate dehydratase (EC 4.2.1. 



ENTRY           S26196 #type complete
TITLE           imidazoleglycerol-phosphate dehydratase (EC 4.2.1.19) -
                fungus (Trichoderma harzianum)
ORGANISM        #formal_name Trichoderma harzianum
DATE            12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change
                31-Dec-1993
ACCESSIONS      S26196; S19787
REFERENCE       S26196
   #authors     Goldman, G.H.; Demolder, J.; Dewaele, S.; Herrera-Estrella,
                A.; Geremia, R.A.; van Montagu, M.; Contreras, R.
   #journal     Mol. Gen. Genet. (1992) 234:481-488
   #title       Molecular cloning of the imidazoleglycerolphosphate
                dehydratase gene of Trichoderma harzianum by genetic
                complementation in Saccharomyces cerevisiae using a direct
                expression vector.
   #accession   S26196
   ##residues   1-208 ##label GOL
   ##cross-references EMBL:Z11528
GENETICS
   #gene        igh
CLASSIFICATION  #superfamily imidazoleglycerol-phosphate dehydratase
KEYWORDS        carbon-oxygen lyase; histidine biosynthesis; hydro-lyase
SUMMARY         #length 208 #molecular-weight 22356 #checksum 3322
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
III I I I I II 

KGDLHIDDHHTAEDCCIAVGTTFAKALGALTGVARFGYAYAPLDEALSRAVVDLSNRPYTVVDLGLKREKLG 
90 100 110 120 130 X 140 150 

X 
R 



ELSCEMIPHCLQSFAQAARITLHVDCLRGDNDHHRAE5AFKALAVAVRWYD 
160 170 180 190 200 



5. US-08-249-182-11 (1-23) 

DAPSPC biphenyl-2,3-diol 1,2-dioxygenase (EC 1.13.11.39)

ENTRY           DAPSPC #type complete
TITLE           biphenyl-2,3-diol 1,2-dioxygenase (EC 1.13.11.39) -
                Pseudomonas sp.
ALTERNATE_NAMES 2,3-dihydroxybiphenyl dioxygenase
ORGANISM        #formal_name Pseudomonas sp.
DATE            30-Jun-1990 #sequence_revision 30-Jun-1990 #text_change
                30-Jun-1993
ACCESSIONS      A32312; JU0085
REFERENCE       A32312
   #authors     Kimbara, K.; Hashimoto, T.; Fukuda, M.; Koana, T.; Takagi,
                M.; Oishi, M.; Yano, K.
   #journal     J. Bacteriol. (1989) 171:2740-2747
   #title       Cloning and sequencing of two tandem genes involved in
                degradation of 2,3-dihydroxybiphenyl to benzoic acid in the
                polychlorinated biphenyl-degrading soil bacterium
                Pseudomonas sp. strain KKS102.
   #cross-references MUID:89213965
   #contents    Strain KKS102
   #accession   A32312
   ##molecule_type DNA
   ##residues   1-293 ##label KIM
REFERENCE       JU0085
   #authors     Takagi, M.
   #submission  submitted to JIPID, July 1989
   #accession   JU0085
   ##molecule_type DNA
   ##residues   1-293 ##label TAK
   ##note       the submitted nucleotide sequence was translated at
                JIPID
COMMENT         This enzyme catalyzes the third step in the major degradative
                pathway for biphenyl and polychlorinated biphenyls (PCBs),
                cleavage of a 2,3-dihydroxybiphenyl derivative at the 1 and 2
                positions to give a derivative of 2-hydroxy-6-oxo-6-phenylhexa-2,
                4-dienoate.
GENETICS
   #gene        bphC
CLASSIFICATION  #superfamily biphenyl-2,3-diol 1,2-dioxygenase
KEYWORDS        oxidoreductase; PCB biodegradation
SUMMARY         #length 293 #molecular-weight 32244 #checksum 7776
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDD I TLVPETLG 

II I I II III 
LPLE I YYGP AEI FHEPFLPSAPVSGF VTGD8G I GHF VRCVPDTAKAMAF YTEVLGF VLSD 1 1 DI 8HGPETSV 
120 130 140 150 160 X 170 180 

X 
R 

P AHFLHCNGRHHT I ALAAFPI PKRI HHFMLQANT I DDVGY AFDRLDA AGR I 
X 190 200 210 220 230 



6. US-08-249-182-11 (1-23) 

C44465 sodium ion pump oxaloacetate decarboxylase subunit

ENTRY           C44465 #type complete
TITLE           sodium ion pump oxaloacetate decarboxylase subunit beta -
                Salmonella typhimurium
ORGANISM        #formal_name Salmonella typhimurium
DATE            30-Apr-1993; #sequence_revision 30-Apr-1993; #text_change
                30-Apr-1993
ACCESSIONS      C44465
REFERENCE       A44465
   #authors     Woehlke, G.; Wifling, K.; Dimroth, P.
   #journal     J. Biol. Chem. (1992) 267:22798-22803
   #title       Sequence of the sodium ion pump oxaloacetate decarboxylase
                from Salmonella typhimurium.
   #cross-references MUID:93054591
   #contents    LT2
   #accession   C44465
   ##status     preliminary
   ##residues   1-433 ##label WOE
   ##cross-references NCBIP:118073
   ##note       sequence extracted from NCBI backbone
SUMMARY         #length 433 #molecular-weight 44923 #checksum 4940
SEQUENCE

Initial Score = 9 Optimized Score = 10 Significance = 4.42
Residue Identity = 44% Matches = 11 Mismatches = 12
Gaps = 2 Conservative Substitutions = 0



X 10 20 

TEF--LSNYLTNVDDI TLVPET 
II II I I I I I 

FPVVLLMLVALLLPDAAPLLGMFCFGNLMRESG VVERLSDTVQNGL I N I VT IFLGLS VGAKLVADKFL9PQT 
270 280 290 300 310 320 330 



X 

LGR 
II 

LGILLLGVIAFGIGTAAGVLMAKLLNLCSKNKINPLIGSAGVSAVPMAARVSN 
340 X 350 360 370 380 390 



7. US-08-249-182-11 (1-23) 

A27705 maltohexaose-producing amylase (EC 3.2.1.-) precur

ENTRY           A27705 #type complete
TITLE           maltohexaose-producing amylase (EC 3.2.1.-) precursor -
                Bacillus sp.
ALTERNATE_NAMES G6-amylase
ORGANISM        #formal_name Bacillus sp.
DATE            31-Mar-1989 #sequence_revision 31-Mar-1989 #text_change
                2S-May-1993
ACCESSIONS      A27705
REFERENCE       A27705
   #authors     Tsukamoto, A.; Kimura, K.; Ishii, Y.; Takano, T.; Yamane, K.
   #journal     Biochem. Biophys. Res. Commun. (1988) 151:25-31
   #title       Nucleotide sequence of the maltohexaose-producing amylase
                gene from an alkalophilic Bacillus sp. #707 and structural
                similarity to liquefying type alpha-amylases.
   #cross-references MUID:88162814
   #contents    Strain #707
   #accession   A27705
   ##molecule_type DNA
   ##residues   1-518 ##label TSU
KEYWORDS        glycosidase; hydrolase
FEATURE
   1-33         #domain signal sequence #label SIG
   34-518       #protein maltohexaose-producing amylase #label MAT
SUMMARY         #length 518 #molecular-weight 59008 #checksum 7204
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

111 I II I II 
MKMRTGKKGFLSILLAFLLVITSIPFTLVDVEAHHNGTNGTMM9YFEHYLPNDGNHHNRLNSDASNLKSKGI 
10 20 30 40 50 60 70 



TAVWIPPA 
80 



8. US-08-249-182-11 (1-23) 

S13163 65K antigen - Chinese hamster

ENTRY           S13163 #type complete
TITLE           65K antigen - Chinese hamster
ORGANISM        #formal_name Cricetulus griseus #common_name Chinese hamster
DATE            21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
                21-Nov-1993
ACCESSIONS      S13163
REFERENCE       S13163
   #authors     Ahmad, S.; Gupta, R.S.
   #journal     Biochim. Biophys. Acta (1990) 1087:253-255
   #title       Cloning of a Chinese hamster protein homologous to the mouse
                t-complex protein TCP-1: structural similarity to the
                ubiquitous 'chaperonin' family of heat-shock proteins.
   #cross-references MUID:91027940
   #accession   S13163
   ##status     preliminary
   ##residues   1-556 ##label AHM
SUMMARY         #length 556 #molecular-weight 60338 #checksum 2691
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDDITLVPETLG 

II I I I I III 
RAFHNEAflVNPERKNLKHIGLDLINGKPRDNKfiAGVFEPTIVKVKSLKFATEAAITILRIDDLIKLHPETKD 
470 480 490 500 510 520 530 

X 
R 

DKHGSYENAVHSGALDD 
540 550 



9. US-08-249-182-11 (1-23) 

QQBEHA BSLF1 protein - saimiriine herpesvirus 1 (strain 1

ENTRY           QQBEHA #type complete
TITLE           BSLF1 protein - saimiriine herpesvirus 1 (strain 11)
ORGANISM        #formal_name saimiriine herpesvirus 1
   #note        host Saimiri sciureus (common squirrel monkey)
DATE            30-Sep-1989 #sequence_revision 31-Dec-1992 #text_change
                04-Mar-1994
ACCESSIONS      I36811
REFERENCE       A36806
   #authors     Albrecht, J.
   #submission  submitted to the EMBL Data Library, January 1992
   #description Primary structure of the herpesvirus saimiri genome.
   #accession   I36811
   ##molecule_type DNA
   ##residues   1-835 ##label ALB
   ##cross-references GB:X64346
REFERENCE       A37309
   #authors     Albrecht, J.C.; Nicholas, J.; Biller, D.; Cameron, K.R.;
                Biesinger, B.; Newman, C.; Wittmann, S.; Craxton, M.A.;
                Coleman, H.; Fleckenstein, B.; Honess, R.W.
   #journal     J. Virol. (1992) 66:5047-5058
   #title       Primary structure of the herpesvirus saimiri genome.
   #cross-references MUID:92333688
   #contents    annotation; possible protein-coding frames
   #note        neither protein nor nucleotide sequence is given in this
                paper
GENETICS
   #gene        56
CLASSIFICATION  #superfamily varicella-zoster virus gene 6 protein
SUMMARY         #length 835 #molecular-weight 96127 #checksum 1688
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
III I I llll 

NYCHIKLARDSLES8AIDTSIDTLRGSLMLSNQDLVHYIYLSFFGCLNKDIFIKYSHLTNSDNIHFVPETEV 
240 250 260 270 280 X 290 300 

X 
R 



LAQSLDENFRKDMLTYYNKSTYLKTYITHKCIHLPDLIGYAP8DCTSFVYH 
X 310 320 330 340 350 



10. US-08-249-182-11 (1-23)

A35088 phycobiliprotein L-CM - Calothrix sp.

ENTRY           A35088 #type complete
TITLE           phycobiliprotein L-CM - Calothrix sp.
ORGANISM        #formal_name Calothrix sp.
DATE            03-Aug-1990 #sequence_revision 03-Aug-1990 #text_change
                18-Jun-1993
ACCESSIONS      A35088
REFERENCE       A35088
   #authors     Houmard, J.; Capuano, V.; Colombano, M.V.; Coursin, T.;
                Tandeau de Marsac, N.
   #journal     Proc. Natl. Acad. Sci. U.S.A. (1990) 87:2152-2156
   #title       Molecular characterization of the terminal energy acceptor of
                cyanobacterial phycobilisomes.
   #cross-references MUID:90192765
   #accession   A35088
   ##status     preliminary
   ##molecule_type DNA
   ##residues   1-1080 ##label HOU
   ##cross-references GB:M20806; GB:H31224
SUMMARY         #length 1080 #molecular-weight 120456 #checksum 6806
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II I I I! I II 

IRKYN9ILAT8GIRAFIGALVSSAEYAEVFGEDTVPYRRYPTLPAANFPNTEKLYNQLTK9NDDLVVPSFKT 
830 840 850 860 870 X 880 890 



VQPRLTLAGTSSSGRNGFTDLGRSSTSA9G8LGETANRCKPARIYRLSGTN 
900 910 920 930 940 



11. US-08-249-182-11 (1-23) 

S11455 botulinum neurotoxin type D - Clostridium botulinu

ENTRY           S11455 #type complete
TITLE           botulinum neurotoxin type D - Clostridium botulinum
ORGANISM        #formal_name Clostridium botulinum
DATE            18-Feb-1994; #sequence_revision 18-Feb-1994; #text_change
                18-Feb-1994
ACCESSIONS      S11455
REFERENCE       S11455
   #authors     Binz, T.; Kurazono, H.; Popoff, M.R.; Eklund, M.W.;
                Sakaguchi, G.; Kozaki, S.; Krieglstein, K.; Henschen, A.;
                Gill, D.M.; Niemann, H.
   #journal     Nucleic Acids Res. (1990) 18:5556
   #title       Nucleotide sequence of the gene encoding Clostridium
                botulinum neurotoxin type D.
   #cross-references MUID:91016853
   #accession   S11455
   ##status     preliminary
   ##residues   1-1276 ##label BIN
   ##cross-references EMBL:X54254
SUMMARY         #length 1276 #molecular-weight 146871 #checksum 326
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 4.42
Residue Identity = 39% Matches = 9 Mismatches = 14



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
III I I I I II 

DGSVPINPEIVDPLLPNVNHEPLNLPGEEIVFYDDITKYVDYLNSYYVLES8KLSNNVENITLTTSVEEALG 
500 510 520 530 540 550 560 

X 
R 

YSNKIYTFLPSLAEKVNKGV8AGLFLNWANEVVEDFTTNIMKKDTLDKISD 
570 580 590 600 610 620 



12. US-08-249-182-11 (1-23)

S20764 Ig heavy chain V region - human

ENTRY           S20764 #type complete
TITLE           Ig heavy chain V region - human
ORGANISM        #formal_name Homo sapiens #common_name man
DATE            19-Feb-1994; #sequence_revision 19-Feb-1994; #text_change
                19-Feb-1994
ACCESSIONS      S20764
REFERENCE       S20764
   #authors     Mortari, F.; Wang, J.; Schroeder, H.W.
   #submission  submitted to the EMBL Data Library, April 1992
   #accession   S20764
   ##status     preliminary
   ##residues   1-63 ##label MOR
   ##cross-references EMBL:Z11951
SUMMARY         #length 63 #molecular-weight 6858 #checksum 190
SEQUENCE

Initial Score = 8 Optimized Score = 8 Significance = 3.68
Residue Identity = 34% Matches = 8 Mismatches = 15
Gaps = 0 Conservative Substitutions = 0



X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

II I III II 
RKSDGGTTDYAAPVKGRFTISRDDSKNTVYLOMNSLKTEDSGVYYCTTDDIGttGQGTLVTVSS 
10 20 30 40 50 60 



13. US-08-249-182-11 (1-23) 

S04891 hypothetical protein, 9.2K - porcine transmissible

ENTRY           S04891 #type complete
TITLE           hypothetical protein, 9.2K - porcine transmissible
                gastroenteritis virus
ORGANISM        #formal_name porcine transmissible gastroenteritis virus
DATE            30-Jun-1992 #sequence_revision 30-Jun-1992 #text_change
                18-Jun-1993
ACCESSIONS      S04891
REFERENCE       S04889
   #authors     Kapke, P.A.; Tung, F.Y.T.; Brian, D.A.
   #submission  submitted to the EMBL Data Library, September 1988
   #description Nucleotide sequence between the peplomer and matrix protein
                genes of the porcine transmissible gastroenteritis
                coronavirus identifies three large open reading frames.
   #accession   S04891
   ##molecule_type DNA
   ##residues   1-82 ##label KAP
   ##cross-references EMBL:X12800
SUMMARY         #length 82 #molecular-weight 9239 #checksum 9602

Initial Score = 8 Optimized Score = 8 Significance = 3.68
Residue Identity = 34% Matches = 8 Mismatches = 15
Gaps = 0 Conservative Substitutions = 0

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

II I I I III 

MTFPRALTVIDDNGMVINI IFWFLLI I ILILLSI ALLMI IKLCMVCCNLGRTVI IVPAQHAYDAYKNFHRIK 
10 20 30 40 50 60 70 

AYNPDGALLA 
80 



14. US-08-249-182-11 (1-23)

S01741 hypothetical protein XI - porcine transmissible

ENTRY           S01741 #type complete
TITLE           hypothetical protein XI - porcine transmissible
                gastroenteritis virus (strain Purdue-115)
ORGANISM        #formal_name porcine transmissible gastroenteritis virus
DATE            30-Sep-1989 #sequence_revision 30-Sep-1989 #text_change
                18-Jun-1993
ACCESSIONS      S01741
REFERENCE       S01738
   #authors     Rasschaert, D.; Gelfi, J.; Laude, H.
   #journal     Biochimie (1987) 69:591-600
   #title       Enteric coronavirus TGEV: partial sequence of the genomic
                RNA, its organization and expression.
   #cross-references MUID:88078100
   #accession   S01741
   ##molecule_type mRNA
   ##residues   1-82 ##label RAS
   ##cross-references EMBL:X06371
SUMMARY         #length 82 #molecular-weight 9239 #checksum 9602
SEQUENCE

Initial Score = 8 Optimized Score = 8 Significance = 3.68
Residue Identity = 34% Matches = 8 Mismatches = 15
Gaps = 0 Conservative Substitutions = 0



X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

II I I I III 

MTFPRALTVIDDNGMVINI IFWFLLI I ILILLSIALLNI IKLCMVCCNLGRTVI IVPAQHAYDAYKNFHRIK 
10 20 30 40 50 60 70 



AYNPDGALLA 
80 



15. US-08-249-182-11 (1-23) 

C60007 hypothetical protein C - porcine transmissible

ENTRY           C60007 #type complete
TITLE           hypothetical protein C - porcine transmissible
                gastroenteritis virus (strain virulent Miller)
ORGANISM        #formal_name porcine transmissible gastroenteritis virus
DATE            03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change
                30-Sep-1993
ACCESSIONS      C60007
REFERENCE       A60007
   #authors     Wesley, R.D.; Cheung, A.K.; Michael, D.D.; Woods, R.D.
   #journal     Virus Res. (1989) 13:87-100
   #title       Nucleotide sequence of coronavirus TGEV genome RNA: evidence
                for 3 mRNA species between the peplomer and matrix protein
                genes.
   #accession   C60007
   ##molecule_type mRNA
   ##residues   1-82 ##label WES
SUMMARY         #length 82 #molecular-weight 9268 #checksum 238
SEQUENCE

Initial Score = 8 Optimized Score = 8 Significance = 3.68
Residue Identity = 34% Matches = 8 Mismatches = 15
Gaps = 0 Conservative Substitutions = 0

X 10 20 X 

TEFLSNYLTNVDDI TLVPETLGR 

II I I I III 

MTFPRALTVIDDNGMVISIIFWFLLIIILILLSIALLNI IKLCNVCCNLGRTVI IVPVBHAYDAYKNFHRIK 
10 20 30 40 50 60 70 

AYNPDGALLV 
80 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_11s.res made by on Thu 22 Sep 94 10:14:07-PDT.



Query sequence being compared: US-08-249-182-11 (1-23)
Number of sequences searched: 36000
Number of scores above cutoff: 4784

Results of the initial comparison of US-08-249-182-11 (1-23) with:
Data bank : Swiss-Prot 28, all entries

[Histogram of the initial comparison: NUMBER OF SEQUENCES (logarithmic axis, 5 to 10000) plotted against SCORE (0 to 7) and STDEV (-1 to 4). The plotted points are not recoverable from the scan.]



PARAMETERS

Similarity matrix        Unitary     K-tuple               2
Mismatch penalty         1           Joining penalty       20
Gap penalty              1.00        Window size           5
Gap size penalty         0.05
Cutoff score             0
Randomization group      0

Initial scores to save   40          Alignments to save    15
Optimized scores to save 0           Display context       50






SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           3       4         1.28

Times:     CPU            Total Elapsed
           00:00:53.89    00:01:00.00

Number of residues:             12496420
Number of sequences searched:   36000
Number of scores above cutoff:  4784





Cut-off raised to 3. 

Cut-off raised to 4. 

Cut-off raised to 5. 

Cut-off raised to 6. 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found. 



The list of best scores is:

                                                            Init.  Opt.
    Sequence Name  Description                       Length Score Score   Sig. Frame

               **** 4 standard deviations above mean ****
 1. HIS7_TRIHA  IMIDAZOLEGLYCEROL-PHOSPHATE D           208     9     9   4.69
 2. BPHC_PSES1  BIPHENYL-2,3-DIOL 1,2-DIOXYGE           293     9     9   4.69
 3. DCOB_SALTY  OXALOACETATE DECARBOXYLASE BE           433     9    10   4.69
 4. AMT6_BACS7  GLUCAN 1,4-ALPHA-MALTOHEXAOSI           518     9     9   4.69
 5. TCP1_CRIGR  T-COMPLEX PROTEIN 1 (TCP-1) (           556     9     9   4.69
 6. UL52_HSVSA  PROBABLE DNA REPLICATION GENE           835     9     9   4.69
 7. APCE_SYNY4  PHYCOBILISOME 120 KD LINKER P           896     9     9   4.69
 9. BXD_CLOBO   BOTULINUM NEUROTOXIN TYPE D P          1276     9     9   4.69   0
               **** 3 standard deviations above mean ****
10. VNS4_CVPRM  NONSTRUCTURAL PROTEIN 4 (XI P            82     8     8   3.91   0
11. VNS4_CVPPU  NONSTRUCTURAL PROTEIN 4 (XI P            82     8     8   3.91   0
12. REPB_STRPN  REPLICATION PROTEIN REPB.               210     8     8   3.91   0
13. HYPE_BRAJA  HYPE PROTEIN.                           321     8     8   3.91   0
14. EGF2_STRPU  EPIDERMAL GROWTH FACTOR-RELAT           325     8     8   3.91   0
15. MYB_AVIMB   MYB TRANSFORMING PROTEIN.               382     8     8   3.91   0
16. CUDP_METAN  CUTICLE-DEGRADING PROTEASE PR           388     8     8   3.91   0
17. REPA_BACSU  REPA PROTEIN.                           396     8     8   3.91   0
18. ITA5_MOUSE  FIBRONECTIN RECEPTOR ALPHA SU           409     8     8   3.91   0
19. ARSB_ECOLI  ARSENICAL PUMP MEMBRANE PROTE           429     8     9   3.91   0
20. THD1_CORGL  THREONINE DEHYDRATASE BIOSYNT           436     8    10   3.91   0
21. SPR1_YEAST  SPORULATION-SPECIFIC GLUCAN 1           445     8     8   3.91   0
22. EXG1_YEAST  GLUCAN 1,3-BETA-GLUCOSIDASE I           448     8     8   3.91   0
23. MKS1_YEAST  NEGATIVE REGULATOR OF RAS-CAM           458     8     8   3.91   0
24. US15_HCMVA  HYPOTHETICAL PROTEIN HVLF3.             484     8     8   3.91   0
25. TRPE_PSEAE  ANTHRANILATE SYNTHASE COMPONE           492     8     8   3.91   0
26. TRPE_PSEPU  ANTHRANILATE SYNTHASE COMPONE           493     8     8   3.91   0
27. AMIC_STRPN  OLIGOPEPTIDE TRANSPORT AMIC P           498     8     9   3.91   0
28. SYFD_YEAST  PHENYLALANYL-TRNA SYNTHETASE            502     8     8   3.91   0
29. TRPE_PSESS  ANTHRANILATE SYNTHASE COMPONE           505     8     8   3.91   0
30. ACHD_RAT    ACETYLCHOLINE RECEPTOR, DELTA           517     8     3   3.91   0
31. ACHD_MOUSE  ACETYLCHOLINE RECEPTOR PROTEI           520     8     8   3.91   0
32. VNS1JHSV4   NONSTRUCTURAL PROTEIN NS1 (HY           548     8     8   3.91   0
33. TCPB_MOUSE  T-COMPLEX PROTEIN 1 (TAILLESS           556     8     8   3.91   0
34. TCP1_RAT    T-COMPLEX PROTEIN 1 (TCP-1).            556     8     8   3.91   0
35. TCP1_HUMAN  T-COMPLEX PROTEIN 1 (TCP-1).            556     8     8   3.91   0
36. TCP1_DROME  T-COMPLEX PROTEIN 1 HOMOLOG (           557     8     8   3.91   0
37. MYB_CHICK   MYB PROTO-ONCOGENE PROTEIN.             641     8     8   3.91   0
38. VID4_AGRTU  VIRD4 PROTEIN PRECURSOR.                656     8     8   3.91   0
39. VID4_AGRT5  VIRD4 PROTEIN.                          665     8     8   3.91   0
40. VID4_AGRRA  VIRD4 PROTEIN PRECURSOR.                671     8     8   3.91   0



1. US-08-249-182-11 (1-23) 

HIS7_TRIHA IMIDAZOLEGLYCEROL-PHOSPHATE DEHYDRATASE (EC 4.2.1.

ID HIS7_TRIHA STANDARD; PRT; 208 AA.
AC P34041;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE IMIDAZOLEGLYCEROL-PHOSPHATE DEHYDRATASE (EC 4.2.1.19).
GN IGH.
OS TRICHODERMA HARZIANUM.
OC EUKARYOTA; FUNGI; DEUTEROMYCOTINA (IMPERFECT FUNGI).
RN [1]
RP SEQUENCE FROM N.A.
RM 93024323
RA GOLDMAN G.H., DEMOLDER J., DEWAELE S., HERRERA-ESTRELLA A.,
RA GEREMIA R.A., VAN MONTAGU M., CONTRERAS R.;
RL MOL. GEN. GENET. 234:481-488(1992).
CC -!- CATALYTIC ACTIVITY: D-ERYTHRO-1-(IMIDAZOL-4-YL)GLYCEROL
CC 3-PHOSPHATE = 3-(IMIDAZOL-4-YL)-2-OXOPROPYL PHOSPHATE + H(2)O.
CC -!- PATHWAY: SEVENTH STEP IN HISTIDINE BIOSYNTHETIC PATHWAY.
CC -!- SIMILARITY: TO OTHER IMIDAZOLEGLYCEROL-PHOSPHATE DEHYDRATASES.
DR EMBL; Z11528; TH1GPMR.
DR PIR; S26196; S26196.
KW HISTIDINE BIOSYNTHESIS; LYASE.
SQ SEQUENCE 208 AA; 22356 MW; 196910 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0



X 10 20 

TEFLSNYLTNVDD ITLVPETLG 
III I I I I II 

KGDLHIDDHHTAEDCCIAVGTTFAKALGALTGVARFGYAYAPLDEALSRAVVDLSNRPYTVVDLGLKREKLG 
90 100 110 120 130 X 140 150 

X 
R 

ELSCEH IPHCL9SFAQAAR ITLHVDCLRGDNDHHRAESAFKALAVAVRUYD 
160 170 180 190 200 
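Each line of a Swiss-Prot record starts with a two-letter line code (ID, AC, DT, DE, and so on), so a record can be grouped by code with very little logic. The sketch below is a minimal illustration, not a full parser; the function name `parse_flat_entry` is hypothetical and the sample text is a cleaned-up excerpt of the HIS7_TRIHA record above.

```python
from collections import defaultdict

def parse_flat_entry(text: str) -> dict[str, list[str]]:
    """Group the lines of a flat-file record by their two-letter line code."""
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and anything that does not start with 'XX '.
        if len(line) < 3 or not line[:2].isupper() or line[2] != " ":
            continue
        fields[line[:2]].append(line[2:].strip())
    return dict(fields)

sample = """\
ID HIS7_TRIHA STANDARD; PRT; 208 AA.
AC P34041;
DE IMIDAZOLEGLYCEROL-PHOSPHATE DEHYDRATASE (EC 4.2.1.19).
DR EMBL; Z11528; TH1GPMR.
DR PIR; S26196; S26196.
"""
entry = parse_flat_entry(sample)
print(entry["AC"])
print(entry["DR"])
```

Repeated codes such as DR are kept as a list in order of appearance, which makes it easy to pick out the PIR cross-reference that ties a Swiss-Prot hit to the corresponding hit in the PIR search.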



2. US-08-249-182-11 (1-23) 

BPHC_PSES1 BIPHENYL-2,3-DIOL 1,2-DIOXYGENASE (EC 1.13.11.39)

ID BPHC_PSES1 STANDARD; PRT; 293 AA.
AC P17297;
DT 01-AUG-1990 (REL. 15, CREATED)
DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)
DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE BIPHENYL-2,3-DIOL 1,2-DIOXYGENASE (EC 1.13.11.39) (23OHBP OXYGENASE)
DE (2,3-DIHYDROXYBIPHENYL DIOXYGENASE).
GN BPHC.
OS PSEUDOMONAS SP. (STRAIN KKS102).
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC PSEUDOMONADACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 89213965
RA KIMBARA K., HASHIMOTO T., FUKUDA M., KOANA T., TAKAGI M., OISHI M.,
RA YANO K.;
RL J. BACTERIOL. 171:2740-2747(1989).
CC -!- CATALYTIC ACTIVITY: BIPHENYL-2,3-DIOL + O(2) = 2-HYDROXY-6-OXO-
CC 6-PHENYLHEXA-2,4-DIENOATE + H(2)O.
CC -!- PATHWAY: DEGRADATION OF BIPHENYLS AND POLYCHLOROBIPHENYLS (PCB) TO
CC BENZOIC ACID AND CHLOROBENZOIC ACIDS.
CC -!- SUBUNIT: HOMOOCTAMER.
CC -!- COFACTOR: REQUIRES FERROUS IRON AS THE PROSTHETIC GROUP.
CC -!- SIMILARITY: WITH OTHER EXTRADIOL RING-CLEAVAGE DIOXYGENASES.
DR EMBL; H26433; N26433.
DR PIR; A32312; DAPSPC.
DR PROSITE; PS00082; EXTRADIOL_DIOXYGENAS.
KW OXIDOREDUCTASE; DIOXYGENASE; AROMATIC HYDROCARBONS CATABOLISM; IRON.
SQ SEQUENCE 293 AA; 32244 MW; 433312 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14
Gaps = 0 Conservative Substitutions = 0

X 10 20 

TEFLSNYLTNVDD ITLVPETLG 

II I I II III 
LPLEIYYGPAEIFHEPFLPSAPVSGFVTGDQGIGHFVRCVPDTAKAHAFYTEVLGFVLSDIIDI9HGPETSV 
120 130 140 150 160 X 170 180 

X 
R 

P AHFLHCNGRHHT I ALA AFP I PKR I HHFHLQANT I DDVGYAFDRLDAAGRI 
X 190 200 210 220 230 



3. US-08-249-182-11 (1-23) 

DCOB_SALTY OXALOACETATE DECARBOXYLASE BETA CHAIN (EC 4.1.1.3)

ID DCOB_SALTY STANDARD; PRT; 433 AA.

AC Q03031; 

DT 01-JUL-1993 (REL. 26, CREATED) 

DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE OXALOACETATE DECARBOXYLASE BETA CHAIN (EC 4.1.1.3). 

GN OADB. 

OS SALMONELLA TYPHIMURIUM.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;

OC ENTEROBACTERIACEAE.

RN [1]

RP SEQUENCE FROM N.A. 

RM 93054591 

RA WOEHLKE G., WIFLING K., DIMROTH P.; 

RL J. BIOL. CHEM. 267:22798-22803(1992).

CC -!- FUNCTION: LYASE AND SODIUM TRANSPORTER.

CC -!- CATALYTIC ACTIVITY: OXALOACETATE = PYRUVATE + CO(2).

CC -!- SUBUNIT: COMPOSED OF THREE CHAINS (ALPHA, BETA, AND GAMMA).

CC -!- COFACTOR: REQUIRES A SODIUM ION.

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE-BOUND. 

DR EMBL; M96434; STOADGABA. 

DR PIR; C44465; C44465. 

KW DECARBOXYLASE; TRANSMEMBRANE; LYASE; SODIUM TRANSPORT.

FT TRANSMEM   22   45  I (BY SIMILARITY).

FT TRANSMEM  121  145  II (BY SIMILARITY).

FT TRANSMEM  159  186  III (6-HELIX MODEL) (BY SIMILARITY).

FT TRANSMEM  159  180  IIIA (7-HELIX MODEL) (BY SIMILARITY).

FT TRANSMEM  193  213  IIIB (7-HELIX MODEL) (BY SIMILARITY).

FT TRANSMEM  214  241  IV (6-HELIX MODEL) (BY SIMILARITY).

FT TRANSMEM  218  241  IV (7-HELIX MODEL) (BY SIMILARITY).

FT TRANSMEM  266  293  V (BY SIMILARITY).

SQ SEQUENCE 433 AA; 44923 MW; 956333 CN;

Initial Score = 9 Optimized Score = 10 Significance = 4.69
Residue Identity = 44% Matches = 11 Mismatches = 12

Gaps = 2 Conservative Substitutions = 0

X 10 20 

TEF--LSNYLTNVDDITLVPET
I I II I I I I I 

FPVVLLMLVALLLPDAAPLLGHFCFGNLMRESGVVERLSDTVQNGLINIVTIFLGLSVGAKLVADKFLQPQT 
270 280 290 300 310 320 330 



X 

LGR 



LG I LLLGVI AFG I GTAAGVLM AKLLNLCSKNK I NPL IGSAGVSAVPMAARVSN 
340 X 350 360 370 380 390 



4. US-08-249-182-11 (1-23) 

AMT6_BACS7 GLUCAN 1,4-ALPHA-MALTOHEXAOSIDASE PRECURSOR (EC 3.

ID AMT6_BACS7 STANDARD; PRT; 518 AA.
AC P19571; 

DT 01-FEB-1991 (REL. 17, CREATED) 

DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE) 

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE) 

DE GLUCAN 1,4-ALPHA-MALTOHEXAOSIDASE PRECURSOR (EC 3.2.1.98) (G6-AMYLASE)
DE (MALTOHEXAOSE-PRODUCING AMYLASE) (EXO-MALTOHEXAOHYDROLASE).
OS BACILLUS SP. (STRAIN 707).

OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.

RN [1]

RP SEQUENCE FROM N.A., AND SEQUENCE OF 34-36.
RM 88162514

RA TSUKAMOTO A., KIMURA K., ISHII Y., TAKANO T., YAMANE K.;

RL BIOCHEM. BIOPHYS. RES. COMMUN. 151:25-31(1988).

CC -!- CATALYTIC ACTIVITY: HYDROLYSIS OF 1,4-ALPHA-D-GLUCOSIDIC LINKAGES
CC IN AMYLACEOUS POLYSACCHARIDES SO AS TO REMOVE SUCCESSIVE

CC MALTOHEXAOSE RESIDUES FROM THE NON-REDUCING CHAIN ENDS. 

CC -!- SUBCELLULAR LOCATION: EXTRACELLULAR. 

CC -!- PATHWAY: DEGRADATION OF STARCH. 

CC -!- SIMILARITY: BELONGS TO FAMILY 13 OF GLYCOSYL HYDROLASES, ALSO 
CC KNOWN AS THE ALPHA-AMYLASE FAMILY. 

DR EMBL; M18862; BSAMYG6.

DR PIR; A27705; A27705.

KW HYDROLASE; GLYCOSIDASE; CARBOHYDRATE METABOLISM; SIGNAL.

FT SIGNAL 1 33

FT CHAIN 34 518 MALTOHEXAOSE-PRODUCING AMYLASE.

SQ SEQUENCE 518 AA; 5900? MW; 1397695 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 

X 10 20 X 

TEFLSN YLTNVDD I TLVPETLGR 

111 I II I II 
NKMRTGKKGFLSILLAFLLVITSIPFTLVDVEAHHNGTNGTMNQYFEWYLPNDGNHWNRLNSDASNLKSKGI 
10 20 30 40 50 60 70 

TAVHIPPA 
80 



5. US-08-249-182-11 (1-23) 

TCP1_CRIGR T-COMPLEX PROTEIN 1 (TCP-1) (65 KD ANTIGEN).

ID TCP1_CRIGR STANDARD; PRT; 556 AA.

AC P18279;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)

DE T-COMPLEX PROTEIN 1 (TCP-1) (65 KD ANTIGEN).

OS CRICETULUS GRISEUS (CHINESE HAMSTER).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.

RN [1] 

RP SEQUENCE FROM N.A. 

RM 91027940 

RA AHMAD S., GUPTA R.S.; 

RL BIOCHIM. BIOPHYS. ACTA 1087:253-255(1990).

CC -!- FUNCTION: MOLECULAR CHAPERONE. KNOWN TO PLAY A ROLE, IN VITRO,
CC IN THE FOLDING OF ACTIN AND TUBULIN.

CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC.

CC -!- SUBUNIT: FORMS TWO STACKED RINGS, 12 TO 16 NM IN DIAMETER;

CC ASSOCIATED WITH OTHER PROTEINS IN 850 KD TO 900 KD COMPLEX. 

CC -!- SIMILARITY: TO OTHER MEMBERS OF TCP-1 CHAPERONIN FAMILY. 

DR EMBL; M34665; CGTCP1A. 

DR PIR; S13163; S13163.

DR PROSITE; PS00750; TCP1_1.

DR PROSITE; PS00751; TCP1_2.

KW CHAPERONE.

SQ SEQUENCE 556 AA; 60338 MW; 1473942 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG 



II I I I I III 
RAFHNEAQVNPERKNLKWIGLDLINGKPRDNKQAGVFEPTIVKVKSLKFATEAAITILRIDDLIKLHPETKD 
470 480 490 500 510 520 530 

X 
R 

DKHGSYENAVHSGALDD 
540 550 



6. US-08-249-182-11 (1-23) 

UL52_HSVSA PROBABLE DNA REPLICATION GENE 56 PROTEIN. 

ID UL52_HSVSA STANDARD; PRT; 835 AA.

AC P14346;

DT 01-JAN-1990 (REL. 13, CREATED) 

DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE) 

DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE) 

DE PROBABLE DNA REPLICATION GENE 56 PROTEIN. 

GN 56 OR EDRF4.

OS HERPESVIRUS SAIMIRI (STRAIN 11).

OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE; GAMMAHERPESVIRINAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 92333688

RA ALBRECHT J.-C., NICHOLAS J., BILLER D., CAMERON K.R., BIESINGER B.,

RA NEWMAN C., WITTMANN S., CRAXTON M.A., COLEMAN H., FLECKENSTEIN B.,

RA HONESS R.W.;

RL J. VIROL. 66:5047-5058(1992).

RN [2]

RP SEQUENCE FROM N.A.

RM 92230228

RA NICHOLAS J., CAMERON K.R., COLEMAN H., NEWMAN C., HONESS R.W.;

RL VIROLOGY 188:296-310(1992).

RN [3]

RP SEQUENCE OF 631-835 FROM N.A.

RM 88300875

RA NICHOLAS J., GOMPELS U.A., CRAXTON M.A., HONESS R.W.;

RL J. VIROL. 62:3250-3257(1988). 

CC -!- FUNCTION: PROBABLY INVOLVED IN DNA REPLICATION. 

CC -!- SIMILARITY: BELONGS TO FAMILY THAT GROUPS TOGETHER HSV-1 UL52, 

CC EHV-1 7, EBV BSLF1, HVS-1 56, HCMV UL70 AND VZV 6. 

DR EMBL; X64346; HSGEND.

DR EMBL; M86409; HEHSV3PRG.

DR EMBL; H21943; HEHSS52K.

DR PIR; 136811; QQBEHA.

KW DNA REPLICATION.

SQ SEQUENCE 835 AA; 96127 MW; 3760278 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDI TLVPETLG 

hi i i mi 

NYCHIKLARDSLESQAIDTSIDTLRGQLMLSNQDLVHYIYLSFFSCLNKDIFIKYSHLTNSDNIHFVPETEV 
240 250 260 270 280 X 290 300 

X 
R 

LAQSLDENFRKDMLTYYNKSTYLKTYITHKCIHLPDLIGYAPQDCTSFVYH 
X 310 320 330 340 350 



7. US-08-249-182-11 (1-23) 

APCE_SYNY4 PHYCOBILISOME 120 KD LINKER POLYPEPTIDE, CORE (L-C

ID APCE_SYNY4 STANDARD; PRT; 896 AA.

AC Q02907;

DT 01-JUL-1993 (REL. 26, CREATED) 

DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE PHYCOBILISOME 120 KD LINKER POLYPEPTIDE, CORE (L-CM 92) (CORE-MEMBRANE

DE LINKER PROTEIN).

OS SYNECHOCYSTIS SP. (STRAIN PCC 6714).

OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA;

OC CYANOBACTERIA (BLUE-GREEN ALGAE); CHROOCOCCALES.

RN [1]

RP SEQUENCE FROM N.A.

RM 93222481

RA DIMAGNO L.N., HASELKORN R.;

RL PLANT MOL. BIOL. 21:835-846(1993).

CC -!- FUNCTION: THIS PROTEIN IS POSTULATED TO ACT BOTH AS TERMINAL
CC ENERGY ACCEPTOR (BY ITS PHYCOBILIN-LIKE DOMAINS) AND AS A LINKER

CC POLYPEPTIDE (BY ITS REPEATS AND ARMS) THAT STABILIZES THE

CC PHYCOBILISOME CORE ARCHITECTURE.

CC -!- SUBUNIT: PHYCOBILISOMES OF THIS ORGANISM ARE COMPOSED OF A TWO
CC CYLINDER CORE, FROM WHICH SIX RODS RADIATE. THE CORE IS MAINLY 

CC COMPOSED OF ALLOPHYCOCYANIN ALPHA AND BETA CHAINS AND OF MINOR 

CC COMPONENTS. 

CC -!- SUBCELLULAR LOCATION: ANCHORS THE PHYCOBILISOME PERPENDICULARLY

CC TO THE STROMAL SURFACE OF THE THYLAKOID MEMBRANE.

CC -!- SIMILARITY: THE REPEATED DOMAINS ARE SIMILAR TO THE N-TERMINAL

CC REGIONS OF PHYCOCYANIN ROD LINKER POLYPEPTIDES. 

CC -!- SIMILARITY: THE PHYCOBILIN-LIKE DOMAINS ARE SIMILAR TO PHYCOBILINS 

CC FROM VARIOUS SPECIES. 

DR EMBL; L02309; SSLCM. 

KW PHYCOBILISOME; ELECTRON TRANSPORT; PHOTOSYNTHESIS; REPEAT.

SQ SEQUENCE 896 AA; 100460 MW; 4106962 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDDITLVPETLG 
II I I II I II 

ISSYN8ILAS8GLKAF1GAMVNGMEYLQTFGEDTVPYRRFPTLPAANFPNTERLYNKLTKSDKELVVPSFTP 
820 830 840 850 860 870 880 890 

X 
R 

VVKVGG 
X 



8. US-08-249-182-11 (1-23) 

APCE_FREDI PHYCOBILISOME 120 KD LINKER POLYPEPTIDE, CORE (L-C

ID APCE_FREDI STANDARD; PRT; 1079 AA.

AC P16566;

DT 01-AUG-1990 (REL. 15, CREATED)

DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)

DE PHYCOBILISOME 120 KD LINKER POLYPEPTIDE, CORE (L-CM 92) (CORE-MEMBRANE

DE LINKER PROTEIN).

GN APCE. 

OS FREMYELLA DIPLOSIPHON (CALOTHRIX PCC 7601). 



OC CYANOBACTERIA (BLUE-GREEN ALGAE) ; NOSTOCALES. 

RN [1] 

RP SEQUENCE FROM N.A. 

RM 90192765 

RA HOUMARD J., CAPUANO V., COLOMBANO M.V., COURSIN T., DE MARSAC N.;

RL PROC. NATL. ACAD. SCI. U.S.A. 87:2152-2156(1990).

CC -!- FUNCTION: THIS PROTEIN IS POSTULATED TO ACT BOTH AS TERMINAL

CC ENERGY ACCEPTOR (BY ITS PHYCOBILIN-LIKE DOMAINS) AND AS A LINKER

CC POLYPEPTIDE (BY ITS REPEATS AND ARMS) THAT STABILIZES THE 

CC PHYCOBILISOME CORE ARCHITECTURE. 

CC -!- SUBUNIT: PHYCOBILISOMES OF THIS ORGANISM ARE COMPOSED OF A TWO 

CC CYLINDER CORE, FROM WHICH SIX RODS RADIATE. THE CORE IS MAINLY 

CC COMPOSED OF ALLOPHYCOCYANIN ALPHA AND BETA CHAINS, AND OF THREE 

CC MINOR COMPONENTS: THE ALLOPHYCOCYANIN ALPHA-B CHAIN, A 18.3 KD 

CC POLYPEPTIDE, AND THE ANCHOR POLYPEPTIDE LCM. 

CC -!- SUBCELLULAR LOCATION: ANCHORS THE PHYCOBILISOME PERPENDICULARLY 

CC TO THE STROMAL SURFACE OF THE THYLAKOID MEMBRANE. 

CC -!- SIMILARITY: THE REPEATED DOMAINS ARE SIMILAR TO THE N-TERMINAL 

CC REGIONS OF PHYCOCYANIN ROD LINKER POLYPEPTIDES. 

CC -!- SIMILARITY: THE PHYCOBILIN-LIKE DOMAINS ARE SIMILAR TO PHYCOBILINS 

CC FROM VARIOUS SPECIES. 

DR EMBL; M20806; FDAPCA. 

DR PIR; A35088; A35088.

KW PHYCOBILISOME; ELECTRON TRANSPORT; PHOTOSYNTHESIS; REPEAT.

FT INIT_MET     0     0

FT DOMAIN      17    75  PHYCOBILIN-LIKE.

FT DOMAIN      76   143  PHYCOBILIN-LIKE LOOP.

FT DOMAIN     144   236  PHYCOBILIN-LIKE.

FT DOMAIN     237   284  ARM 1 (SPACING SEQUENCE).

FT REPEAT     285   409  I.

FT DOMAIN     410   546  ARM 2 (SPACING SEQUENCE).

FT REPEAT     547   669  II.

FT DOMAIN     670   743  ARM 3 (SPACING SEQUENCE).

FT REPEAT     744   869  III.

FT DOMAIN     870   953  ARM 4 (SPACING SEQUENCE).

FT REPEAT     954  1079  IV.

SQ SEQUENCE 1079 AA; 120325 MW; 5971605 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTN VDD I TLVPETLG 
II I I II I II 

IRKYNQILATQGIRAFIGALVSSAEYAEVFGEDTVPYRRYPTLPAANFPNTEKLYNQLTKQNDDLVVPSFKT 
830 840 850 860 870 X 880 890 

X 
R 

VQPRLTLAGTSSSGRNGFTDLGRSSTSAQGQLGETANRCKPARIYRLSGTN 
900 910 920 930 940 



9. US-08-249-182-11 (1-23) 

BXD_CLOBO BOTULINUM NEUROTOXIN TYPE D PRECURSOR (EC 3.4.24.-

ID BXD_CLOBO STANDARD; PRT; 1276 AA.

AC P19321;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)

DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)

DE BOTULINUM NEUROTOXIN TYPE D PRECURSOR (EC 3.4.24.-) (BONT/D).

GN BOTD.

OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BVD/-3; 

RM 71016853 

RA BINZ T., KURAZONO H., POPOFF M.R., EKLUND M.W., SAKAGUCHI G.,

RA KOZAKI S., KRIEGLSTEIN K., HENSCHEN A., GILL D.M., NIEMANN H.;

RL NUCLEIC ACIDS RES. 18:5556-5556(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CB16; 

RM 93042276 

RA SUNAGAWA H., OHYAMA T., WATANABE T., INOUE K.;

RL J. VET. MED. SCI. 54:905-913(1992). 

RN [3] 

RP PARTIAL SEQUENCE. 

RC STRAIN=D-SA, AND D-1873; 

RM 89339741 

RA MORIISHI K., SYUTO B., KUBO S., OGUMA K.;

RL INFECT. IMMUN. 57:2886-2891(1989).

CC -!- FUNCTION: BOTULINUS TOXIN ACTS BY INHIBITING NEUROTRANSMITTER 
CC RELEASE. IT BINDS TO PERIPHERAL NEURONAL SYNAPSES, IS INTERNALIZED 

CC AND MOVES BY RETROGRADE TRANSPORT UP THE AXON INTO THE SPINAL CORD 

CC WHERE IT CAN MOVE BETWEEN POSTSYNAPTIC AND PRESYNAPTIC NEURONS. IT

CC INHIBITS NEUROTRANSMITTER RELEASE BY ACTING AS A ZINC 

CC ENDOPEPTIDASE THAT CLEAVES SYNAPTOBREVIN-2. 

CC -!- SUBUNIT: DISULFIDE-LINKED HETERODIMER OF A LIGHT CHAIN (L) AND
CC A HEAVY CHAIN (H). THE LIGHT CHAIN HAS THE PHARMACOLOGICAL

CC ACTIVITY, WHILE THE N- AND C-TERMINAL OF THE HEAVY CHAIN MEDIATE

CC CHANNEL FORMATION AND TOXIN BINDING, RESPECTIVELY. 

CC -!- SUBCELLULAR LOCATION: SECRETED. 

CC -!- THERE ARE SEVEN ANTIGENICALLY DISTINCT FORMS OF BOTULINUM 

CC NEUROTOXIN: TYPES A, B, C1, D, E, F, AND G.

CC -!- BOTULINUM TYPE D NEUROTOXIN IS SYNTHESISED BY D STRAIN OF 

CC CLOSTRIDIUM BOTULINUM WHICH CARRY THE APPROPRIATE BACTERIOPHAGE. 

CC -!- SIMILARITY: HIGH WITH OTHER BOTULINUM NEUROTOXINS AND WITH TETANUS

CC NEUROTOXIN.

CC -!- SIMILARITY: TO OTHER ZINC METALLOPROTEINASES IN THE ACTIVE SITE

CC REGION. 

DR EMBL; X54254; CBNTTD.

DR EMBL; S49407; S49407.

DR PIR; S11455; S11455.

DR PROSITE; PS00142; ZINC_PROTEASE.

KW NEUROTOXIN; TRANSMEMBRANE; HYDROLASE; METALLOPROTEASE; ZINC.

FT CHAIN        1   442  BOTULINUM NEUROTOXIN D, LIGHT-CHAIN.

FT CHAIN      443  1260  BOTULINUM NEUROTOXIN D, HEAVY-CHAIN.

FT METAL      229   229  ZINC (CATALYTIC) (BY SIMILARITY).

FT ACT_SITE   230   230  BY SIMILARITY.

FT METAL      233   233  ZINC (CATALYTIC) (BY SIMILARITY).

FT DISULFID   437   450  INTERCHAIN (PROBABLE).

FT VARIANT     15    16  ND -> PV (IN STRAIN D-SA).

FT VARIANT     17    18  ND -> LQ (IN STRAIN D-1873).

FT VARIANT    452   452  K -> 8 (IN STRAIN D-SA).

FT VARIANT    457   457  R -> T (IN STRAIN D-SA).

FT VARIANT    457   457  R -> F (IN STRAIN D-1873).

FT VARIANT    462   462  A -> D (IN STRAIN D-1873).

FT VARIANT    489   489  K -> N (IN STRAIN CB16).

FT VARIANT    644   644  N -> K (IN STRAIN CB16).

FT VARIANT   1122  1122  8 -> R (IN STRAIN CB16).

SQ SEQUENCE 1276 AA; 146871 MW; 8705534 CN;

Initial Score = 9 Optimized Score = 9 Significance = 4.69
Residue Identity = 39% Matches = 9 Mismatches = 14

Gaps = 0 Conservative Substitutions = 0


TEFLSNYLTNVDDITLVPETLG 
III I I I I II 

DG9VP I NPE I VDPLLPNVNHEPLNLPGEE I VFYDDI TKYVDYLNSYYYLES9KLSNNVENI TLTTSVEE ALG 
500 510 520 530 540 550 560 

X 
R 

YSNKIYTFLPSLAEKVNKGV8AGLFLNWANEVVEDFTTNINKKDTLDKISD 
570 580 590 600 610 620 



10. US-08-249-182-11 (1-23) 

VNS4_CVPRM NONSTRUCTURAL PROTEIN 4 (XI PROTEIN) (ORF 3).

ID VNS4_CVPRM STANDARD; PRT; 82 AA.

AC P24415;

DT 01-MAR-1992 (REL. 21, CREATED)

DT 01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE)

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 

DE NONSTRUCTURAL PROTEIN 4 (XI PROTEIN) (ORF 3). 

GN NS4. 

OS PORCINE RESPIRATORY CORONAVIRUS (STRAIN RM4) (PRCV), AND PORCINE

OS RESPIRATORY CORONAVIRUS (STRAIN 86/137004 / BRITISH ISOLATE) (PRCV). 

OC VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; CORONAVIRIDAE. 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=RM4;

RM 91073120

RA RASSCHAERT D., DUARTE M., LAUDE H.;

RL J. GEN. VIROL. 71:2599-2607(1990).

RN [2]

RP SEQUENCE FROM N.A. 

RC STRAIN=86/137004; 

RM 92116634 

RA BRITTON P., MAWDITT K.L., PAGE K.W.;

RL VIRUS RES. 21:181-198(1991).

DR EMBL; X60056; PRCVORFS.

DR PIR; C36607; C36607.

DR PIR; S21309; S21309.

DR PIR; S24280; S24280.

KW NONSTRUCTURAL PROTEIN.

SQ SEQUENCE 82 AA; 9268 MW; 34513 CN;

Initial Score = 8 Optimized Score = 8 Significance = 3.91
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

II I I I III 

NTFPRALT V I DDNGH V I S 1 1 FUFLLI I IL I LLS I ALLN 1 1 KLCH VCCNLGRTVI I VPVQHAYDAYKNFHRI K 
10 20 30 40 50 60 70 

AYNPDGALLV 
80 



11. US-08-249-182-11 (1-23) 

VNS4_CVPPU NONSTRUCTURAL PROTEIN 4 (XI PROTEIN) (ORF 3).

ID VNS4_CVPPU STANDARD; PRT; 82 AA.

AC P09048;

DT 01-NOV-1988 (REL. 09, CREATED)

DT 01-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE)



DE NONSTRUCTURAL PROTEIN 4 (XI PROTEIN) (ORF 3). 

GN NS4. 

OS PORCINE TRANSMISSIBLE GASTROENTERITIS CORONAVIRUS (STRAIN PURDUE). 

OC VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; CORONAVIRIDAE.

RN [1]

RP SEQUENCE FROM N.A. 

RM S807S100 

RA RASSCHAERT D., GELFI J., LAUDE H.; 

RL BIOCHIMIE 69:591-600(1987).

DR EMBL; X06371; C0TGEV3.

DR PIR; S01741; S01741.

DR PIR; S04891; S04891.

KW NONSTRUCTURAL PROTEIN.

SQ SEQUENCE 82 AA; 9239 MW; 31619 CN;

Initial Score = 8 Optimized Score = 8 Significance = 3.91
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0

X 10 20 X 

TEFLSNYLTNVDD I TLVPETLGR 

II I I I III 

MTFPRALTVIDDNGMVINI IFWFLLI I ILILLSI ALLNI IKLCMVCCNLGRTVI I VPA9HAYDAYKNFMRIK 
10 20 30 40 50 60 70 

AYNPDGALLA 
80 



12. US-08-249-182-11 (1-23) 

REPB_STRPN REPLICATION PROTEIN REPB.

ID REPB_STRPN STANDARD; PRT; 210 AA.

AC P13921;

DT 01-JAN-1990 (REL. 13, CREATED) 

DT 01-JAN-1990 (REL. 13, LAST SEQUENCE UPDATE) 

DT 01-MAY-1991 (REL. 18, LAST ANNOTATION UPDATE) 

DE REPLICATION PROTEIN REPB. 

GN REPB. 

OS STREPTOCOCCUS PNEUMONIAE, AND STREPTOCOCCUS AGALACTIAE. 

OG PLASMID PLS1, AND PLASMID PMV158.

OC PROKARYOTA; FIRMICUTES; COCCI; STREPTOCOCCACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC PLASMID=PLS1;

RM 87226167

RA LACKS S.A., LOPEZ P., GREENBERG B., ESPINOSA M.;

RL J. MOL. BIOL. 192:753-765(1986).

RN [2]

RP SEQUENCE FROM N.A.

RC PLASMID=PMV158;

RM 90016790

RA VAN DER LELIE D., BRON S., VENEMA G., OSKAM L.;

RL NUCLEIC ACIDS RES. 17:7283-7294(1989).

CC -!- FUNCTION: IS ESSENTIAL FOR PLASMID REPLICATION. NICKS THE POSITIVE 
CC STRAND AT THE PLUS ORIGIN OF REPLICATION. 

CC -!- SIMILARITY: WITH REPLICATION PROTEINS FROM OTHER GRAM+ BACTERIAL

CC PLASMIDS, REPLICATING WITH THE ROLLING-CIRCLE MECHANISM.

DR EMBL; X15669; SAREPAB.

DR EMBL; H29725; PPCG1.

DR PIR; B25599; B25599.

DR PIR; S05981; S05981.

KW PLASMID; DNA REPLICATION; TOPOISOMERASE.

SQ SEQUENCE 210 AA; 24250 MW; 229833 CN;

Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 



X 10 20 X 

TEFLSNYLTNVDDITLVPETLGR 

ii i i mi 

HAKEKARYFTFLLYPESIPSDUELKLETLGVPHAISPLHDKDKSSIKGGKYKKAHYHVLYIAKNPVTADSVR 
10 20 30 40 50 60 70 

KKIKLLLGE 
80 



13. US-08-249-182-11 (1-23) 
HYPE_BRAJA HYPE PROTEIN.

ID HYPE_BRAJA STANDARD; PRT; 321 AA.

AC P31906; 

DT 01-JUL-1993 (REL. 26, CREATED) 

DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)

DE HYPE PROTEIN.

GN HYPE.

OS BRADYRHIZOBIUM JAPONICUM.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;

OC RHIZOBIACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=CB1809;

RM 93287991

RA VAN SOOM C., VERRETH C., SAMPAIO M.J., VANDERLEYDEN J.;

RL MOL. GEN. GENET. 239:235-238(1993).

CC -!- SIMILARITY: BELONGS TO THE HYPE FAMILY.

DR EMBL; Z17373; BJHYPHOX.

DR PIR; S28642; S28642.

SQ SEQUENCE 321 AA; 33581 MW; 463510 CN;

Initial Score = 8 Optimized Score = 8 Significance = 3.91
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSNYLTNVDD I TLVPETLG 
II III III 

FADLKL I AESHG AAAREAD VH I I TGDTK VVERGKADGLF I STAGVG VVPDGLDLSAEKAR VGDRVL I SGTLG 
90 100 110 120 130 140 150 

X 
R 

DHGVA I HSKRQNLAFETE I VSDSASLHDL VARMVQAGGRG IRLHRDPTRGG 
160 170 180 190 200 



14. US-08-249-182-11 (1-23) 

EGF2_STRPU EPIDERMAL GROWTH FACTOR-RELATED PROTEIN PRECURSOR

ID EGF2_STRPU STANDARD; PRT; 325 AA.

AC P15216; 

DT 01-APR-1990 (REL. 14, CREATED) 

DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE) 

DT 01-APR-1990 (REL. 14, LAST ANNOTATION UPDATE) 

DE EPIDERMAL GROWTH FACTOR-RELATED PROTEIN PRECURSOR (SPEGF2).

OS STRONGYLOCENTROTUS PURPURATUS (PURPLE SEA URCHIN).

OC EUKARYOTA; METAZOA; ECHINODERMATA; ECHINOZOA; ECHINOIDEA.



tutinjnuiutft. 

RN [1] 

RP SEQUENCE FROM N.A.

RM 90049203

RA YANG Q., ANGERER L.M., ANGERER R.C.;

RL SCIENCE 246:806-808(1989).

CC -!- SIMILARITY: THE PROTEIN INCLUDES 4 EGF-LIKE REPEATS.

CC -!- SIMILARITY: EACH OF THE EGF-LIKE REPEAT IS VERY SIMILAR TO THE

CC EXOGASTRULA-INDUCING PEPTIDES FROM THE SEA URCHIN ANTHOCIDARIS

CC CRASSISPINA.

DR EMBL; M29004; SPEGF2.





KW EGF-LIKE DOMAIN; REPEAT; SIGNAL.

FT SIGNAL       1    17  POTENTIAL.

FT CHAIN       18   325  EPIDERMAL GROWTH FACTOR-RELATED PROTEIN.

FT DOMAIN      47   311  4 X EGF-TYPE REPEATS.

FT REPEAT      47   105  EGF-LIKE 1.

FT REPEAT     106   176  EGF-LIKE 2.

FT REPEAT     177   248  EGF-LIKE 3.

FT REPEAT     249   311  EGF-LIKE 4.

SQ SEQUENCE 325 AA; 36898 MW; 486695 CN;

Initial Score = 8 Optimized Score = 8 Significance = 3.91
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 



X 10 20 

TEFLSNYLTNVDDITLVPETLG 
llll I II 

NRCLSDTSNCDGHGIC8LSTFGRNERYICFCALGFRNNNYGGCSPYTPREIEFLSYVARDLELEMLTRDSLG 
110 120 130 140 150 160 170 180 



RCKSDTHNCDEAG8CVTKT YGRYAGE Y I CVCNHGYRNNA YGGCSPMTTRE I 
X 190 200 210 220 230 



15. US-08-249-182-11 (1-23) 

MYB_AVIMB MYB TRANSFORMING PROTEIN.

ID MYB_AVIMB STANDARD; PRT; 382 AA.

AC P01104;

DT 21-JUL-1986 (REL. 01, CREATED)

DT 01-MAR-1989 (REL. 10, LAST SEQUENCE UPDATE)

DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)

DE MYB TRANSFORMING PROTEIN. 

GN V-MYB. 

OS AVIAN MYELOBLASTOSIS VIRUS. 

OC VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; RETROVIRIDAE;

OC ONCOVIRINAE.

RN [1]

RP SEQUENCE FROM N.A. 

RM 83129359 

RA KLEMPNAUER K.-H., GONDA T.J., BISHOP J.M.;

RL CELL 31:453-463(1982).

RN [2]

RP SEQUENCE FROM N.A.

RM 82223743

RA RUSHLOW K.E., LAUTENBERGER J.A., PAPAS T.S., BALUDA M.A., PERBAL B.,

RA CHIRIKJIAN J.G., REDDY E.P.; 

RL SCIENCE 216:1421-1423(1982). 

CC -!- FUNCTION: MYB IS A DNA-BINDING PROTEIN THAT SPECIFICALLY RECOGNIZE 
CC THE SEQUENCE YAAC(G/T)G. 

CC -!- DISEASE: THE V-MYB ONCOGENE TRANSFORMS IMMATURE MYELOMONOCYTIC 
CC AVIAN CELLS IN CULTURE AND INDUCES MYELOBLASTOSIS (MYELOID

CC LEUKEMIA) IN CHICKENS.

DR EMBL; J02012; RE0NVI1YB.

DR PIR; A01347; QOYV.

DR TFD; P00028; RELEASE 3.0.

DR PROSITE; PS00037; MYB_1.

DR PROSITE; PS00334; MYB_2.

KW TRANSFORMING PROTEIN; ONCOGENE; NUCLEAR PROTEIN; DNA-BINDING; REPEAT.

FT DNA_BIND 1 15 MYB (PARTIAL).

FT DNA_BIND 16 67 MYB.

FT DNA_BIND 68 118 MYB.

SQ SEQUENCE 382 AA; 43061 MW; 744659 CN;

Initial Score = 8 Optimized Score = 8 Significance = 3.91
Residue Identity = 34% Matches = 8 Mismatches = 15

Gaps = 0 Conservative Substitutions = 0 

X 10 20 

TEFLSN YLTN VDD I TLVPETLG 
II I II III 

WHSTTVADNTRTSGDNAPVSCLGEHHHCTPSPPVDHGCLPEESASPARCNIVH8SNILDNVKNLLEFAETLQ 
260 270 280 290 300 X 310 320 

X 
R 

L I DSFLNTSSNHENLNLDNPALTSTPVCGHKHSVTTPFHKD8TFTEYRKHH 
330 340 350 360 370 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_la.res made by on Thu 22 Sep 94 10:06:41-PDT.

Query sequence being compared: US-08-249-182-1 (1-5)
Number of sequences searched: 42145
Number of scores above cutoff: 4616

Results of the initial comparison of US-08-249-182-1 (1-5) with:
Data bank : A-GeneSeq 15, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) plotted against SCORE and STDEV; plot not recoverable from the scan]

PARAMETERS

Similarity matrix         Unitary    K-tuple               2
Mismatch penalty          1          Joining penalty       20
Gap penalty               1.00       Window size           5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40         Alignments to save    15
Optimized scores to save  0          Display context       50

SEARCH STATISTICS

Scores:    Mean           Median           Standard Deviation
           0              1                1.02

Times:     CPU            Total Elapsed
           00:00:26.99    00:00:37.00



Number of residues: 5287517
Number of sequences searched: 42145
Number of scores above cutoff: 4616

Cut-off raised to 2. 
Cut-off raised to 3. 



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
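
The Sig. values in the list of best scores are consistent with a z-score of the initial score against the search statistics. The following is a minimal Python sketch, assuming significance = (score - mean) / standard deviation, using the rounded mean (0) and standard deviation (1.02) printed under SEARCH STATISTICS. The exact FastDB formula is not given in this listing, so the relation is an inference, and the function name is hypothetical.

```python
def significance(initial_score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean.

    Assumption: FastDB reports significance as a plain z-score.
    """
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


# Mean and standard deviation as printed under SEARCH STATISTICS.
MEAN, STDEV = 0.0, 1.02

for score in (4, 3):
    print(score, round(significance(score, MEAN, STDEV), 2))
```

With these inputs, initial scores of 4 and 3 give 3.92 and 2.94, matching the two significance levels that appear in the list.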



A 100% identical sequence to the query sequence was not found.



The list of best scores is: 



    Sequence                                          Init.  Opt.
    Name     Description                     Length  Score  Score   Sig.  Frame

             **** 3 standard deviations above mean ****

 1. R37443   Autotaxin peptide ATX 18.            6      4      4   3.92      0
 2. P71436   Immunomodulator peptide #12 in       9      4      4   3.92      0
 3. P71435   Immunomodulator peptide #11 in       9      4      4   3.92      0
 4. P71423   Immunomodulator peptide #2 inh      21      4      4   3.92      0
 5. P71422   Immunomodulator peptide #1 inh       ?      4      4   3.92      0
 6. P71493   Antigenic peptide cross reacti       ?      4      4   3.92      0
 7. R42697   p1? of nef peptide of HIV-1,         ?      4      4   3.92      0
 8. P90958   Peptide corresp. to ?                ?      4      4   3.92      0
 9. P92270   Peptide PF1? from HIV-1 prote        ?      4      4   3.92      0
10. R42698   p17 of nef peptide of HIV-1          ?      4      4   3.92      0
11. P92271   Peptide PF17 from HIV-1 prote        ?      4      4   3.92      0
12. R11188   ?                                    ?      4      4   3.92      0
13. R38893   ? protein of HIV-1.                  ?      4      4   3.92      0
14. P61515   Sequence of ? protein.               ?      4      4   3.92      0
15. P60423   Sequence of LAV virus ORF ? p        ?      4      4   3.92      0
16. P70863   Sequence of S? subunit of Bor        ?      4      4   3.92      0
17. P96014   Pertussis toxin subunit S?.          ?      4      4   3.92      0
18. R23966   Protein sequence of plasmid p      234      4      4   3.92      0
19. R45021   Staphylococcal enterotoxin ty        ?      4      4   3.92      0
20. P92062   Sequence of Isopenicillin N s      333      4      4   3.92      0
21. R20006   Zonula occludens toxin.              ?      4      4   3.92      0
22. R44220   Threonine synthase.                481      4      4   3.92      0
23. P81325   Threonine synthetase.                ?      4      4   3.92      0
24. R10274   Simian immunodeficiency viru         ?      4      4   3.92      0
25. R32356   Excitatory amino acid receptor       ?      4      4   3.92      0
26. R27931   SPS protein.                      1068      4      4   3.92      0
27. R20198   Sucrose phosphate synthase fr     1068      4      4   3.92      0
28. R24033   Soluble mannose receptor pept     1456      4      4   3.92      0
29. R44432   eryA region polypeptide modul     2986      4      4   3.92      0
30. R10834   Rianodin receptor.                4987      4      4   3.92      0
31. R11510   Ryanodine receptor deduced fr     5072      4      4   3.92      0

             **** 2 standard deviations above mean ****

32. R28702   BAG 75 50 kD fragment N-termi        6      3      3   2.94      0
33. R14168   ACE-inhibiting hexapeptide. I        6      3      3   2.94      0
34. R43915   Pyruvate kinase conserved dom        8      3      3   2.94      0
35. R07942   TNF binding protein fraction         9      3      3   2.94      0
36. R44069   Pulmonary surfactant protein        10      3      3   2.94      0
37. R44065   Pulmonary surfactant protein        13      3      3   2.94      0
38. R28701   BAG 75 N-terminal.                  13      3      3   2.94      0
39. P70837   Sequence encoded by the Beta-       14      3      3   2.94      0
40. R34888   Human TSH residues 61-75.           15      3      3   2.94      0



1. US-08-249-182-1 (1-5) 

R37443 Autotaxin peptide ATX 18. 

ID R37443 standard; peptide; 6 AA. 

AC R37443; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 18. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 



DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer

PT diagnosis and therapy

PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 18. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 6 AA;

SQ 2 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 1 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 0 S; 0 T; 1 W; 0 Y; 1 V;
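
The two SQ composition lines give a per-residue count for the entry. The following is a short Python sketch that recomputes such a table from a sequence string. The residue order is copied from the SQ lines; the function name and the error handling are hypothetical, and the example sequence is the six-residue R37443 peptide as read from this entry.

```python
from collections import Counter

# Residue order as it appears on the two SQ composition lines.
RESIDUE_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence: str) -> dict[str, int]:
    """Count each residue code, reporting zero for codes that are absent."""
    counts = Counter(sequence.upper())
    unknown = set(counts) - set(RESIDUE_ORDER)
    if unknown:
        raise ValueError(f"unexpected residue codes: {sorted(unknown)}")
    return {code: counts.get(code, 0) for code in RESIDUE_ORDER}


# Six-residue peptide of entry R37443, read as WHVAAN.
comp = composition("WHVAAN")
print("; ".join(f"{n} {code}" for code, n in comp.items()))
```

The nonzero counts for this peptide are 2 A, 1 N, 1 H, 1 W, and 1 V, which sum to the stated length of 6 AA.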



Initial Score = 4 Optimized Score = 4 Significance = 3.92
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X   X
WHVAR

||||

WHVAAN
X   X



2. US-08-249-182-1 (1-5) 

P71436 Immunomodulator peptide #12 inhibits HIV-T4 intera

ID P71436 standard; Protein; 9 AA.

AC P71436;

DT 03-MAY-1991 (first entry)

DE Immunomodulator peptide #12 inhibits HIV-T4 interaction.

KW AIDS; T4 cell receptor; immunomodulation.

OS Synthetic. 

PN WO8703601-A.

PD 18-JUN-1987.

PF 08-DEC-1986; 402717.

PR 06-DEC-1985; FR-018155.

PA (INSP ) INST PASTEUR.

PA (AUFR/) AUFRAY C.

PI Auffray C, Montagnier L, Klatzmann D, Charron D;

DR WPI; 87-177935/25.

PT New peptide derivs. contg. specified exposed tetra:peptide

PT sequences - inhibiting interaction of AIDS virus with T4 cell

PT receptors

PS Claim 18; Page 50; 57pp; French.

CC The peptide corresponds to the conserved sequence immediately

CC after the RFDS peptide motif of the Type II HLA antigen of ARV2.

CC It is used to produce monoclonal antibodies specific to the peptide. 

CC See also P71422-P71435 and P71437. 

SQ Sequence 9 AA;

SQ 2 A; 1 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 2 H;

SQ 0 I; 1 L; 1 K; 0 M; 1 F; 0 P; 0 S; 0 T; 0 W; 0 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.92
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

    X   X
    WHVAR

     ||||

KLAFHHVAR
    X   X



3. US-08-249-182-1 (1-5) 

P71435 Immunomodulator peptide #11 inhibits HIV-T4 intera 

ID P71435 standard; Protein; 9 AA. 

AC P71435; 

DT 03-MAY-1991 (first entry) 

DE Immunomodulator peptide #11 inhibits HIV-T4 interaction. 

KW AIDS; T4 cell receptor; immunomodulation. 

OS Synthetic. 

PN WO8703601-A. 

PD 18-JUN-1987. 

PF 08-DEC-1986; 402717. 

PR 06-DEC-1985; FR-018155. 

PA (INSP ) INST PASTEUR. 

PA (AUFR/) AUFRAY C. 

PI Auffray C, Montagnier L, Klatzmann D, Charron D; 

DR WPI; 87-177935/25. 

PT New peptide derivs. contg. specified exposed tetra:peptide 

PT sequences - inhibiting interaction of AIDS virus with T4 cell 

PT receptors 

PS Claim 18; Page 50; 57pp; French. 

CC The peptide corresponds to the conserved sequence immediately 

CC after the RFDS peptide motif of the Type II HLA antigens of 

CC LV and LAV1a. It is used to produce monoclonal antibodies specific 

CC to the peptide. 

CC See also P71422-P71434 and P71436-7. 

SQ Sequence 9 AA; 

SQ 2 A; 2 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 2 H; 

SQ 0 I; 1 L; 0 K; 0 M; 1 F; 0 P; 0 S; 0 T; 0 W; 0 Y; 1 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 
RLAFHHVAR 
X X 



4. US-08-249-182-1 (1-5) 

P71423 Immunomodulator peptide #2 inhibits HIV-T4 interac 

ID P71423 standard; Protein; 21 AA. 

AC P71423; 

DT 03-MAY-1991 (first entry) 

DE Immunomodulator peptide #2 inhibits HIV-T4 interaction. 

KW AIDS; T4 cell receptor; immunomodulation. 

OS Synthetic. 

PN WO8703601-A. 

PD 18-JUN-1987. 

PF 08-DEC-1986; 402717. 

PR 06-DEC-1985; FR-018155. 

PA (INSP ) INST PASTEUR. 

PA (AUFR/) AUFRAY C. 

PI Auffray C, Montagnier L, Klatzmann D, Charron D; 

DR WPI; 87-177935/25. 

PT New peptide derivs. contg. specified exposed tetra:peptide 

PT sequences - inhibiting interaction of AIDS virus with T4 cell 

PT receptors 

PS Claim 5; Page 48; 57pp; French. 

CC The peptide is a specific example of a peptide comprising the 

CC tetrapeptide motif RFDS (pref. at position 7 to 10 and optionally 

CC having RE at positions 1 and 2 and/or EL at positions 20 and 21). 

CC It interferes with interaction between the AIDS virus and T4 

CC receptors on lymphocytes. The peptide also has immunomodulatory 

CC activity. It is useful in diagnosis to detect antibodies to the 

CC region of the viral genome containing the RFDS sequence. 

CC See also P71422 and P71424-P71437. 

SQ Sequence 21 AA; 

SQ 2 A; 3 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 3 E; 0 Z; 0 G; 2 H; 

SQ 0 I; 3 L; 1 K; 0 M; 2 F; 0 P; 1 S; 0 T; 1 W; 0 Y; 2 V; 



Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

REVLEWRFDSKLAFHHVAREL 
10 X 20 



5. US-08-249-182-1 (1-5) 

P71422 Immunomodulator peptide #1 inhibits HIV-T4 interac 

ID P71422 standard; peptide; 21 AA. 

AC P71422; 

DT 03-MAY-1991 (first entry) 

DE Immunomodulator peptide #1 inhibits HIV-T4 interaction. 

KW AIDS; T4 cell receptor; immunomodulation. 

OS Synthetic. 

PN WO8703601-A. 

PD 18-JUN-1987. 

PF 08-DEC-1986; 402717. 

PR 06-DEC-1985; FR-018155. 

PA (INSP ) INST PASTEUR. 

PA (AUFR/) AUFRAY C. 

PI Auffray C, Montagnier L, Klatzmann D, Charron D; 

DR WPI; 87-177935/25. 

PT New peptide derivs. contg. specified exposed tetra:peptide 

PT sequences - inhibiting interaction of AIDS virus with T4 cell 

PT receptors 

PS Claim 5; Page 48; 57pp; French. 

CC The peptide is a specific example of a peptide comprising the 

CC tetrapeptide motif RFDS (pref. at position 7 to 10 and optionally 

CC having RE at positions 1 and 2 and/or EL at positions 20 and 21). 

CC It interferes with interaction between the AIDS virus and T4 

CC receptors on lymphocytes. The peptide also has immunomodulatory 

CC activity. It is useful in diagnosis to detect antibodies to the 

CC region of the viral genome containing the RFDS sequence. 

CC See also P71423-P71437. 

SQ Sequence 21 AA; 

SQ 2 A; 4 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 3 E; 0 Z; 0 G; 2 H; 

SQ 0 I; 3 L; 0 K; 0 M; 2 F; 0 P; 1 S; 0 T; 1 W; 0 Y; 2 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

REVLEWRFDSRLAFHHVAREL 
10 X 20 



6. US-08-249-182-1 (1-5) 



ID P71493 standard; peptide; 32 AA. 

AC P71493; 

DT 01-MAY-1991 (first entry) 

DE Antigenic peptide cross-reactive with HTLV-III env protein 3' ORF. 

KW Human T-cell Lymphotrophic virus; p27 antigen. 

OS Human T-cell lymphotrophic virus. 

PN WO8702988-A. 

PD 21-MAY-1987. 

PF 05-NOV-1986; U02381. 

PR 07-NOV-1985; US-795997. 

PA (HARD ) HARVARD COLLEGE. 

PI Essex ME, Allan JS, Lee TH; 

DR WPI; 87-150611/21. 

PT Human T-cell lymphotrophic virus protein - used for assaying for 

PT antibodies and in assaying for antigenic determinants for 

PT predicting the course of HTLV-III infection 

PS Claim 7; Page 14; 25pp; English. 

CC The p27 antigen has a determinant cross-reactive with the protein 

CC encoded by the ORF 3' to the env gene of HTLV-III. It may be used in 

CC diagnosis and monitoring of the course of the disease. 

SQ Sequence 32 AA; 

SQ 2 A; 4 R; 1 N; 3 D; 0 B; 1 C; 0 Q; 5 E; 0 Z; 0 G; 3 H; 

SQ 0 I; 3 L; 1 K; 0 M; 2 F; 2 P; 1 S; 0 T; 1 W; 1 Y; 2 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

DDPEREVLEWRDSRLAFHHVARELHPEYFKNC 
10 20 X 30 



7. US-08-249-182-1 (1-5) 

R42697 p16 of nef peptide of HIV-1. 

ID R42697 standard; protein; 35 AA. 

AC R42697; 

DT 10-NOV-1993 (first entry) 

DE p16 of nef peptide of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

FH Key Location/Qualifiers 

FT Modified_site 35 

FT /note= "Cys(acetamidomethyl)" 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Page 3; 15pp; English. 

CC The peptide is expressed in vivo in HIV infected patients before 

CC detectable amts. of p25, gp110 and gp41 are expressed. Thus, it 

CC can be used in assays for early detection of HIV. 

CC It can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptide may be used in viral vaccines. 

SQ Sequence 35 AA; 

SQ 2 A; 4 R; 1 N; 3 D; 0 B; 1 C; 0 Q; 5 E; 0 Z; 1 G; 3 H; 

SQ 0 I; 3 L; 1 K; 1 M; 3 F; 2 P; 1 S; 0 T; 1 W; 1 Y; 2 V; 



Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
10 20 X 30 



8. US-08-249-182-1 (1-5) 

P90958 Peptide corresp. to AA171-205 of HIV-1 protein F f 

ID P90958 standard; peptide; 35 AA. 

AC P90958; 

DT 23-FEB-1990 (first entry) 

DE Peptide corresp. to AA171-205 of HIV-1 protein F fragment 

KW HIV 

OS 

PN WO8909227-A. 

PD 05-OCT-1989. 

PF 31-MAR-1989; F00151. 

PR 01-APR-1988; FR-004405. 

PA (INSP) Inst Pasteur. 

PI Montagnier L, Rochat H, Bahraoui EM, Chamaret S; 

DR WPI; 89-309503/42. 

PT New human immuno-deficiency virus peptide(s) - corresp. to protein F 

PT fragments, useful for acquired immunodeficiency syndrome diagnosis 

PS Claim 3; page 43; 57pp; French. 

CC This can be used as an immunoassay reagent for AIDS diagnosis, in 

CC treating AIDS and in the prodn. of other peptides and vaccines. They are 

CC prepd. by conventional synthesis. 

SQ Sequence 35 AA; 

SQ 2 A; 4 R; 1 N; 3 D; 0 B; 1 C; 0 Q; 5 E; 0 Z; 1 G; 3 H; 

SQ 0 I; 3 L; 1 K; 1 M; 3 F; 2 P; 1 S; 0 T; 1 W; 1 Y; 2 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

GMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
10 20 X 30 



9. US-08-249-182-1 (1-5) 

P92270 Peptide PF16 from HIV-1 protein F fragment 

ID P92270 standard; peptide; 37 AA. 

AC P92270; 

DT 23-FEB-1990 (first entry) 

DE Peptide PF16 from HIV-1 protein F fragment 

KW HIV 

OS 



FH Key Location/Qualifiers 

FT Misc-difference 1 

FT /note= "OH, NH2 or 1-5 AA" 

FT Misc-difference 37 

FT /note= "OH, NH2 or 1-5 AA" 

PN WO8909227-A. 

PD 05-OCT-1989. 

PF 31-MAR-1989; F00151. 

PR 01-APR-1988; FR-004405. 

PA (INSP) Inst Pasteur. 

PI Montagnier L, Rochat H, Bahraoui EM, Chamaret S; 

DR WPI; 89-309503/42. 

PT New human immuno-deficiency virus peptide(s) - corresp. to protein F 

PT fragments, useful for acquired immunodeficiency syndrome diagnosis 

PS Claim 4; page 43; 57pp; French. 

CC This can be used as an immunoassay reagent for AIDS diagnosis, in 

CC treating AIDS and in the prodn. of other peptides and vaccines. It is 

CC prepd. by conventional synthesis. 

SQ Sequence 37 AA; 

SQ 2 A; 4 R; 1 N; 3 D; 0 B; 1 C; 0 Q; 5 E; 1 Z; 1 G; 3 H; 

SQ 0 I; 3 L; 1 K; 1 M; 3 F; 2 P; 1 S; 0 T; 1 W; 1 Y; 2 V; 

SQ 1 Others; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

XGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCZ 
10 20 X X 30 



10. US-08-249-182-1 (1-5) 

R42698 p17 of nef peptide of HIV-1. 

ID R42698 standard; protein; 65 AA. 

AC R42698; 

DT 10-NOV-1993 (first entry) 

DE p17 of nef peptide of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

FH Key Location/Qualifiers 

FT Modified_site 1 

FT /note= "Cys(acetamidomethyl)" 

FT Modified_site 65 

FT /note= "Cys(acetamidomethyl)" 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Page 3; 15pp; English. 

CC The peptide is expressed in vivo in HIV infected patients before 

CC detectable amts. of p25, gp110 and gp41 are expressed. Thus, it 

CC can be used in assays for early detection of HIV. 

CC It can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptide may be used in viral vaccines. 

SQ Sequence 65 AA; 

SQ 3 A; 4 R; 3 N; 4 D; 0 B; 2 C; 0 Q; 9 E; 0 Z; 2 G; 5 H; 

SQ 0 I; 7 L; 4 K; 1 M; 3 F; 5 P; 3 S; 1 T; 1 W; 2 Y; 6 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
10 20 30 40 50 X 60 



11. US-08-249-182-1 (1-5) 

P92271 Peptide PF17 from HIV-1 protein F fragment 

ID P92271 standard; peptide; 67 AA. 

AC P92271; 

DT 23-FEB-1990 (first entry) 

DE Peptide PF17 from HIV-1 protein F fragment 

KW HIV 

OS 

FH Key Location/Qualifiers 

FT Misc-difference 1 

FT /note= "OH, NH2 or 1-5 AA" 

FT Misc-difference 67 

FT /note= "OH, NH2 or 1-5 AA" 

PN WO8909227-A. 

PD 05-OCT-1989. 

PF 31-MAR-1989; F00151. 

PR 01-APR-1988; FR-004405. 

PA (INSP) Inst Pasteur. 

PI Montagnier L, Rochat H, Bahraoui EM, Chamaret S; 

DR WPI; 89-309503/42. 

PT New human immuno-deficiency virus peptide(s) - corresp. to protein F 

PT fragments, useful for acquired immunodeficiency syndrome diagnosis 

PS Claim 4; page 43; 57pp; French. 

CC This can be used as an immunoassay reagent for AIDS diagnosis, in 

CC treating AIDS and in the prodn. of other peptides and vaccines. It is 

CC prepd. by conventional synthesis. 

SQ Sequence 67 AA; 

SQ 3 A; 4 R; 3 N; 4 D; 0 B; 2 C; 0 Q; 9 E; 1 Z; 2 G; 5 H; 

SQ 0 I; 7 L; 4 K; 1 M; 3 F; 5 P; 3 S; 1 T; 1 W; 2 Y; 6 V; 

SQ 1 Others; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNCZ 
10 20 30 40 50 X X 60 



12. US-08-249-182-1 (1-5) 

R11188 Wheat 33kD protein pre-sequence and N-terminal lin 

ID R11188 standard; Protein; 105 AA. 

DT 22-MAY-1991 (first entry) 

DE Wheat 33kD protein pre-sequence and N-terminal linker. 

KW wheat; 33kDa protein; photosynthetic oxygen-evolving complex; 

KW thylakoid lumen. 

OS Triticum aestivum. 

FH Key Location/Qualifiers 

FT Peptide 1..79 

FT /label= 33kDa protein pre-sequence 

FT /note= "targets proteins to thylakoid lumen" 

PN WO9102800-A. 

PD 07-MAR-1991. 

PF 14-AUG-1990; C01281. 

PR 14-AUG-1989; GB-018496. 

PA (AGRI-) AGRIC GENETICS CO LTD. 

PA (ADTE-) ADVANCED TECHNOLOGIES (CAMBRIDGE) LTD. 

PA (BIOT-) BIOTAL LTD. 

PA (BRPE ) BP NUTRITION LTD. 

PA (CIBA ) CIBA-GEIGY PLC. 

PA (ICIL ) IMPERIAL CHEMICAL INDUSTRIES PLC. 

PA (RHON ) RHONE-POULENC LTD. 

PA (SCHD ) SCHERING AGROCHEMICALS LTD. 

PA (SHEL ) SHELL RESEARCH LTD. 

PA (UNIL ) UNILEVER UK. 

PI Robinson C. 

DR WPI; 91-087283/12. 

DR N-PSDB; Q11022. 

PT Novel chimeric gene used for transforming plant cell - useful for 

PT delivering heterologous passenger protein into thylakoid lumen of 

PT chloroplast. 

PS Claim 5; Page 17; 19pp; English. 

CC The sequence comprises the 79 amino acid pre-sequence of the 33kDa 

CC protein of the photosynthetic oxygen-evolving complex of wheat, 

CC directly linked to the first 26 N-terminal amino acids of the mature 

CC wheat 33kDa protein. The latter constitutes an optional linker 

CC between the pre-sequence and an heterologous protein sequence. 

CC Vectors containing a chimaeric gene of the invention (i.e. the pre- 

CC sequence, optional linker and heterologous protein-encoding 

CC sequences) can be used to transform a plant cell for production of 

CC the heterologous protein which is targetted to the thylakoid lumen 

CC of the chloroplast. 

SQ Sequence 105 AA; 

SQ 24 A; 6 R; 0 N; 4 D; 0 B; 2 C; 3 Q; 4 E; 0 Z; 9 G; 1 H; 

SQ 4 I; 7 L; 6 K; 4 M; 3 F; 3 P; 11 S; 8 T; 0 W; 1 Y; 5 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

MAASLQAAATLHPAKIGGRASSARPSSHVARAFGVDAGARITCSLfiSDIREVASKCADAAKMAGFALATSAL 
10 20 X 30 40 50 60 70 



LVSGATAEG 
80 



13. US-08-249-182-1 (1-5) 

R38893 Nef protein of HIV-1. 

ID R38893 standard; Protein; 206 AA. 

AC R38893; 

DT 10-NOV-1993 (first entry) 

DE Nef protein of HIV-1. 

KW AIDS; antibody; p25; gp110; gp41; assay; detection; 

KW immunity; vaccine. 

OS Human immunodeficiency virus-1. 

PN US5221610-A. 

PD 22-JUN-1993. 

PF 26-MAY-1988; 199143. 

PR 26-MAY-1988; US-199143. 

PR 04-SEP-1991; US-754300. 

PA (INRM ) INST NAT SANTE & RECH MEDICALE. 

PA (INSP ) INST PASTEUR. 

PI Bahraoui EM, Chamaret S, Ferris S, Granier C, Montagnier L; 

PI Rietschoten JV, Rochat H, Sabatier JM; 

DR WPI; 93-213434/26. 

PT Diagnosis of HIV infection - by detecting HIV antibodies using 

PT antigenic polypeptide derived from nef protein of HIV-1 

PS Disclosure; Fig 2; 15pp; English. 

CC The nef protein comprises peptides which are expressed in vivo in HIV 

CC infected patients before detectable amts. of p25, gp110 and gp41 are 

CC expressed. Thus, they can be used in assays for early detection of HIV. 

CC They can also be used to raise antibodies for use in detection, 

CC to induce cellular immunity or to raise neutralising antibodies 

CC that either inactivate the AIDS virus or reduce the viability of 

CC the virus in vivo or destroy infected cells. 

CC The peptides may be used in viral vaccines. 

SQ Sequence 206 AA; 

SQ 17 A; 13 R; 6 N; 10 D; 0 B; 3 C; 6 Q; 19 E; 0 Z; 16 G; 9 H; 

SQ 4 I; 17 L; 10 K; 4 M; 7 F; 15 P; 11 S; 10 T; 7 W; 7 Y; 15 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 



14. US-08-249-182-1 (1-5) 

P61515 Sequence of E' protein. 

ID P61515 standard; Protein; 206 AA. 

AC P61515; 

DT 08-JUN-1991 (first entry) 

DE Sequence of E' protein. 

KW HIV; LAV; AIDS; diagnosis; vaccine. 

OS HTLV-IIIB/H9 cells (ATCC CRL 8543). 

PN EP-187041-A. 

PD 09-JUL-1986. 

PF 23-DEC-1985; 309454. 

PR 24-DEC-1984; US-685272. 

PR 04-DEC-1985; US-805069. 

PA (GETH ) GENENTECH INC. 

PI Capon DJ, Lasky LA; 

DR WPI; 86-177602/28. 

DR N-PSDB; N60288 

PT Acquired immune deficiency syndrome polypeptide(s) - obtd. by 

PT molecular cloning etc. and used for diagnosis and in vaccines 

PT against virus disease 

PS Example; fig 2; 125pp; English. 

CC A comparison of N60287 with the cDNA of the HTLV-III genome 

CC revealed one particular clone, designated p7.11, which contained a 

CC DNA sequence encoding this peptide (P60308) sequence. This approx. 

CC 2.2 kilobase covers the precursor gag region and encodes, 5' to 3', 

CC p-12, p-15, p-24, a second p-15 protein, and approx. 300 extra base 

SQ Sequence 206 AA; 

SQ 18 A; 13 R; 7 N; 10 D; 0 B; 3 C; 6 Q; 18 E; 0 Z; 16 G; 9 H; 
SQ 6 I; 17 L; 11 K; 4 M; 7 F; 15 P; 10 S; 9 T; 7 W; 7 Y; 13 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

CYKLVPVEPDKVEEANKGENTSLLHPVTLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 



15. US-08-249-182-1 (1-5) 

P60423 Sequence of LAV virus ORF F protein. 

ID P60423 standard; Protein; 216 AA. 

AC P60423; 

DT 20-AUG-1991 (first entry) 

DE Sequence of LAV virus ORF F protein. 

KW AIDS vaccine; diagnosis; immunoassay; HIV; HTLV-III. 

OS Lymphadenopathy virus. 

PN WO8602383-A. 

PD 24-APR-1986. 

PF 18-OCT-1985; E00548. 

PR 18-OCT-1984; FR-016013. 

PR 16-NOV-1984; GB-029099. 

PR 21-JAN-1985; GB-001473. 

PA (CNRS ) CNRS CENT NAT RECH SCI. 

PA (INSP ) INST PASTEUR. 

PI Montagnier L, Krust B, Chamaret S, Clavel F, Chermann J-C, 

PI Barre-Sinoussi F, Alizon M, Sonigo P, Stewart C, Danos O, 

PI Wain-Hobson S. 

DR WPI; 86-119166/18. 

DR N-PSDB; N60365. 

PT Purified glycoprotein and peptide(s) - are recognised by sera contg. 

PT antibodies against lymphadenopathy virus and useful in detecting 

PT AIDS antibodies or in vaccines 

PS Disclosure; Fig 4; 75pp; English. 

CC The inventors claim a polypeptide which is recognised by sera of 

CC human origin contg. antibodies against the virus of 

CC lymphadenopathies (LAV) or acquired immune deficiency syndrome 

CC (AIDS). Also claimed are various peptides corresp. to the AA 

CC sequences deducible from proteins encoded by LAV DNA, defined by 

CC specific residues (e.g. 12-32, 37-46, 49-79, 88-153) in accordance 

CC with a formula given in the specification. 

SQ Sequence 216 AA; 

SQ 18 A; 12 R; 6 N; 10 D; 0 B; 4 C; 7 Q; 18 E; 0 Z; 17 G; 10 H; 

SQ 4 I; 17 L; 12 K; 4 M; 8 F; 16 P; 11 S; 11 T; 8 W; 8 Y; 15 V; 

Initial Score = 4 Optimized Score = 4 Significance = 3.92 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
160 170 180 190 200 X X 210 
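
The Matches, Mismatches and Residue Identity figures reported for each hit above can be reproduced with a short ungapped comparison. The following is a minimal sketch, not the FastDB implementation: the function names are hypothetical, and it assumes the reported statistics are simple position-by-position counts over an equal-length window, which is consistent with the values printed above (for example WHVAR against HHVAR gives 4 matches, 1 mismatch, 80%).

```python
def identity_stats(query, window):
    """Count matches and mismatches between two equal-length, ungapped segments.

    Returns (matches, mismatches, percent_identity).
    """
    if len(query) != len(window):
        raise ValueError("segments must have equal length for an ungapped comparison")
    matches = sum(1 for q, s in zip(query, window) if q == s)
    mismatches = len(query) - matches
    return matches, mismatches, 100.0 * matches / len(query)


def best_ungapped_window(query, subject):
    """Slide the query along the subject; return (offset, matches) of the best window.

    Offsets are 0-based; the first window with the highest match count wins.
    """
    best_offset, best_matches = 0, -1
    for i in range(len(subject) - len(query) + 1):
        m = identity_stats(query, subject[i:i + len(query)])[0]
        if m > best_matches:
            best_offset, best_matches = i, m
    return best_offset, best_matches


if __name__ == "__main__":
    # Query residues 1-5 against the 9-residue peptide from hit 2 (P71436).
    print(identity_stats("WHVAR", "HHVAR"))
    print(best_ungapped_window("WHVAR", "KLAFHHVAR"))
```

With a query of only five residues, an 80% identity corresponds to a single mismatch, which is why every hit in the list shares the same score.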

> 0 < 

0| |0 IntelliGenetics 

> 0 < 



FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 



Results file u249_lp.res made by on Thu 22 Sep 94 10:46:43-PDT. 

Query sequence being compared: US-08-249-182-1 (1-5) 
Number of sequences searched: 70848 
Number of scores above cutoff: 3849 

Results of the initial comparison of US-08-249-182-1 (1-5) with: 
Data bank : PIR 41 , all entries 

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 0 to 100000) plotted against SCORE; plot body not recoverable.] 



PARAMETERS 

Similarity matrix          Unitary    K-tuple               2 
Mismatch penalty           1          Joining penalty       20 
Gap penalty                1.00       Window size           5 
Gap size penalty           0.05       Cutoff score          0 
Randomization group        0 
Initial scores to save     40         Alignments to save    15 
Optimized scores to save   0          Display context       50 



SEARCH STATISTICS 



Scores:    Mean    Median    Standard Deviation 
           1       3         0.98 

Times:     CPU            Total Elapsed 
           00:01:22.94    00:01:32.00 



Number of residues: 20816057 
Number of sequences searched: 70848 
Number of scores above cutoff: 3849 

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
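
One plausible reading of the significance column, offered here as an assumption rather than as the documented FastDB formula, is the number of standard deviations by which a score exceeds the mean score of the search. The sketch below uses a hypothetical helper and the rounded values printed under SEARCH STATISTICS (mean 1, standard deviation 0.98). For an initial score of 4 it gives about 3.06, close to the 3.05 reported in the list; the small difference would be expected if the program used unrounded statistics.

```python
def significance(score, mean, std_dev):
    """Standard deviations above the mean (a z-score).

    Assumption: this mirrors how the printout's 'Sig.' column is derived;
    the source only states that it is based on the initial score.
    """
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / std_dev


if __name__ == "__main__":
    # Rounded statistics as printed for the PIR 41 search.
    print(round(significance(4, 1, 0.98), 2))
```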

A 100% identical sequence to the query sequence was not found. 



The list of best scores is: 



                                                     Init.  Opt. 
Sequence Name   Description                   Length Score  Score  Sig.  Frame 

          **** 3 standard deviations above mean **** 
 1. S12146      hypothetical protein E16 - ph     26     4      4  3.05      0 
 2. JQ1620      hypothetical 4.3K protein - h     34     4      4  3.05      0 
 3. PN0630      Acid phosphatase (EC 3.1.3.2)     38     4      4  3.05      0 
 4. B44255      macrophage mannose receptor,      71     4      4  3.05      0 
 5. A42329      autotaxin - human (fragments)    114     4      4  3.05      0 
 6. B45065      putative helicase - Escherich    130     4      4  3.05      0 
 7. B49205      virulence-associated=vapC - D    135     4      4  3.05      0 
 8. A40663      acid phosphatase (EC 3.1.3.2)    146     4      4  3.05      0 
 9. S33783      hemoglobin V - Tokunagayusuri    152     4      4  3.05      0 
10. S38404      hypothetical protein - yeast     154     4      4  3.05      0 
11. A38148      acid phosphatase (EC 3.1.3.2)    158     4      4  3.05      0 
12. D25973      pertussis toxin chain S2 prec    169     4      4  3.05      0 
13. XXECC3      chloramphenicol O-acetyltrans    203     4      4  3.05      0 
14. S03246      nef protein (clone HAT3) - hu    204     4      4  3.05      0 
15. S33986      nef protein - human immunodef    206     4      4  3.05      0 
16. S25937      nef protein - human immunodef    206     4      4  3.05      0 
17. S18691      nef protein - human immunodef    206     4      4  3.05      0 
18. S14609      nef protein - human immunodef    206     4      4  3.05      0 
19. S03245      nef protein (clone HXB3) - hu    206     4      4  3.05      0 
20. S03244      nef protein (clone HXB2) - hu    206     4      4  3.05      0 
21. ASLJFV      nef protein - human immunodef    206     4      4  3.05      0 
22. ASLJVL      nef protein - human immunodef    206     4      4  3.05      0 
23. ASU12       nef protein - human immunodef    206     4      4  3.05      0 
24. [illegible] [illegible]               [illegible]              3.05 
25. 144001      nef protein - human immunodef    214               3.05 
26. S26084      ribosomal protein S3 - Euglen    218               3.05 
27. S34524      ribosomal protein S3 - Euglen    218               3.05 
28. S16554      GTP-binding protein rgp1 - ri    226               3.05 
29. C25973      pertussis toxin chain S2 prec    226               3.05 
30. WEBR2P      pertussis toxin chain S2 prec    226               3.05 
31. S23143      acid phosphatase C - human       227               3.05 
32. S23142      acid phosphatase A - human       227               3.05 
33. A38535      hypothetical protein (MicA re    229               3.05 
34. S07475      early light-induced protein,     231               3.05 
35. A37147      bacteriophage resistance prot    234               3.05 
36. S30700      hypothetical protein o235 - E    235               3.05 
37. B37841      hypothetical protein 235 (dap    235               3.05 
38. B1AGA6      virB1 protein - Agrobacterium    239               3.05 
39. B1AG55      virB1 protein precursor - Agr    239               3.05 
40. S15685      kallikrein, glandular - multi    250               3.05 



1. US-08-249-182-1 (1-5) 

S12146 hypothetical protein E16 - phage D108 

ENTRY S12146 #type complete 

TITLE hypothetical protein E16 - phage D108 

ORGANISM #formal_name phage D108 

DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change 

21-Nov-1993 
ACCESSIONS S12146 
REFERENCE S12145 

#authors Pato, M.L.; Banerjee, M.; Wagonner, B.T. 

#journal Nucleic Acids Res. (1990) 18:6458 

#title Sequence of gene E15 of bacteriophage D108 and comparison 

with phage Mu. 
#cross-references MUID:91057162 
#accession S12146 

##status preliminary 
##residues 1-26 ##label PAT 
##cross-references EMBL:X54298 
SUMMARY #length 26 #molecular-weight 3102 #checksum 6658 

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 



MSRTSLIKLIHVARRELGLDDDTYRA 
10 X 20 



2. US-08-249-182-1 (1-5) 



JQ1620 hypothetical 4.3K protein - human immunodeficiency 

ENTRY JQ1620 #type complete 

TITLE hypothetical 4.3K protein - human immunodeficiency virus type 

1 

ORGANISM #formal_name human immunodeficiency virus type 1, HIV-1 

DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change 

21-Nov-1993 
ACCESSIONS JQ1620 
REFERENCE JQ1620 

#authors Smith, J.; Azad, A.; Deacon, N. 

#journal J. Gen. Virol. (1992) 73:1825-1828 

#title Identification of two novel human immunodeficiency virus type 

1 splice acceptor sites in infected T cell lines. 
#cross-references MUID:92333269 
#accession JQ1620 

##status preliminary 
##residues 1-34 ##label SMI 
SUMMARY #length 34 #molecular-weight 4274 #checksum 4556 

SEQUENCE 

Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

MDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
10 20 X 30 



3. US-08-249-182-1 (1-5) 

PN0630 Acid phosphatase (EC 3.1.3.2) 1F, low-molecular-we 

ENTRY PN0630 #type fragment 

TITLE Acid phosphatase (EC 3.1.3.2) 1F, low-molecular-weight - 

Human (fragment) 

ORGANISM #formal_name Homo sapiens #common_name man 

DATE 27-May-1994; #sequence_revision 27-May-1994; #text_change 

27-May-1994 
ACCESSIONS PN0630 
REFERENCE PN0630 

#authors Lazaruk, K.D.A.; Dissing, J.; Sensabaugh, G.F. 

#journal Biochem. Biophys. Res. Commun. (1993) 196:440-446 

#title Exon structure at the human ACP1 locus supports alternative 

splicing model for f and s isozyme generation. 
#accession PN0630 

##status preliminary 
##residues 1-38 ##label LAZ 
SUMMARY #length 38 #checksum 7022 

SEQUENCE 

Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

WRVDSAATSGYEIGNPPDYRGQSCMKRHGIPNSHVARQ 
10 20 30 X X 



4. US-08-249-182-1 (1-5) 



B44255 macrophage mannose receptor, MRC1=group I hepatic 

ENTRY B44255 #type fragment 

TITLE macrophage mannose receptor, MRC1=group I hepatic 

glycoprotein receptor homolog (carbohydrate-recognition 
domains 1-8) - human (fragment) 

ORGANISM #formal_name Homo sapiens #common_name man 

DATE 30-Apr-1993; #sequence_revision 30-Apr-1993; #text_change 

30-Apr-1993 

ACCESSIONS B44255 

REFERENCE A44255 

#authors Kim, S.J.; Ruiz, N.; Bezouska, K.; Drickamer, K. 
#journal Genomics (1992) 14:721-727 

#title Organization of the gene encoding the human macrophage 

#cross-references MUID:93052405 
#accession B44255 

##status preliminary 
##residues 1-71 ##label KIM 
##cross-references NCBIP:118421 
##note sequence extracted from NCBI backbone 

SUMMARY #length 71 #checksum 5821 

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 



X X 
WHVAR 



FEGSESLWNKDPLTSVSYQINKSALLTHH9ARKSCQQ8NAELLSITEIHE9TYLTGLTSSLTSGLMIGLNS 
10 20 30 X 40 50 60 70 



5. US-08-249-182-1 (1-5) 
A42329 autotaxin - human (fragments) 

ENTRY A42329 #type fragments 

TITLE autotaxin - human (fragments) 

ORGANISM #formal_name Homo sapiens #common_name man 

DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change 

08-May-1993 
ACCESSIONS A42329 
REFERENCE A42329 

#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.; 
Cioce, V.; Schiffmann, E.; Liotta, L.A. 

#journal J. Biol. Chem. (1992) 267:2524-2529 

#title Identification, purification, and partial sequence analysis 

of autotaxin, a novel motility-stimulating protein. 
#cross-references MUID:92129337 
#accession A42329 

##status preliminary 

##molecule_type protein 

##residues 1-114 ##label STR 

##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518; 

NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510; 
NCBIP:78509; NCBIP:78508; NCBIP:78503 
##note sequence extracted from NCBI backbone 

SUMMARY #length 114 #checksum 7335 

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

TATKSPPFENINLYYDVPWNETIPEEVTXPNYLfiAEVSYPAFKPXLDVYKHHVAAN 
60 70 80 90 100 110 X 



6. US-08-249-182-1 (1-5) 

B45065 putative helicase - Escherichia coli (fragment) 

ENTRY B45065 #type fragment 

TITLE putative helicase - Escherichia coli (fragment) 

ORGANISM #formal_name Escherichia coli 

DATE 10-Jun-1993; #sequence_revision 10-Jun-1993; #text_change 

ACCESSIONS B45065 
REFERENCE A45065 

#authors Huang, S.; Deutscher, M.P. 

#journal J. Biol. Chem. (1992) 267:25609-25613 

#title Sequence and transcriptional analysis of the Escherichia coli 

rnt gene encoding RNase T. 
#cross-references MUID:93094287 
#contents K12 strain UT481 
#accession B45065 

##status preliminary 
##molecule_type nucleic acid 
##residues 1-130 ##label HUA 
##cross-references NCBIP:120478 
##note sequence extracted from NCBI backbone 

SUMMARY #length 130 #checksum 1961 

SEQUENCE 

Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

MADNPDPSSLLPDVFSPATRDWFLRAFKQPTAVSP8TWHVAARSEHALVIAPTGSGKTLAAFLYALDRLFRE 
10 20 30 40 X 50 60 70 

GGEDTREAHKRKTSRILYIS 
80 90 



7. US-08-249-182-1 (1-5) 

B49205 virulence-associated=vapC - Dichelobacter nodosus 

ENTRY B49205 #type complete 

TITLE virulence-associated=vapC - Dichelobacter nodosus 

ORGANISM #formal_name Dichelobacter nodosus 

DATE 19-Dec-1993; #sequence_revision 19-Dec-1993; #text_change 

19-Dec-1993 
ACCESSIONS B49205 
REFERENCE A49205 

#authors Katz, M.E.; Strugnell, R.A.; Rood, J.I. 

#journal Infect. Immun. (1992) 60:4586-4592 

#title Molecular characterization of a genomic region associated 

with virulence in Dichelobacter nodosus. 
#cross-references MUID:93014173 
#accession B49205 

##status preliminary 

##molecule_type nucleic acid 
##residues 1-135 ##label KAT 
##cross-references NCBIN:116426; NCBIP:116429 
##note sequence extracted from NCBI backbone 

SUMMARY #length 135 #molecular-weight 15128 #checksum 5461 

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
WHVAR 
|||| 

RYEIGDIGISNITACELAFGVEKSGSAKNKTALTKFLAPLSILPFDKQAIWHYARIR9SLQNRGTPIGALDM 
30 40 50 60 70 80 X 90 100 



110 120 130 



8. US-08-249-182-1 (1-5)

A40663 acid phosphatase (EC 3.1.3.2) alpha, adipocyte - h

ENTRY A40663 #type fragment
TITLE acid phosphatase (EC 3.1.3.2) alpha, adipocyte - human (fragment)
ORGANISM #formal_name Homo sapiens #common_name man
DATE 16-Feb-1994; #sequence_revision 16-Feb-1994; #text_change 16-Feb-1994
ACCESSIONS A40663
REFERENCE A40663
#authors Shekels, L.L.; Smith, A.J.; Van Etten, R.L.; Bernlohr, D.A.
#journal Protein Sci. (1992) 1:710-721
#title Identification of the adipocyte acid phosphatase as a PAO-sensitive tyrosyl phosphatase.
#cross-references MUID:93284125
#contents adipocytes
#accession A40663
##status preliminary
##molecule_type nucleic acid
##residues 1-146 ##label SHE
##cross-references NCBIN:134183; NCBIP:134185
##note sequence extracted from NCBI backbone

KEYWORDS phosphoric monoester hydrolase

SUMMARY #length 146 #checksum 1878

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

IAEAVFRKLVTDQNISENWRVDSAATSGYEIGNPPDYRGQSCMKRHGIPMSHVARQITKEDFATFDYILCMD 
10 20 30 40 50 60 X 70 80 



ESNLRDLNRKSNRVKTCKAKIELLGSYDP9KQL 
90 100 110 



9. US-08-249-182-1 (1-5) 

S33783 hemoglobin V - Tokunagayusurika akamusi

ENTRY S33783 #type complete
TITLE hemoglobin V - Tokunagayusurika akamusi
ORGANISM #formal_name Tokunagayusurika akamusi
DATE 02-Dec-1993; #sequence_revision 02-Dec-1993; #text_change 02-Dec-1993
ACCESSIONS S33783
REFERENCE S33783
#authors Fukuda, M.; Takagi, T.; Shikama, K.
#journal Biochim. Biophys. Acta (1993) 1157:185-191
#title Polymorphic hemoglobin from a midge larva (Tokunagayusurika akamusi) can be divided into two different types.
#accession S33783
##status preliminary
##residues 1-152 ##label FUK

SUMMARY #length 152 #molecular-weight 17197 #checksum 8528

SEQUENCE



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

ETLKGHPLDEVKDTANFKL1AGRIFTIFDNCVKNVGNDKGFQKVIADHSGPHVARP1THGSYNDLRGVIYDS 
50 60 70 80 90 100 110 

MHLDSTHGAAWNKNHDNFFYVFYECLDGRCS8F 
120 130 140 150



10. US-08-249-182-1 (1-5) 

S38404 hypothetical protein - yeast (Saccharomyces cerevi

ENTRY S38404 #type complete
TITLE hypothetical protein - yeast (Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 17-Mar-1994; #sequence_revision 17-Mar-1994; #text_change 17-Mar-1994
ACCESSIONS S38404
REFERENCE S37786
#authors Vandenbol, M.; Bolle, P.; Dion, C.; Portetelle, D.; Hilger, F.
#submission submitted to the EMBL Data Library, September 1993
#accession S38404
##status preliminary
##residues 1-154 ##label VAN
##cross-references EMBL:Z26878
SUMMARY #length 154 #molecular-weight 17967 #checksum 6460

SEQUENCE 

Initial Score = 4 Optimized Score = 4 Significance = 3.05 
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

A I 8RNDDGGSKENSYGTTI GF8DRNRY YFPRI fiGHDDPCHR I VHYLL I PRI HVARFFHFEKTLSLVGHH8Y I 
40 50 60 70 80 90 X 100 110 

LAALIARAECTCLGHNTRLLISFWLIS8LGTFS 
120 130 140 



11. US-08-249-182-1 (1-5) 

A38148 acid phosphatase (EC 3.1.3.2) A - human

ENTRY A38148 #type complete
TITLE acid phosphatase (EC 3.1.3.2) A - human
ALTERNATE_NAMES acid phosphatase B-f; HCPTP-A; phosphotyrosyl protein phosphatase A
ORGANISM #formal_name Homo sapiens #common_name man
DATE 31-Dec-1993; #sequence_revision 31-Dec-1993; #text_change 31-Dec-1993
ACCESSIONS A38148; A39491
REFERENCE A38148
#authors Wo, Y.Y.P.; McCormack, A.L.; Shabanowitz, J.; Hunt, D.F.; Davis, J.P.; Mitchell, G.L.; Van Etten, R.L.
#journal J. Biol. Chem. (1992) 267:10856-10865
#title Sequencing, cloning, and expression of human red cell-type acid phosphatase, a cytoplasmic phosphotyrosyl protein phosphatase.
#cross-references MUID:92268143
#accession A38148
##molecule_type mRNA
##residues 1-153 ##label W01
##cross-references NCBIP:103842
##note sequence extracted from NCBI backbone
REFERENCE A39491
#authors Dissing, J.; Johnsen, A.H.; Sensabaugh, G.F.
#journal J. Biol. Chem. (1991) 266:20619-20625
#title Human red cell acid phosphatase (ACP1). The amino acid sequence of the two isozymes Bf and Bs encoded by the ACP1*B allele.
#cross-references MUID:92041911
#accession A39491
##molecule_type protein
##residues 2-158 ##label DIS
KEYWORDS phosphoric monoester hydrolase

SUMMARY #length 158 #molecular-weight 18042 #checksum 5808

SEQUENCE 

Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

I AEAVFRKL VTDQNI SENWRVDSAATSGYE I GNPPDYRGQSCMKRHGI PHSH VAR9 I TKEDF ATFDVI LCHD 
30 40 50 60 70 X X 80 90 

ESNLRDLNRKSNQVKTCKAKIELLGSYDPQKQL 
100 110 120 



12. US-08-249-182-1 (1-5) 

D25973 pertussis toxin chain S2 precursor - Bordetella pa

ENTRY D25973 #type complete
TITLE pertussis toxin chain S2 precursor - Bordetella parapertussis
ORGANISM #formal_name Bordetella parapertussis
DATE 04-Mar-1988; #sequence_revision 04-Mar-1988; #text_change 18-Jun-1993
ACCESSIONS D25973
REFERENCE A25973
#authors Arico, B.; Rappuoli, R.
#journal J. Bacteriol. (1987) 169:2847-2853
#title Bordetella parapertussis and Bordetella bronchiseptica contain transcriptionally silent pertussis toxin genes.
#cross-references MUID:87222217
#accession D25973
##status preliminary
##molecule_type DNA
##residues 1-169 ##label ARI
CLASSIFICATION #superfamily pertussis toxin chain S2
SUMMARY #length 169 #molecular-weight 18217 #checksum 656

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

MPISRKTLCHLLSVLPLAFLGCHVARASTPGIVIPP8E8ITSHGGPYGRCANKTRALTVAELRGSGDL8EYL 
10 20 X X 30 40 50 60 70 



iiLnssir iLfli iurc ssuperisnuy niua net protein 

SUMMARY #length 204 #molecular-weight 23176 #checksum 8284

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CFKLVPVEPDKVEEATEGENNSLLHPICLHGMDDPEKEVLVHKFDSRLAFHHVAREKHPEYYKDC 
140 150 160 170 180 190 X 200 



15. US-08-249-182-1 (1-5) 

S33986 nef protein - human immunodeficiency virus type 1

ENTRY S33986 #type complete
TITLE nef protein - human immunodeficiency virus type 1
ORGANISM #formal_name human immunodeficiency virus type 1, HIV-1
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change 22-Nov-1993
ACCESSIONS S33986
REFERENCE S33979
#authors Carlini, F.
#submission submitted to the EMBL Data Library, November 1991
#accession S33986
##status preliminary
##residues 1-206 ##label CAR
##cross-references EMBL:Z11530
SUMMARY #length 206 #molecular-weight 23398 #checksum 2728

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CYKLVPVEPEKLEEANKGENTSLLHPVSLHGHDDPGREVLEWRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_ls.res made by on Thu 22 Sep 94 10:49:02-PDT.

Query sequence being compared: US-08-249-182-1 (1-5)
Number of sequences searched: 36000
Number of scores above cutoff: 3810

Results of the initial comparison of US-08-249-182-1 (1-5) with:
Data bank : Swiss-Prot 28, all entries



13. US-08-249-182-1 (1-5) 

XXECC3 chloramphenicol O-acetyltransferase (EC 2.3.1.28)

ENTRY XXECC3 #type complete
TITLE chloramphenicol O-acetyltransferase (EC 2.3.1.28) III - Escherichia coli
ORGANISM #formal_name Escherichia coli
DATE 03-Aug-1984; #sequence_revision 03-Aug-1984; #text_change 30-Jun-1993
ACCESSIONS A00567
REFERENCE A00567
#authors Packman, L.C.; Kaye, N.M.C.; Fitton, J.E.
#citation unpublished results, cited by Shaw, W.V., CRC Crit. Rev. Biochem. 14, 1-46, 1983
#accession A00567
##molecule_type protein
##residues 1-203 ##label PAC
CLASSIFICATION #superfamily chloramphenicol acetyltransferase
KEYWORDS acyltransferase; antibiotic resistance
SUMMARY #length 203 #molecular-weight 23824 #checksum 9862

SEQUENCE



Initial Score = 4 Optimized Score = 4 Significance = 3.05
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

INPLAPWVNFDSFDLNVANFDNMAKYQ8EGDRLLLPLVLSVQVHHAVCDGFHVARFINRL8ELCNSKLK 
140 150 160 170 180 X 190 200 



14. US-08-249-182-1 (1-5) 

S03246 nef protein (clone HAT3) - human immunodeficiency

ENTRY S03246 #type complete
TITLE nef protein (clone HAT3) - human immunodeficiency virus type 1
ALTERNATE_NAMES 3'-orf protein
ORGANISM #formal_name human immunodeficiency virus type 1, HIV-1
DATE 28-Feb-1990; #sequence_revision 28-Feb-1990; #text_change 30-Sep-1993
ACCESSIONS S03246
REFERENCE S03244
#authors Ratner, L.; Starcich, B.; Josephs, S.F.; Hahn, B.H.; Reddy, E.P.; Livak, K.J.; Petteway Jr., S.R.; Pearson, M.L.; Haseltine, W.A.; Arya, S.K.; Wong-Staal, F.
#journal Nucleic Acids Res. (1985) 13:8219-8229
#title Polymorphism of the 3' open reading frame of the virus associated with the acquired immune deficiency syndrome, human T-lymphotropic virus type III.
#cross-references MUID:86067228
#accession S03246
##molecule_type DNA
##residues 1-204 ##label RAT
##cross-references EMBL:X03190
##note the authors translated the codon AGT for residue 11 as Gly
GENETICS
#gene nef; 3'-orf; orf-F



[Histogram of initial scores. Vertical axis: NUMBER OF SEQUENCES, logarithmic scale from 0 to 100000. Horizontal axes: SCORE (0 to 4) and STDEV (0 to 2). The plotted points did not survive scanning.]



PARAMETERS

Similarity matrix          Unitary    K-tuple              2
Mismatch penalty           1          Joining penalty      20
Gap penalty                1.00       Window size          5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40         Alignments to save   15
Optimized scores to save   0          Display context      50



SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           1       3         0.89

Times:     CPU            Total Elapsed
           00:00:48.92    00:00:54.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 3810

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found. 
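
The significance figures in this report can be read as the number of standard deviations by which a score exceeds the mean score. The report does not print the formula, so the following is only a minimal sketch under that assumption, using the rounded mean and standard deviation shown under SEARCH STATISTICS; the function name is hypothetical.

```python
def significance(score: float, mean: float, stdev: float) -> float:
    """Standard deviations by which `score` exceeds `mean` (assumed definition)."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


# Inputs as printed in the report: top initial score 4, mean 1, stdev 0.89.
z = significance(score=4, mean=1, stdev=0.89)
print(f"{z:.2f}")  # prints 3.37
```

With these rounded inputs the sketch gives about 3.37, close to the 3.38 listed for the best scores; the small gap is presumably because the report rounds the mean for display.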



The list of best scores is: 



                                                          Init. Opt.
    Sequence Name  Description                     Length Score Score Sig.  Frame

             **** 3 standard deviations above mean ****

 1. E16_BPD10      PROTEIN E16 (FRAGMENT).             26     4     4  3.38     0
 2. YBBD_ECOLI     HYPOTHETICAL 9.2 KD PROTEIN I       78     4     4  3.38     0
 3. CYB_SCAPL      CYTOCHROME B (EC 1.10.2.2) (F       98     4     4  3.38     0
 4. RHLF_ECOLI     PUTATIVE ATP-DEPENDENT RNA HE      130     4     4  3.38     0
 5. Y145_ADE07     HYPOTHETICAL 14.5 KD EARLY PR      133     4     4  3.38     0
 6. PPAF_HUMAN     RED CELL ACID PHOSPHATASE 1,       157     4     4  3.38     0
 7. NEF_HV1PV      NEGATIVE FACTOR (F-PROTEIN) (      206     4     4  3.38     0
 8. NEF_HV1BR      NEGATIVE FACTOR (F-PROTEIN) (      206     4     4  3.38     0
 9. NEF_HV112      NEGATIVE FACTOR (F-PROTEIN) (      206     4     4  3.38     0
10. NEF_HV1RH      NEGATIVE FACTOR (F-PROTEIN) (      208     4     4  3.38     0
11. CAT3_ECOLI     CHLORAMPHENICOL ACETYLTRANSFE      213     4     4  3.38     0
12. NEF_HV1JR      NEGATIVE FACTOR (F-PROTEIN) (      216     4     4  3.38     0
13. RS3_EUGGR      30S RIBOSOMAL PROTEIN S3.          218     4     4  3.38     0
14. TOX2_BORPE     PERTUSSIS TOXIN SUBUNIT 2 (S2      226     4     4  3.38     0
15. RGP1_ORYSA     GTP-BINDING REGULATORY PROTEI      226     4     4  3.38     0
16. YGGH_ECOLI     HYPOTHETICAL 25.9 KD PROTEIN       230     4     4  3.38     0
17. ELI5_HORVU     HIGH MOLECULAR MASS EARLY LIG      231     4     4  3.38     0
18. YIGA_ECOLI     HYPOTHETICAL 26.7 KD PROTEIN       235     4     4  3.38     0
19. VIB1_AGRT9     VIRB1 PROTEIN PRECURSOR.           239     4     4  3.38     0
20. NGFA_MOUSE     7S NERVE GROWTH FACTOR ALPHA       256     4     4  3.38     0
21. PRIA_LENED     PRIA PROTEIN PRECURSOR.            258     4     4  3.38     0
22. NGFG_MOUSE     7S NERVE GROWTH FACTOR GAMMA       261     4     4  3.38     0
23. KAGB_MOUSE     GLANDULAR KALLIKREIN MGK-11 P      261     4     4  3.38     0
24. KAG1_MOUSE     GLANDULAR KALLIKREIN MGK-1 PR      261     4     4  3.38     0
25. EGBC_MOUSE     EPIDERMAL GROWTH FACTOR-BINDI      261     4     4  3.38     0
26. KAGR_PRANA     GLANDULAR KALLIKREIN PRECURSO      263     4     4  3.38     0
27. YIHW_ECOLI     HYPOTHETICAL 29.5 KD PROTEIN       269     4     4  3.38     0
28. TRY4_ANOGA     TRYPSIN 4 PRECURSOR (EC 3.4.2      275     4     4  3.38     0
29. ETB_STAAU      EXFOLIATIVE TOXIN B PRECURSOR      277     4     4  3.38     0
30. YIJO_ECOLI     HYPOTHETICAL 32.1 KD PROTEIN       283     4     4  3.38     0
31. RP32_CITFR     RNA POLYMERASE SIGMA-32 FACTO      284     4     4  3.38     0
32. YOHI_ECOLI     HYPOTHETICAL 35.2 KD PROTEIN       315     4     4  3.38     0
33. PSBO_WHEAT     OXYGEN-EVOLVING ENHANCER PROT      325     4     4  3.38     0
34. UL95_EBV       HYPOTHETICAL PROTEIN BGLF3.        332     4     4  3.38     0
35. IPNSJTRLP      ISOPENICILLIN N SYNTHETASE (I      333     4     4  3.38     0
36. GVPN_HALSA     GVPN PROTEIN.                      345     4     4  3.38     0
37. GVPN_HALNE     GVPN PROTEIN.                      347     4     4  3.38     0
38. GVPN_HALHA     GVPN PROTEIN.                      347     4     4  3.38     0
39. CYB_ACITR      CYTOCHROME B (EC 1.10.2.2).        380     4     4  3.38     0
40. RL3A_ARATH     60S RIBOSOMAL PROTEIN L3.          388     4     4  3.38     0



1. US-08-249-182-1 (1-5)

E16_BPD10 PROTEIN E16 (FRAGMENT).

ID   E16_BPD10 STANDARD; PRT; 26 AA.
AC   P24796;
DT   01-MAR-1992 (REL. 21, CREATED)
DT   01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE)
DT   01-MAR-1992 (REL. 21, LAST ANNOTATION UPDATE)
DE   PROTEIN E16 (FRAGMENT).
GN   E16.
OS   BACTERIOPHAGE D108.
OC   VIRIDAE; DS-DNA NONENVELOPED VIRUSES; SIPHOVIRIDAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   91057162
RA   PATO M.L., BANERJEE M., WAGONNER B.T.;
RL   NUCLEIC ACIDS RES. 18:6458-6458(1990).
DR   EMBL; X54298; BD108E15.
DR   PIR; S12146; S12146.
FT   NON_TER 26 26
SQ   SEQUENCE 26 AA; 3102 MW; 2831 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

HSRTSLIKLIHVARRELQLDDDTYRA 
10 X 20 
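
The identity figures reported for each hit (4 matches, 1 mismatch, 80%) follow from comparing the five-residue query WHVAR against a five-residue window of the database sequence with no gaps. The sketch below illustrates that arithmetic only; it is a plain sliding-window comparison with unit scoring, not FastDB's K-tuple search, and the function name is hypothetical. The target string is copied as printed in the alignment above.

```python
def best_ungapped_match(query: str, target: str) -> tuple[int, int, float]:
    """Slide `query` along `target` without gaps.

    Returns (0-based offset, matching residues, percent identity)
    for the first window with the most matches.
    """
    if not query or len(target) < len(query):
        raise ValueError("query must be non-empty and no longer than target")
    best_offset, best_matches = 0, -1
    for offset in range(len(target) - len(query) + 1):
        window = target[offset:offset + len(query)]
        matches = sum(q == t for q, t in zip(query, window))
        if matches > best_matches:
            best_offset, best_matches = offset, matches
    identity = 100.0 * best_matches / len(query)
    return best_offset, best_matches, identity


offset, matches, identity = best_ungapped_match(
    "WHVAR", "HSRTSLIKLIHVARRELQLDDDTYRA"
)
print(offset, matches, identity)  # prints: 9 4 80.0
```

The best window is IHVAR (residues 10 to 14 in 1-based numbering), where only the leading W of the query fails to match.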



2. US-08-249-182-1 (1-5)

YBBD_ECOLI HYPOTHETICAL 9.2 KD PROTEIN IN RHSD 3'REGION (ORFD

ID   YBBD_ECOLI STANDARD; PRT; 78 AA.
AC   P33669;
DT   01-FEB-1994 (REL. 28, CREATED)
DT   01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   HYPOTHETICAL 9.2 KD PROTEIN IN RHSD 3'REGION (ORFD3).
GN   YBBD.
OS   ESCHERICHIA COLI.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC   ENTEROBACTERIACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=K12;
RM   92115567
RA   SADOSKY A.B., GRAY J.A., HILL C.W.;
RL   NUCLEIC ACIDS RES. 19:7177-7183(1991).
DR   EMBL; L19084; ECORFD23U.
DR   ECOGENE; EG11770; YBBD.
KW   HYPOTHETICAL PROTEIN.
SQ   SEQUENCE 78 AA; 9157 MW; 33115 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

FIFAIIIIVVLCVITYLYLYKDESLVSKHYINYMAIPENDGVFTWLPDFFPHVARGYINIHKCRR 
20 30 40 50 60 X 70 



3. US-08-249-182-1 (1-5) 

CYB_SCAPL CYTOCHROME B (EC 1.10.2.2) (FRAGMENT).

ID   CYB_SCAPL STANDARD; PRT; 98 AA.
AC   P29672;
DT   01-APR-1993 (REL. 25, CREATED)
DT   01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   CYTOCHROME B (EC 1.10.2.2) (FRAGMENT).
GN   COB OR CYTB.
OS   SCAPHIRHYNCHUS PLATORYNCHUS (STURGEON).
OG   MITOCHONDRION.
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; PISCES; GNATHOSTOMATA;
OC   OSTEICHTHYES; ACTINOPTERYGII; ACIPENSERIFORMES.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   92130804
RA   NORMARK B.B., MCCUNE A.R., HARRISON R.G.;
RL   MOL. BIOL. EVOL. 8:819-834(1991).
CC   -!- FUNCTION: COMPONENT OF THE UBIQUINOL-CYTOCHROME C REDUCTASE
CC       COMPLEX (COMPLEX III OR CYTOCHROME B-C1 COMPLEX), WHICH IS A
CC       RESPIRATORY CHAIN THAT GENERATES AN ELECTROCHEMICAL POTENTIAL
CC       COUPLED TO ATP SYNTHESIS.
CC   -!- CATALYTIC ACTIVITY: QH(2) + 2 FERRICYTOCHROME C = Q +
CC       2 FERROCYTOCHROME C.
CC   -!- SUBUNIT: THE MAIN SUBUNITS OF COMPLEX B-C1 ARE: CYTOCHROME B,
CC       CYTOCHROME C1 AND THE RIESKE PROTEIN.
CC   -!- COFACTOR: TWO HEME GROUPS (B562 AND B566) WHICH ARE NOT COVALENTLY
CC       BOUND TO THE PROTEIN.
DR   EMBL; M64921; MISPCOC.
DR   PROSITE; PS00192; CYTOCHROME_B_HEME.
DR   PROSITE; PS00193; CYTOCHROME_B_QO.
KW   ELECTRON TRANSPORT; MITOCHONDRION; RESPIRATORY CHAIN; TRANSMEMBRANE;



KW   HEME.
FT   NON_TER       1      1
FT   METAL        48     48       IRON 1 (HEME B562 AXIAL LIGAND).
FT   METAL        62     62       IRON 2 (HEME B566 AXIAL LIGAND).
FT   NON_TER      98     98
SQ   SEQUENCE 98 AA; 11031 MW; 56757 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

LTGLFLAMHYTADISTAFSSVAHICRDVNYGWLIRNVHANGASFFFICLYLHVARGNYYGSYLQKETNNIGV 
20 30 40 50 60 X 70 80 

VLLLLTMMTAFVGYVL 
90 



4. US-08-249-182-1 (1-5) 

RHLF_ECOLI PUTATIVE ATP-DEPENDENT RNA HELICASE RHLF (FRAGMENT

ID   RHLF_ECOLI STANDARD; PRT; 130 AA.
AC   P30015;
DT   01-APR-1993 (REL. 25, CREATED)
DT   01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   PUTATIVE ATP-DEPENDENT RNA HELICASE RHLF (FRAGMENT).
GN   RHLF.
OS   ESCHERICHIA COLI.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC   ENTEROBACTERIACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=K12;
RM   93094287
RA   HUANG S., DEUTSCHER M.P.;
RL   J. BIOL. CHEM. 267:25609-25613(1992).
CC   -!- SIMILARITY: TO OTHER RNA HELICASES.
DR   EMBL; L01622; ECRNTASET.
DR   ECOGENE; EG11548; RHLF.
DR   PIR; B45065; B45065.
DR   PROSITE; PS00039; DEAD_ATP_HELICASE.
KW   HELICASE; ATP-BINDING; RNA-BINDING.
FT   NP_BIND 51 58 ATP (BY SIMILARITY).
FT   NON_TER 130 130
SQ   SEQUENCE 130 AA; 14560 MW; 77456 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

HADNPDPSSLLPDVFSPATRDWFLRAFKQPTAV8P8TWHVAARSEHALVIAPTGSGKTLAAFLYALDRLFRE 
10 20 30 40 X 50 60 70 



GGEDTREAHKRKTSRILYIS
80 90 



5. US-08-249-182-1 (1-5) 

Y145_ADE07 HYPOTHETICAL 14.5 KD EARLY PROTEIN.

ID   Y145_ADE07 STANDARD; PRT; 133 AA.
AC   P05667;
DT   01-NOV-1988 (REL. 09, CREATED)
DT   01-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE)
DT   01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE   HYPOTHETICAL 14.5 KD EARLY PROTEIN.
OS   HUMAN ADENOVIRUS TYPE 7.
OC   VIRIDAE; DS-DNA NONENVELOPED VIRUSES; ADENOVIRIDAE; MASTADENOVIRUSES.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=GOMEN;
RM   83183660
RA   ENGLER J.A., HOPPE M.S., VAN BREE M.P.;
RL   GENE 21:145-159(1983).
DR   EMBL; X03000; AD7001.
KW   HYPOTHETICAL PROTEIN; EARLY PROTEIN.
SQ   SEQUENCE 133 AA; 14557 MW; 80147 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

MRDDAAVD I LDLTPLGES YRPRELEPEREFNR INLG I VDGGLPKDFLHVARVVLVGDLGHELLDLFLLE I SA 
10 20 30 40 X 50 60 70 



ARSLDGGREVVGDAPNELRESIHARLVPD 
80 90 100 



6. US-08-249-182-1 (1-5)

PPAF_HUMAN RED CELL ACID PHOSPHATASE 1, ISOZYME F (EC 3.1.3.2

ID   PPAF_HUMAN STANDARD; PRT; 157 AA.
AC   P24666;
DT   01-MAR-1992 (REL. 21, CREATED)
DT   01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE)
DT   01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE   RED CELL ACID PHOSPHATASE 1, ISOZYME F (EC 3.1.3.2) (ACP1).
GN   ACP1.
OS   HOMO SAPIENS (HUMAN).
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC   EUTHERIA; PRIMATES.
RN   [1]
RP   B ALLELE, SEQUENCE.
RM   92041911
RA   DISSING J., JOHNSEN A.H., SENSABAUGH G.F.;
RL   J. BIOL. CHEM. 266:20619-20625(1991).
RN   [2]
RP   A AND C ALLELES, SEQUENCE.
RM   92329495
RA   DISSING J., JOHNSEN A.H.;
RL   BIOCHIM. BIOPHYS. ACTA 1121:261-268(1992).
CC   -!- FUNCTION: HAS BEEN SHOWN TO HYDROLYZE FMN AND PROTEIN TYROSINE
CC       PHOSPHATE.
CC   -!- CATALYTIC ACTIVITY: AN ORTHOPHOSPHORIC MONOESTER + H(2)O = AN
CC       ALCOHOL + ORTHOPHOSPHATE.
CC   -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
CC   -!- HUMAN ACP1 IS GENETICALLY POLYMORPHIC, AND THREE COMMON ALLELES
CC       SEGREGATING AT THE ACP1 LOCUS GIVE RISE TO SIX PHENOTYPES. EACH
CC       ALLELE APPEARS TO ENCODE TWO ELECTROPHORETICALLY DIFFERENT
CC       ISOZYMES, F AND S, WHICH ARE PRODUCED IN ALLELE-SPECIFIC RATIOS.
CC       THE SEQUENCE SHOWN IS THAT OF ALLELES B AND C.
CC   -!- ALTERNATIVE SPLICING: THE F AND S ISOZYMES PROBABLY RESULT FROM
CC       ALTERNATIVE SPLICING OF THE PRIMARY RNA TRANSCRIPT. THE RATIO OF
CC       F TO S IS 2:1 IN ALLELE A, 4:1 IN ALLELE B, AND 1:4 IN ALLELE C.
CC   -!- THIS PHOSPHATASE IS INHIBITED BY SULFHYDRYL REAGENTS.
CC   -!- SIMILARITY: TO BOVINE LOW MOLECULAR WEIGHT CYTOSOLIC ACID
CC       PHOSPHATASE.
DR   PIR; A39491; A39491.
DR   MIM; 171500; TENTH EDITION.
KW   HYDROLASE; ACETYLATION; ALTERNATIVE SPLICING.
FT   MOD_RES       1      1       ACETYLATION.
FT   ACT_SITE     12     12       PROBABLE.
FT   ACT_SITE     17     17       PROBABLE.
FT   VARIANT     105    105       Q -> R (IN ALLELE A).
SQ   SEQUENCE 157 AA; 17996 MW; 118642 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

IAEAVFRKLVWDQNISENWRVDSAATSGYEIGNPPDYRGQSCHKRHGIPNSHVARQITKEDFATFDYILCND 
30 40 50 60 70 X 80 90 



ESNLRDLNRKSNQVKTCKAKIELLGSYDPQKQL 
100 110 120 



7. US-08-249-182-1 (1-5) 

NEF_HV1PV NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF

ID   NEF_HV1PV STANDARD; PRT; 206 AA.
AC   P03405;
DT   21-JUL-1986 (REL. 01, CREATED)
DT   21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF).
GN   NEF.
OS   HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 (PV22 ISOLATE) (HIV-1).
OC   VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; RETROVIRIDAE;
OC   LENTIVIRINAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   85111157
RA   MUESING M.A., SMITH D.H., CABRADILLA C.D., BENTON C.V., LASKY L.A.,
RA   CAPON D.J.;
RL   NATURE 313:450-458(1985).
RN   [2]
RP   POST-TRANSLATIONAL MODIFICATIONS, FUNCTION.
RM   88039140
RA   GUY B., KIENY M.-P., RIVIERE Y., LE PEUCH C., DOTT K., GIRARD M.,
RA   MONTAGNIER L., LECOCQ J.-P.;
RL   NATURE 330:266-269(1987).
CC   -!- FUNCTION: NEF HAS GTPASE, GTP-BINDING AND AUTOPHOSPHORYLATING
CC       ACTIVITIES, IT SEEM TO DOWN REGULATE THE CD4(T4) ANTIGEN.
DR   EMBL; X01762; REHTLV3.
DR   HIV; K02083; NEF$PV22.
DR   PIR; A04007; ASLJVL.
KW   AIDS; MYRISTYLATION; GTP-BINDING.
FT   LIPID 2 2 MYRISTATE.
SQ   SEQUENCE 206 AA; 23352 MW; 219653 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGMDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 



8. US-08-249-182-1 (1-5) 

NEF_HV1BR NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF

ID   NEF_HV1BR STANDARD; PRT; 206 AA.
AC   P03406;
DT   21-JUL-1986 (REL. 01, CREATED)
DT   21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF).
GN   NEF.
OS   HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 (BRU ISOLATE) (HIV-1).
OC   VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; RETROVIRIDAE;
OC   LENTIVIRINAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   85099333
RA   WAIN-HOBSON S., SONIGO P., DANOS O., COLE S., ALIZON M.;
RL   CELL 40:9-17(1985).
RN   [2]
RP   CLONE PNL4-3, SEQUENCE FROM N.A.
RA   BUCKLER C.E., BUCKLER-WHITE A.J., WILLEY R.L., MCCOY J.;
RL   SUBMITTED (JUN-1988) TO THE HIV DATA BANK.
RN   [3]
RP   POST-TRANSLATIONAL MODIFICATIONS, FUNCTION.
RM   88039140
RA   GUY B., KIENY M.-P., RIVIERE Y., LE PEUCH C., DOTT K., GIRARD M.,
RA   MONTAGNIER L., LECOCQ J.-P.;
RL   NATURE 330:266-269(1987).
CC   -!- FUNCTION: NEF HAS GTPASE, GTP-BINDING AND AUTOPHOSPHORYLATING
CC       ACTIVITIES, IT SEEM TO DOWN REGULATE THE CD4(T4) ANTIGEN.
DR   EMBL; K02013; HIVBRUCG.
DR   EMBL; M19921; REHIVNL4.
DR   HIV; K02013; NEF$BRU.
DR   HIV; M19921; NEF$NL43.
DR   PIR; A04008; ASLJFV.
KW   AIDS; MYRISTYLATION; GTP-BINDING; PHOSPHORYLATION.



FT   LIPID         2      2       MYRISTATE.
FT   MOD_RES      15     15       PHOSPHORYLATION (BY PKC).
FT   VARIANT      11     11       V -> I (IN CLONE PNL4-3).
FT   VARIANT      15     15       T -> A (IN CLONE PNL4-3).
FT   VARIANT      33     33       A -> V (IN CLONE PNL4-3).
FT   VARIANT      51     51       T -> N (IN CLONE PNL4-3).
SQ   SEQUENCE 206 AA; 23342 MW; 221722 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CYKLVPVEPDKVEEANKGENTSLLHPVSLHGNDDPEREVLEMRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 



9. US-08-249-182-1 (1-5) 

NEF_HV112 NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF

ID   NEF_HV112 STANDARD; PRT; 206 AA.
AC   P04324;
DT   20-MAR-1987 (REL. 04, CREATED)
DT   13-AUG-1987 (REL. 05, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF).
GN   NEF.
OS   HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 (CLONE 12) (HIV-1).
OC   VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; RETROVIRIDAE;
OC   LENTIVIRINAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   86177573
RA   ARYA S.K., GALLO R.C.;
RL   PROC. NATL. ACAD. SCI. U.S.A. 83:2209-2213(1986).
RN   [2]
RP   POST-TRANSLATIONAL MODIFICATIONS, FUNCTION.
RM   88039140
RA   GUY B., KIENY M.-P., RIVIERE Y., LE PEUCH C., DOTT K., GIRARD M.,
RA   MONTAGNIER L., LECOCQ J.-P.;
RL   NATURE 330:266-269(1987).
CC   -!- FUNCTION: NEF HAS GTPASE, GTP-BINDING AND AUTOPHOSPHORYLATING
CC       ACTIVITIES, IT SEEM TO DOWN REGULATE THE CD4(T4) ANTIGEN.
DR   EMBL; M11840; HIVDSN.
DR   HIV; M11840; NEF$PCV12.
DR   PIR; A04006; ASLJ12.
KW   AIDS; MYRISTYLATION; GTP-BINDING.
FT   LIPID 2 2 MYRISTATE.
SQ   SEQUENCE 206 AA; 23366 MW; 218839 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CYKLVPVEPEKLEEANKGENTSLLHPVSLHGNDDPEREVLEWRFDSRLAFHHVARELHPEYFKNC 
150 160 170 180 190 X X 200 



10. US-08-249-182-1 (1-5) 

NEF_HV1RH NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF

ID   NEF_HV1RH STANDARD; PRT; 208 AA.
AC   P05858;
DT   01-NOV-1988 (REL. 09, CREATED)
DT   01-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF).
GN   NEF.
OS   HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 (RF/HAT ISOLATE) (HIV-1).
OC   VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND; RETROVIRIDAE;
OC   LENTIVIRINAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RA   STARCICH B.R., HAHN B.H., SHAW G.M., MCNEELY P.D., MODROW S.,
RA   WOLF H., PARKS E.S., PARKS W.P., JOSEPHS S.F., GALLO R.C.,
RA   WONG-STAAL F.;
RL   SUBMITTED (XXX-1987) TO THE HIV DATA BANK.
CC   -!- FUNCTION: NEF HAS GTPASE, GTP-BINDING AND AUTOPHOSPHORYLATING
CC       ACTIVITIES, IT SEEM TO DOWN REGULATE THE CD4(T4) ANTIGEN.
DR   EMBL; M17451; HIVRF.
DR   HIV; M17451; NEF$RF.
KW   AIDS; MYRISTYLATION; GTP-BINDING.
FT   LIPID 2 2 MYRISTATE (BY SIMILARITY).
SQ   SEQUENCE 208 AA; 23532 MW; 229900 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CFKLVPVEPDKVEEATEGENNSLLHPICLHGMDDPEKEVLVHKFDSRLAFHHVAREKHPEYYKDC 
150 160 170 180 190 X 200 



11. US-08-249-182-1 (1-5) 

CAT3_ECOLI CHLORAMPHENICOL ACETYLTRANSFERASE III (EC 2.3.1.28

ID   CAT3_ECOLI STANDARD; PRT; 213 AA.
AC   P00484;
DT   21-JUL-1986 (REL. 01, CREATED)
DT   01-AUG-1988 (REL. 08, LAST SEQUENCE UPDATE)
DT   01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE   CHLORAMPHENICOL ACETYLTRANSFERASE III (EC 2.3.1.28).
GN   CAT3.
OS   ESCHERICHIA COLI.
OG   PLASMID R387.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC   ENTEROBACTERIACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   88339790
RA   MURRAY I.A., HAWKINS A.R., KEYTE J.W., SHAW W.V.;
RL   BIOCHEM. J. 252:173-179(1988).
RN   [2]
RP   PRELIMINARY SEQUENCE.
RM   83181524
RA   PACKMAN L.C., KAYE N.M.C., FITTON J.E.;
RL   UNPUBLISHED RESULTS, CITED BY:
RL   SHAW W.V.;
RL   CRC CRIT. REV. BIOCHEM. 14:1-46(1983).
RN   [3]
RP   X-RAY CRYSTALLOGRAPHY (1.75 ANGSTROMS).
RM   88247977
RA   LESLIE A.G.W., MOODY P.C.E., SHAW W.V.;
RL   PROC. NATL. ACAD. SCI. U.S.A. 85:4133-4137(1988).
RN   [4]
RP   X-RAY CRYSTALLOGRAPHY (1.75 ANGSTROMS).
RM   90250768
RA   LESLIE A.G.W.;
RL   J. MOL. BIOL. 213:167-186(1990).
CC   -!- FUNCTION: THIS ENZYME IS AN EFFECTOR OF CHLORAMPHENICOL RESISTANCE
CC       IN BACTERIA.
CC   -!- CATALYTIC ACTIVITY: ACETYL-COA + CHLORAMPHENICOL = COA +
CC       CHLORAMPHENICOL 3-ACETATE.
CC   -!- SUBUNIT: HOMOTRIMER.
DR   EMBL; X07848; SFCAT3.
DR   PIR; A00567; XXECC3.
DR   PIR; S00602; XXEBCF.
DR   PDB; 1CLA; 15-OCT-92.
DR   PDB; 2CLA; 15-JUL-90.
DR   PDB; 3CLA; 15-JUL-92.
DR   PDB; 4CLA; 15-JUL-92.
DR   PROSITE; PS00100; CAT.
KW   ANTIBIOTIC RESISTANCE; TRANSFERASE; ACYLTRANSFERASE; PLASMID;
KW   3D-STRUCTURE.



FT   ACT_SITE    189    189
FT   STRAND        3      5
FT   TURN          9     10
FT   TURN         12     13
FT   HELIX        14     21
FT   TURN         22     23
FT   STRAND       27     35
FT   HELIX        37     44
FT   TURN         45     45
FT   HELIX        50     62
FT   TURN         63     64
FT   HELIX        66     69
FT   STRAND       70     72
FT   STRAND       77     80
FT   STRAND       84     91
FT   TURN         92     95
FT   STRAND       96    101
FT   HELIX       108    121
FT   TURN        123    124
FT   TURN        130    131
FT   STRAND      138    144
FT   TURN        145    146
FT   TURN        160    161
FT   STRAND      166    170
FT   STRAND      173    175
FT   TURN        176    177
FT   STRAND      178    188
FT   TURN        189    191
FT   HELIX       194    208
FT   TURN        209    209
SQ   SEQUENCE 213 AA; 24993 MW; 238808 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

PUVNFDSFNLNVANFTDYFAPIITMAKYQQEGDRLLLPLSV8VHHAVCDGFHVARFINRL6ELCNSKLK 
150 160 170 180 190 X 200 210 



12. US-08-249-182-1 (1-5) 

NEF_HV1JR NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'ORF 

ID NEF.HVUR STANDARD; PRT; 216 AA. 

AC P20867; 

DT 01-FEB-1991 (REL. 17, CREATED) 

DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE NEGATIVE FACTOR (F-PROTEIN) (27 KD PROTEIN) (3'0RF). 

GN NEF. 

OS HUMAN IMMUNODEFICIENCY VIRUS TYPE 1 (JRCSF ISOLATE) (HIV-1). 

OC VIRIDAE; SS-RNA ENVELOPED VIRUSES; POSITIVE-STRAND, RETROV IRI DAE ; 

OC LENTIVIRINAE. 

RN [1] 

RP SEQUENCE FROM N.A. 

RA KOYANAGI S., CHEN I.S.Y.;

RL SUBMITTED (DEC-1988) TO THE HIV DATA BANK. 

CC -!- FUNCTION: NEF HAS GTPASE, GTP-BINDING AND AUTOPHOSPHORYLATING

CC ACTIVITIES. IT SEEMS TO DOWN REGULATE THE CD4(T4) ANTIGEN.

DR EMBL; M38429; HIVJRCSF.

DR HIV; M38429; NEF$JRCSF.

KW AIDS; MYRISTYLATION; GTP-BINDING.

FT LIPID 2 2 MYRISTATE (BY SIMILARITY). 

SQ SEQUENCE 216 AA; 24567 MW; 243460 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

CFKLVPVDPEKVEEANEGENNCLLHPMSQHGMDDPEKEVLVWKFDSKLALHHVARELHPEYYKDC 
160 170 180 190 200 X X 210 



13. US-08-249-182-1 (1-5) 

RS3_EUGGR 30S RIBOSOMAL PROTEIN S3.

ID RS3_EUGGR STANDARD; PRT; 218 AA.

AC P19169;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)

DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)

DE 30S RIBOSOMAL PROTEIN S3. 

GN RPS3. 

OS EUGLENA GRACILIS. 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; PHYCOPHYTA; EUGLENOPHYTA.

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAINS; 

RM 89063445 

RA CHRISTOPHER D.A., CUSHMAN J.C., PRICE C.A., HALLICK R.B.; 

RL CURR. GENET. 14:275-286(1988). 

CC -!- SIMILARITY: BELONGS TO THE S3P FAMILY OF RIBOSOMAL PROTEINS.

DR EMBL; M37463; CHEGRIBP.

DR PIR; S26084; S26084.

DR PIR; S34524; S34524.

DR PROSITE; PS00548; RIBOSOMAL_S3_1.

DR PROSITE; PS00734; RIBOSOMAL_S3_2.

KW RIBOSOMAL PROTEIN; CHLOROPLAST.

SQ SEQUENCE 218 AA; 25267 MW; 242884 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

VERRHYASFVKEDIVIRNFMNKELLETL1SLIKIERIYEFSEQRNNTIVYIHVARPERVIGRDGQGLSRIRD 
30 40 50 60 70 X 80 90 

I L I DRMNYLLGKTPR 1 1 TCKVVGVTSPNLD ARL 
100 110 120 



14. US-08-249-182-1 (1-5) 

TOX2_BORPE PERTUSSIS TOXIN SUBUNIT 2 (S2) PRECURSOR (ISLET-AC

ID TOX2_BORPE STANDARD; PRT; 226 AA.

AC P04978;

DT 13-AUG-1987 (REL. 05, CREATED)

DT 13-AUG-1987 (REL. 05, LAST SEQUENCE UPDATE)

DT 01-MAY-1991 (REL. 18, LAST ANNOTATION UPDATE) 

DE PERTUSSIS TOXIN SUBUNIT 2 (S2) PRECURSOR (ISLET-ACTIVATING PROTEIN) 

DE (IAP). 

OS BORDETELLA PERTUSSIS. 

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;

OC ALCALIGENACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=BP165;

RM 86259651

RA NICOSIA A., PERUGINI M., FRANZINI C., CASAGLI M.C., BORRI M.G.,

RA ANTONI G., ALMONI M., NERI P., RATTI G., RAPPUOLI R.;

RL PROC. NATL. ACAD. SCI. U.S.A. 83:4631-4635(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RM 86208173 

RA LOCHT C., KEITH J.M.;

RL SCIENCE 232:1258-1264(1986). 

CC -!- FUNCTION: B BINDS TO RECEPTORS ON THE EUKARYOTIC CELL SURFACE AND 

CC FACILITATES THE TRANSLOCATION OF THE TOXIC SUBUNIT ACROSS THE CELL 

CC MEMBRANE. 

CC -!- SUBUNIT: PERTUSSIS TOXIN CONTAINS FIVE DIFFERENT CHAINS, S1-S5. 

CC THEY ARE ORGANIZED INTO 2 FUNCTIONAL SUBUNITS: A, COMPOSED OF S1

CC (WHICH IS TOXIC) AND B, CONTAINING S2, S3, S5, AND TWO COPIES OF

CC S4 (B BINDS TO THE MEMBRANE RECEPTORS). DIMERS OF S2-S4 AND S3-S4

CC ARE HELD TOGETHER BY S5.

DR EMBL; M14378; BPTOXS.

DR EMBL; M13223; BPTOX.

DR PIR; B24144; UEBR2P.

KW MEMBRANE; TOXIN; SIGNAL; WHOOPING COUGH. 

FT SIGNAL 1 27 

FT CHAIN 28 226 TOXIN SUBUNIT 2. 

FT SIMILAR 28 226 67% WITH S3 SUBUNIT.

SQ SEQUENCE 226 AA; 24829 MW; 287397 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

MPIDRKTLCHLLSVLPLALLGSHVARASTPGIVIPPQE9IT8HGSPYGRCANKTRALTVAELRGSGDL8EYL 
10 20 X X 30 40 50 60 70 

RHVT 



15. US-08-249-182-1 (1-5) 

RGP1_ORYSA GTP-BINDING REGULATORY PROTEIN RGP1.

ID RGP1_ORYSA STANDARD; PRT; 226 AA.

AC P25766;

DT 01-MAY-1992 (REL. 22, CREATED)

DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)

DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)

DE GTP-BINDING REGULATORY PROTEIN RGP1.

GN RGP1.

OS ORYZA SATIVA (RICE).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE;

OC CYPERALES; GRAMINEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=CV. GINBOZU;

RM 91360069

RA SANO H., YOUSSEFIAN S.;

RL MOL. GEN. GENET. 228:227-232(1991).

CC -!- FUNCTION: MAY PLAY AN IMPORTANT ROLE IN PLANT GROWTH AND 
CC DEVELOPMENT. 

CC -!- DEVELOPMENTAL STAGE: DURING SEEDLING RGP1 EXPRESSION WAS FIRST 
CC OBSERVED 14 DAYS AFTER GERMINATION, REACHING A MAXIMUM LEVEL 

CC BETWEEN 28 AND 42 DAYS, AND GRADUALLY DECREASED THEREAFTER UNTIL 

CC 63 DAYS WHEN IT ATTAINED THE SAME LEVEL OF EXPRESSION AS IN 14-DAY 

CC OLD SEEDLINGS. 

CC -!- SIMILARITY: TO RAS PROTEINS. BELONGS TO YPT1 SUB-FAMILY. 

DR EMBL; X59276; OSRGP1.

DR PIR; S16554; S16554.

KW GTP-BINDING; LIPOPROTEIN; PRENYLATION.

FT NP_BIND     25    32  GTP (BY SIMILARITY).

FT NP_BIND     73    77  GTP (BY SIMILARITY).

FT NP_BIND    131   134  GTP (BY SIMILARITY).

FT DOMAIN      47    55  EFFECTOR REGION (POTENTIAL).

FT LIPID      223   223  GERANYL-GERANYL (BY SIMILARITY).

FT LIPID      224   224  GERANYL-GERANYL (BY SIMILARITY).

SQ SEQUENCE 226 AA; 24852 MW; 266731 CN;



Initial Score = 4 Optimized Score = 4 Significance = 3.38
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
WHVAR
||||

TRTLHIDARTVKA9TRDTAGQERYRAVTSAYYRGAVGANLVYDITKRQSFDHVARWLEELRGHADKNIVIML 
60 70 80 90 100 110 120 



IGNKSDLGTLRVVPTEDAKEFAERENLFFMETS 
130 140 150 160 

> 0 < 

IntelliGenetics

Query sequence being compared: US-08-249-182-2 (1-6)
Number of sequences searched: 42145
Number of scores above cutoff: 3792

Results of the initial comparison of US-08-249-182-2 (1-6) with:

PARAMETERS



Similarity matrix         Unitary     K-tuple              2
Mismatch penalty          1           Joining penalty      20
Gap penalty               1.00        Window size          5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40          Alignments to save   15
Optimized scores to save  0           Display context      50






SEARCH STATISTICS

Scores:      Mean          Median        Standard Deviation
             0             1             1.10

Times:       CPU                         Total Elapsed
             00:00:27.91                 00:00:30.00

Number of residues:              5287517
Number of sequences searched:    42145
Number of scores above cutoff:   3792





Cut-off raised to 2. 
Cut-off raised to 3.
Cut-off raised to 4. 



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
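
The Sig. values in this listing can be checked against the search statistics reported for this run. The following is a minimal sketch, assuming that significance is simply the number of standard deviations by which the initial score exceeds the mean; the function name `significance` is hypothetical, and the mean (0) and standard deviation (1.10) are the rounded values printed under SEARCH STATISTICS.

```python
def significance(initial_score: float, mean: float, std_dev: float) -> float:
    """Standard deviations by which a score exceeds the mean (assumed definition)."""
    return (initial_score - mean) / std_dev


# Rounded statistics reported for this search: mean 0, standard deviation 1.10.
for score in (5, 4):
    print(score, round(significance(score, 0.0, 1.10), 2))
```

Under that assumption, initial scores of 5 and 4 give 4.55 and 3.64, which agrees with the Sig. column of this listing.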

A 100% identical sequence to the query sequence was not found.
The list of best scores is: 

                                                     Init.  Opt.
     Sequence Name  Description              Length  Score  Score  Sig.  Frame

**** 4 standard deviations above mean ****
  1. R37444  Autotaxin peptide ATX 19.             7     5     5   4.55    0
  2. R52633  Guinea pig PH-30, 30 alpha su       289     5     5   4.55    0
  3. R39354  EpiP protein.                       461     5     5   4.55    0
  4. R41043  CD4-EBA175 fusion protein.         1786     5     5   4.55    0
**** 3 standard deviations above mean ****
  5. R31940  In vivo tumour binding peptid         5     4     4   3.64    0
  6. P50562  Sequence of peptide with immu         5     4     4   3.64    0
  7. R24686  Immunomodulatory peptide.             6     4     4   3.64    0
  8. R24619  Immunomodulatory peptide.             6     4     4   3.64    0
  9. R24632  Immunomodulatory peptide.             6     4     4   3.64    0
 10. R24639  Immunomodulatory peptide.             7     4     4   3.64    0
 11. R24589  Immunomodulatory peptide.             7     4     4   3.64    0
 12. R24588  Immunomodulatory peptide.             7     4     4   3.64    0
 13. R21056  Gamma-carboxylase, N-terminus        10     4     4   3.64    0
 14. R30889  Cell adhesion polypeptide.           19     4     4   3.64    0
 15. R26829  Cell adhesion polypeptide.           19     4     4   3.64    0
 16. R13381  Vascular permeability factor         21     4     4   3.64    0
 17. R36681  Guinea pig VPF N-terminal.           25     4     4   3.64    0
 18. R11665  N-terminal sequence of vascul        36     4     4   3.64    0
 19. P70481  Amino-terminal amino acid seq        52     4     4   3.64    0
 20. P70669  Sequence of N-terminal of bov        56     4     4   3.64    0
 21. P82904  Acetylcholinesterase-like pro        81     4     4   3.64    0
 22.                                              77
 23. R22364  GroES structural protein.           102               3.64
 24. R29702  Tapetum protein from A9 gene        105               3.64
 25. P81001  Sequence (I) of human granulo       128               3.64
 26. R34496  Bovine acidic Fibroblast Grow       140               3.64
 27. R25915  Human acidic fibroblast growt       140               3.64
 28. R13030  Brain-derived acidic fibrobla       140               3.64
 29. P90069  Bovine acidic fibroblast grow       140               3.64
 30. R25569  Recombinant bovine Ala47, Gly       141               3.64
 31. R34010  Bovine lysozyme c.                  145               3.64
 32. P92066  Amino acid sequence of bovine       145               3.64
 33. P91300  Signal sequence fused to the        147               3.64
 34. R14626  Beta-lactoglobulin contg. pos       162               3.64
 35. R42169  Haemopoietic stem cell promot       188               3.64
 36. R41349  NXG2 encoded xyloglucanase.         190               3.64
 37. P91467  Peptide with fibrin-binding ac      192               3.64
 38. R26064  Human FcERI alpha-subunit and       235               3.64
 39. R47462  Truncated xylanase (XYLA).          236               3.64
 40. R47457  Truncated xylanase (XYLA).          241               3.64



1. US-08-249-182-2 (1-6) 

R37444 Autotaxin peptide ATX 19. 

ID R37444 standard; peptide; 7 AA. 

AC R37444; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 19. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

FH Key Location/Qualifiers 

FT Modified_site 2

FT /note= "potentially glycosylated residue" 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043.

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 19. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 7 AA; 

SQ 0 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

SQ 1 Others;

Initial Score = 5 Optimized Score = 5 Significance = 4.55
Residue Identity = 83% Matches = 5 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
|||||

PXLDVYK 



A A 



2. US-08-249-182-2 (1-6)

R52633 Guinea pig PH-30, 30 alpha subunit.

ID R52633 standard; Protein; 289 AA.

AC R52633;

DT 22-JUN-1994 (first entry)

DE Guinea pig PH-30, 30 alpha subunit.

KW PH-20; PH-30; contraceptive; fertilisation; sperm surface protein;

KW vaccine; sperm-egg fusion.

OS Cavia cobaya. 

PN WO9325233-A.

PD 23-DEC-1993.

PF 10-JUN-1993; U05640.

PR 12-JUN-1992; US-897883.

PA (UYCO-) UNIV CONNECTICUT.

PI Myles DG, Primakoff P;

DR WPI; 94-007200/01.

DR N-PSDB; Q54636.

PT Contraceptive vaccine for reducing sperm-egg fusion - comprises

PT peptide from sperm surface protein which stimulates antibody

PT prodn.

PS Example 5; Fig 8B; 79pp; English.

CC Sperm surface proteins or peptides stimulate an immune response to

CC produce antibodies which block sperm-egg fusion and provide

CC contraception. Pref. sperm surface proteins are the PH-20 and PH-30

CC sperm surface proteins.

SQ Sequence 289 AA;

SQ 26 A; 2 R; 14 N; 12 D; 0 B; 23 C; 8 Q; 26 E; 0 Z; 32 G; 4 H;

SQ 19 I; 17 L; 9 K; 1 M; 8 F; 29 P; 24 S; 22 T; 2 W; 5 Y; 6 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.55 
Residue Identity = 83% Matches = 5 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||| ||

YCTGQSGKCPLDTYK8DGTPCNEGFFCVSKGCTDPGIQCATYFGHGARSAPDACYTTLNSIGNIF 
10 X 20 30 40 50 60 
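
The Residue Identity figures in these alignments appear to follow directly from the match and mismatch counts. Below is a minimal sketch, assuming identity is the percentage of matches over aligned positions (matches plus mismatches) truncated to a whole number; the function name `residue_identity` is hypothetical, truncation rather than rounding is an inference from the 66% reported for 4 matches and 2 mismatches, and gapped alignments (none occur in this listing) are not handled.

```python
def residue_identity(matches: int, mismatches: int) -> int:
    """Percent identity over aligned positions, truncated (assumed definition)."""
    aligned = matches + mismatches
    if aligned == 0:
        raise ValueError("alignment has no aligned positions")
    return 100 * matches // aligned


# Match/mismatch pairs that occur in the alignments of this search.
for matches, mismatches in ((5, 1), (4, 1), (4, 2)):
    print(matches, mismatches, residue_identity(matches, mismatches))
```

With these inputs the sketch yields 83, 80 and 66, the three identity values that appear for the PLDVYK query.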



3. US-08-249-182-2 (1-6) 
R39354 EpiP protein. 

ID R39354 standard; Protein; 461 AA.

AC R39354;

DT 15-SEP-1993 (first entry)

DE EpiP protein.

KW Epidermin; derivatives; lantibiotic.

OS Staphylococcus epidermidis.

PN EP-543195-A.

PD 26-MAY-1993.

PF 30-OCT-1992; 118598.

PR 31-OCT-1991; US-784234.

PA (THOM ) THOMAE GMBH KARL.

PI Augustin J, Engelke G, Entian K, Gotz F, Jung G, Kaletta C, Klein C;

PI Kellner R, Kupke T, Rosenstein R, Schnell N, Wieland B;

DR WPI; 93-168917/21.

DR N-PSDB; Q42541.

PT Novel DNA molecule - encoding Epi B, C, D, P or Q enzymes

PT involved in biosynthesis of lantibiotic epidermin.

PS Claim 5; Fig 9; 52pp; English.

CC The sequence is that of EpiP which is believed to be responsible



CC for cleaving mature epidermin from the N-terminal leader peptide,
CC based on its striking homologies with the essential domain of 
CC serine proteases. 
SQ Sequence 461 AA;

SQ 21 A; 8 R; 47 N; 30 D; 0 B; 2 C; 13 Q; 26 E; 0 Z; 32 G; 5 H;
SQ 27 I; 38 L; 67 K; 8 M; 15 F; 8 P; 40 S; 16 T; 3 W; 22 Y; 33 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.55
Residue Identity = 83% Matches = 5 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
|||||

SLAAPKVSGALALEIDKYSLKD9PETAIELFKKKGIEKEKYMDKKHYGNGKLDVYKLLKE 
410 420 430 440 450 X 460 




4. US-08-249-182-2 (1-6) 

R41043 CD4-EBA175 fusion protein. 

ID R41043 standard; protein; 1786 AA. 

AC R41043; 

DT 22-MAR-1994 (first entry)

DE CD4-EBA175 fusion protein.

KW Merozoite; Erythrocyte Binding Antigen 175; malaria; HIV; env;

KW human immunodeficiency virus; envelope glycoprotein; hybrid protein;

KW red blood cell; erythrocyte; AIDS; molecular machine.

OS Chimeric Homo sapiens.

OS Chimeric Plasmodium falciparum.

FH Key Location/Qualifiers

FT Region 1..371

FT /note= "residues 1-371 of CD4"

FT Region 372..1786

FT /note= "residues 20-1435 of EBA-175"

PN WO9318160-A.

PD 16-SEP-1993.

PF 10-MAR-1993; G00505.

PR 11-MAR-1992; GB-005276.

PR 08-JUL-1992; GB-014481.

PR 24-JUL-1992; GB-015829.

PR 16-SEP-1992; GB-019562.

PR 03-MAR-1993; GB-004311.

PA (PREN/) PRENDERGAST K F.

PI Prendergast KF;

DR WPI; 93-303474/38.

PT Anti-viral fusion peptide(s) - comprise viral-binding component

PT and malaria merozoite red cell binding component, for treating

PT e.g. HIV, and hepatitis

PS Claim 9; Page 44-47; 69pp; English.

CC The hybrid protein NH2-CD4(1-371)-EBA175(20-1435)-COOH is a

CC specifically claimed example of a fusion protein of the invention;

CC it comprises at least part of the CD4 molecule fused to a peptide

CC from a malarial parasite merozoite protein with affinity for red

CC blood cells. The fusion protein can bind free HIV in the blood to

CC red blood cells and consequently reduce viral titre, prevent

CC transmission of the virus and improve safety of blood transfusions.

SQ Sequence 1786 AA;

SQ 47 A; 65 R; 163 N; 126 D; 0 B; 41 C; 63 Q; 169 E; 0 Z; 73 G; 40 H;

SQ 103 I; 134 L; 202 K; 29 M; 55 F; 51 P; 160 S; 98 T; 28 W; 54 Y; 85 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.55

Residue Identity = 83% Matches = 5 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK



VWVLNPEAGMWQCLLSDSGQVLLESNIKVLPTWSTPVKARNEYDIKENEKFLDVYKEKFNELDKKKYGNVQK 
340 350 360 370 380 X 390 400 

TDKKIFTFIENKLDILMNSKFNKRWKSYGTPDNI 
410 420 430 440 



5. US-08-249-182-2 (1-6)

R31940 In vivo tumour binding peptide contg. Leu-Asp-Val.

ID R31940 standard; peptide; 5 AA.

AC R31940;

DT 04-JUN-1993 (first entry)

DE In vivo tumour binding peptide contg. Leu-Asp-Val.

KW Diagnosis; in vivo tumour imaging; therapy; treatment; cancer; LDV.

OS Synthetic.

FH Key Location/Qualifiers

FT Region 2..4

FT /note= "LDV - binds to tumour LDV binding sites"

PN EP-527056-A.

PD 10-FEB-1993.

PF 06-AUG-1992; 307198.

PR 06-AUG-1991; GB-016925.

PA (ANTI-) ANTISOMA LTD.

PI Stuttle AWJ;

DR WPI; 93-046975/06.

PT Peptide(s) contg. leucine-aspartic acid-valine sequence, bind

PT tumours in vivo - for diagnosing and treating tumours

PS Claim 3; Page 9; 13pp; English.

CC The peptide contains the sequence Leu-Asp-Val (LDV) and as a result

CC is capable of binding in vivo with pathological tissues such as

CC tumours contg. an LDV binding site. It may have an attached or

CC conjugated radioactive label or cytotoxin. It can be used in in vivo

CC tumour imaging, diagnosis and treatment of tumours. Using the peptide

CC the disadvantages of monoclonal antibody based cytotoxic and

CC diagnostic reagents are overcome. It has specific binding properties

CC to tumour associated binding sites and is rapidly transported to the

CC tumour site following injection. It has rapid clearance of unbound

CC reagent from the body following admin. thereby considerably reducing

CC treatment times and reducing effective dosage of potentially highly

CC toxic reagents whether administered for diagnostic or therapeutic

CC purposes.

SQ Sequence 5 AA;

SQ 0 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 2 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||||

YLDVY 
X X 



6. US-08-249-182-2 (1-6) 

P50562 Sequence of peptide with immunomodulatory activity

ID P50562 standard; peptide; 5 AA.

AC P50562;

DT 29-NOV-1991 (first entry)

DE Sequence of peptide with immunomodulatory activity.



KW Immunomodulatory; lymphopoietic stem cell differentiation;

KW haemopoiesis; DiGeorge syndrome; therapy.

FH Key Location/Qualifiers

FT Modified_site 1

FT /label= N-alpha-acetyl-Arg

FT Modified_site 2

FT /label= 4-methyl-Leu

FT Modified_site 5

FT /label= Tyr-OH

PN US4505853-A.

PD 19-MAR-1985.

PF 18-NOV-1983; 553281.

PR 18-NOV-1983; US-553281.

PA (ORTH ) ORTHO PHARM CORP.

PI Goldstein G, Heavner G, Kroon D, Audhya T;

DR WPI; 85-086695/14.

PT New peptide(s) - useful immuno-regulatory agents with improved

PT resistance to enzymatic degradation in body

PS Claim 28; column 38; 20pp; English.

CC The peptides of the invention induce differentiation of

CC lymphopoietic stem cells from haemopoietic tissues into thymus-

CC derived cells and so are useful for treating such conditions as

CC DiGeorge syndrome. They also assist collective body immunity and so

CC can be used with autoimmune diseases, etc. Dose is 10-100

CC micrograms/kg daily for DiGeorge syndrome.

SQ Sequence 5 AA;

SQ 0 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||||
RLDVY 
X X 



7. US-08-249-182-2 (1-6)

R24686 Immunomodulatory peptide.

ID R24686 standard; peptide; 6 AA.

AC R24686;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 10; Page 36; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 6 AA;

SQ 0 A; 1 R; 1 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 1 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||||
RNDVYK 
X X 



8. US-08-249-182-2 (1-6)

R24619 Immunomodulatory peptide.

ID R24619 standard; peptide; 6 AA.

AC R24619;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 10; Page 36; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 6 AA;

SQ 0 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 2 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||||

RKDVYK 
X X 



9. US-08-249-182-2 (1-6) 

R24632 Immunomodulatory peptide.

ID R24632 standard; peptide; 6 AA.

AC R24632;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 10; Page 36; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 6 AA;

SQ 0 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 2 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 2 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK

||||
YKDVYK 
X X 



10. US-08-249-182-2 (1-6) 

R24639 Immunomodulatory peptide.

ID R24639 standard; peptide; 7 AA.

AC R24639;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 10; Page 36; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 7 AA;

SQ 0 A; 1 R; 1 N; 1 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 1 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK

||||
RDVYKQN
X X 



11. US-08-249-182-2 (1-6)

R24589 Immunomodulatory peptide.

ID R24589 standard; peptide; 7 AA.

AC R24589;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 9; Page 34; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 7 AA;

SQ 0 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 3 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 



KEKDVYK 
X X 



12. US-08-249-182-2 (1-6) 

R24588 Immunomodulatory peptide.

ID R24588 standard; peptide; 7 AA.

AC R24588;

DT 03-DEC-1992 (first entry)

DE Immunomodulatory peptide.

KW Immunodeficiencies; immunosuppression; T-cell subset; immunotherapy;

KW inflammation; wounds; lymphocyte; vaccine.

OS Synthetic.

PN WO9209628-A.

PD 11-JUN-1992.

PF 22-NOV-1991; U08795.

PR 23-NOV-1990; US-617494.

PA (IMMU-) IMMUNODYNAMICS INC.

PI Atkin A;

DR WPI; 92-217021/26.

PT New synthetic immunomodulatory peptide(s) - for treating

PT immunodeficiencies, immunosuppression and T-cell subset

PT deviations and immuno-therapy of infections, inflammation, wounds

PT etc.

PS Claim 9; Page 34; 52pp; English.

CC The immunomodulatory peptide is a specific example of a peptide cpd.

CC (or an acid or base salt) constructed by combination and/or

CC overlapping of the amino acid sequences A1B1XB2A2, A3B3XA4B4,

CC B5A5XA6B6, B7A7XB8A8, A9B9, A10A11, B10A12, and B11B12 (X= Ala, Gly,

CC Ile, Leu, Phe or Val, A1-A12 each= Arg, Asn, Gln, Lys, Phe or Val;

CC B1-B12 each= Asp, Glu, Tyr, Phe or Val. The synthetic peptide may

CC be used for immunomodulation of various immunodeficiencies and

CC immunosuppressed conditions, T-cell subset and lymphocyte deviations,

CC enhancement of a vaccines efficacy, as well as for immunotherapy,

CC including infections, local or systemic complications of non-

CC infectious diseases, postoperative inflammations, wounds and burns.

CC See also R24583-R24701.

SQ Sequence 7 AA;

SQ 0 A; 0 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 3 K; 0 M; 0 F; 0 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK
||||



X X 



13. US-08-249-182-2 (1-6) 

R21056 Gamma-carboxylase, N-terminus.

ID R21056 standard; Protein; 10 AA.

AC R21056;

DT 01-JUN-1992 (first entry)

DE Gamma-carboxylase, N-terminus.

KW Degenerate; Vitamin K dependent proteins; PCR.

OS Homo sapiens.

PN WO9201795-A.

PD 06-FEB-1992.

PF 22-JUL-1991; U05177.

PR 23-JUL-1990; US-557220.

PR 14-MAR-1991; US-669735.

PA (ZYMO-) ZYMOGENETICS INC.

PI Berkner KL;

DR WPI; 92-064951/08.

PT Gamma-carboxylase protein compsns. - used in recombinant prodn.

PT of active vitamin=K dependent proteins

PS Claim 6; Table 9; 91pp; English.

CC Nucleotide sequences encoding gamma-carboxylase were obtd. using PCR,

CC and oligonucleotides designed from amino acid sequences determined

CC by microsequencing of partially purified material. SEQ ID No 20

CC (peptide 6) is one of seven alternative sequences for the N-terminus

CC of gamma carboxylase, due to the degeneracy of the genetic code.

CC Obtaining the full DNA and protein sequence of gamma-carboxylase

CC will allow proteins such as Factor VII, Factor IX, prothrombin,

CC (activated) protein C, protein S, protein Z, or osteocalcin to be

CC easily produced by recombinant techniques.

CC See also R21049-55, R23010.

SQ Sequence 10 AA;

SQ 0 A; 0 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 0 E; 0 Z; 1 G; 0 H;

SQ 0 I; 2 L; 1 K; 0 M; 0 F; 1 P; 1 S; 1 T; 0 W; 1 Y; 0 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK

|| ||
TLPSGLDDYK 
X 10 
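
The two SQ composition lines of a record can be checked against its sequence. The following is a minimal sketch, assuming the residue order A R N D B C Q E Z G H / I L K M F P S T W Y V seen on the SQ lines above; the function names are illustrative and are not part of FastDB or of the database software.

```python
from collections import Counter

# Residue order as it appears on the two SQ composition lines of each record.
RESIDUE_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence: str) -> dict[str, int]:
    """Count every residue code in RESIDUE_ORDER, including zero counts."""
    counts = Counter(sequence.upper())
    return {residue: counts.get(residue, 0) for residue in RESIDUE_ORDER}


def format_sq_lines(sequence: str) -> list[str]:
    """Render the composition as two 'SQ' lines of eleven residues each."""
    comp = composition(sequence)
    cells = [f"{comp[residue]} {residue};" for residue in RESIDUE_ORDER]
    return ["SQ " + " ".join(cells[:11]), "SQ " + " ".join(cells[11:])]


if __name__ == "__main__":
    # Subject sequence of record R21056 as shown in the alignment.
    for line in format_sq_lines("TLPSGLDDYK"):
        print(line)
```

A composition that does not sum to the stated sequence length is a quick indicator that either the SQ lines or the sequence line was misread during scanning.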



14. US-08-249-182-2 (1-6)
R30889 Cell adhesion polypeptide.

ID R30889 standard; peptide; 19 AA.
AC R30889;
DT 09-FEB-1993 (first entry)
DE Cell adhesion polypeptide.
KW MOLT-4; human; lymphoblastic leukaemia; A375-SM; metastatic;
KW melanoma; H1080; fibrosarcoma; LDV; LDL; IDA; inflammatory disease;
KW rheumatoid arthritis; asthma; sepsis; graft rejection; reperfusion.
OS Synthetic.
PN WO9213887-A.
PD 20-AUG-1992.
PF 06-FEB-1992; G00226.
PR 07-FEB-1991; GB-002655.
PR 08-FEB-1991; GB-002818.
PA (UYMA-) UNIV VICTORIA MANCHESTER.
PI Humphries MJ;
DR WPI; 92-279988/36.
PT New cell adhesion (poly)peptide(s) modifying cell adhesive
PT properties - useful in treating inflammatory conditions e.g.
PT rheumatoid arthritis, asthma, inflammatory bowel disease, sepsis,
PT etc.
PS Disclosure; Page 4; 23pp; English.
CC The peptide is an example of a cell adhesion polypeptide contg. the
CC amino sequence X-Asp-Y-(A)n-Phe, where X and Y = Ala, Leu, Ile or
CC Val, A = any amino acid and n = 3-10. At least a subsequence of the
CC polypeptide is adherent for MOLT-4 human lymphoblastic leukaemia,
CC A375-SM human metastatic melanoma or H1080 human fibrosarcoma cells.
CC The cell adhesion peptides are used to modify or control the
CC adhesive properties of cells, e.g. in treatment of inflammatory
CC conditions such as rheumatoid arthritis, asthma, sepsis, graft
CC rejection, inflammatory bowel disease, reperfusion of cardiac tissue
CC after myocardial infarction, and coagulatory disorders. They are
CC selective antagonists of cell adhesion, e.g. they promote adhesion
CC of the specified cells but inhibit adhesion to the natural adhesion
CC protein contg. the adhesive sequence.
CC See also R26821-30 and R30887-903.
SQ Sequence 19 AA;
SQ 2 A; 0 R; 0 N; 3 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 2 G; 0 H;
SQ 2 I; 1 L; 0 K; 0 M; 2 F; 3 P; 0 S; 0 T; 0 W; 0 Y; 2 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2
Gaps = 0 Conservative Substitutions = 0

      X    X
      PLDVYK
      ||||
PIIDVAPLDVGAPDQEFGF
      X 10 X
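
The "Residue Identity" figure in these reports follows from the match and mismatch counts. The sketch below assumes the percentage is taken over aligned positions (matches plus mismatches) and truncated to a whole number; the truncation is an inference from the report printing 66% for 4 matches out of 6 positions, not a documented FastDB rule, and the function name is illustrative.

```python
def residue_identity(matches: int, mismatches: int) -> int:
    """Percent of aligned positions that are identical, truncated to an integer.

    Assumption: gaps are excluded from the denominator, which is consistent
    with the reports above where Gaps = 0.
    """
    aligned = matches + mismatches
    if aligned == 0:
        raise ValueError("no aligned positions")
    return (100 * matches) // aligned


if __name__ == "__main__":
    print(residue_identity(4, 2))  # 4 of 6 positions identical
    print(residue_identity(5, 1))  # 5 of 6 positions identical
```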



15. US-08-249-182-2 (1-6)
R26829 Cell adhesion polypeptide.

ID R26829 standard; peptide; 19 AA.
AC R26829;
DT 09-FEB-1993 (first entry)
DE Cell adhesion polypeptide.
KW MOLT-4; human; lymphoblastic leukaemia; A375-SM; metastatic;
KW melanoma; H1080; fibrosarcoma; LDV; LDL; IDA; inflammatory disease;
KW rheumatoid arthritis; asthma; sepsis; graft rejection; reperfusion.
OS Synthetic.
PN WO9213887-A.
PD 20-AUG-1992.
PF 06-FEB-1992; G00226.
PR 07-FEB-1991; GB-002655.
PR 08-FEB-1991; GB-002818.
PA (UYMA-) UNIV VICTORIA MANCHESTER.
PI Humphries MJ;
DR WPI; 92-299988/36.
PT New cell adhesion (poly)peptide(s) modifying cell adhesive
PT properties - useful in treating inflammatory conditions e.g.
PT rheumatoid arthritis, asthma, inflammatory bowel disease, sepsis,
PT etc.
PS Disclosure; Page 4; 23pp; English.
CC The peptide is an example of a cell adhesion polypeptide contg. the
CC amino sequence X-Asp-Y-(A)n-Phe, where X and Y = Ala, Leu, Ile or
CC Val, A = any amino acid and n = 3-10. At least a subsequence of the
CC polypeptide is adherent for MOLT-4 human lymphoblastic leukaemia,
CC A375-SM human metastatic melanoma or H1080 human fibrosarcoma cells.
CC The cell adhesion peptides are used to modify or control the
CC adhesive properties of cells, e.g. in treatment of inflammatory
CC conditions such as rheumatoid arthritis, asthma, sepsis, graft
CC rejection, inflammatory bowel disease, reperfusion of cardiac tissue
CC after myocardial infarction, and coagulatory disorders. They are
CC selective antagonists of cell adhesion, e.g. they promote adhesion
CC of the specified cells but inhibit adhesion to the natural adhesion
CC protein contg. the adhesive sequence.
CC See also R26821-30 and R30887-903.
SQ Sequence 19 AA;
SQ 1 A; 0 R; 0 N; 2 D; 0 B; 1 C; 1 Q; 0 E; 0 Z; 1 G; 0 H;
SQ 1 I; 4 L; 0 K; 0 M; 1 F; 2 P; 4 S; 0 T; 0 W; 0 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.64
Residue Identity = 66% Matches = 4 Mismatches = 2
Gaps = 0 Conservative Substitutions = 0

   X    X
   PLDVYK
   ||||
CSQPLDVILLLDGSSSFPA
   X    10

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_2p.res made by on Thu 22 Sep 94 10:50:46-PDT.

Query sequence being compared: US-08-249-182-2 (1-6)
Number of sequences searched: 70848
Number of scores above cutoff: 4700

Results of the initial comparison of US-08-249-182-2 (1-6) with:
Data bank : PIR 41, all entries

[Histogram of number of sequences against initial score (SCORE axis 0 to 5, STDEV axis 0 to 3); plotted points not recoverable from the scan]



PARAMETERS

Similarity matrix       Unitary     K-tuple              2
Mismatch penalty              1     Joining penalty     20
Gap penalty                1.00     Window size          5
Gap size penalty           0.05
Cutoff score                  0
Randomization group           0

Initial scores to save       40     Alignments to save  15
Optimized scores to save      0     Display context     50

SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
               1         3    0.98

Times:      CPU            Total Elapsed
            00:01:23.97    00:01:32.00

Number of residues:             20816057
Number of sequences searched:   70848
Number of scores above cutoff:  4700

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found.

The list of best scores is:

                                             Init.  Opt.
Sequence Name    Description         Length  Score  Score  Sig.  Frame

1. US-08-249-182-2 (1-6)

A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments
TITLE autotaxin - human (fragments)
ORGANISM #formal_name Homo sapiens #common_name man
DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change
  08-May-1993
ACCESSIONS A42329
REFERENCE A42329
#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
  Cioce, V.; Schiffmann, E.; Liotta, L.A.
#journal J. Biol. Chem. (1992) 267:2524-2529
#title Identification, purification, and partial sequence analysis
  of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329
##status preliminary
##molecule_type protein
##residues 1-114 ##label STR
##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
  NCBIP:78509; NCBIP:78508; NCBIP:78503
##note sequence extracted from NCBI backbone
SUMMARY #length 114 #checksum 7335
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

GSPLHITATKSPPFENINLYYDVPWNETIPEEVTXPNYLQAEVSYPAFKPXLDVYKWHVAAN 
60 70 80 90 100 X 110 



2. US-08-249-182-2 (1-6)
S19501 hypothetical protein YCR086W - yeast (Saccharomyce

ENTRY S19501 #type complete
TITLE hypothetical protein YCR086W - yeast (Saccharomyces
  cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 31-Mar-1992 #sequence_revision 31-Mar-1992 #text_change
  18-Jun-1993
ACCESSIONS S19501
REFERENCE S19351
#authors Dusterhoft, A.; Erdmann, D.; Hegemann, J.; Philippsen, P.;
  Schweitzer, B.; Spiegelberg, R.; Steiner, S.
#submission submitted to the Protein Sequence Database, March 1992
#accession S19501
##molecule_type DNA
##residues 1-190 ##label DUS
##cross-references EMBL:X59720
GENETICS
#map_position 3R
SUMMARY #length 190 #molecular-weight 21751 #checksum 6551
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
|| |||

MDPLTVYKNSVK88IDSADLLVANLVNENFVLSEKLDTKATEIK8LQK9IDSLNAQVK 
X 10 20 30 40 50 
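
The match line under each query can be reproduced by sliding the six-residue query along the subject and counting identities under a unitary (identity) matrix. This is only an illustrative sketch of ungapped placement, not FastDB's k-tuple search; the function name and return shape are assumptions made for the example.

```python
def best_ungapped_placement(query: str, subject: str) -> tuple[int, int, str]:
    """Return (matches, offset, match_line) for the best ungapped placement.

    Every offset of the query along the subject is tried; identical residues
    are marked '|' and differing residues ' '. Ties keep the first offset.
    If the subject is shorter than the query, (0, -1, '') is returned.
    """
    best = (0, -1, "")
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        marks = "".join("|" if q == s else " " for q, s in zip(query, window))
        matches = marks.count("|")
        if matches > best[0]:
            best = (matches, offset, marks)
    return best


if __name__ == "__main__":
    # First twelve residues of the S19501 subject line shown above.
    matches, offset, marks = best_ungapped_placement("PLDVYK", "MDPLTVYKNSVK")
    print(matches, offset)
    print(" " * offset + "PLDVYK")
    print(" " * offset + marks)
    print("MDPLTVYKNSVK")
```

For this record the best placement has five identities and one mismatch (D against T), which agrees with the Matches and Mismatches values reported.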



3. US-08-249-182-2 (1-6)
S23402 sperm surface protein PH-30 alpha chain - guinea p

ENTRY S23402 #type complete
TITLE sperm surface protein PH-30 alpha chain - guinea pig
ORGANISM #formal_name Cavia porcellus #common_name guinea pig
DATE 29-Jan-1993 #sequence_revision 29-Jan-1993 #text_change
  18-Jun-1993
ACCESSIONS S23402; S25695
REFERENCE S23402
#authors Blobel, C.P.; Wolfsberg, T.G.; Turck, C.W.; Myles, D.G.;
  Primakoff, P.; White, J.M.
#journal Nature (1992) 356:248-252
#title A potential fusion peptide and an integrin ligand domain in a
  protein active in sperm-egg fusion.
#accession S23402
##molecule_type mRNA
##residues 1-289 ##label BLO
##cross-references EMBL:Z11719
#accession S25695
##molecule_type protein
##residues h'X'iS-B.'XMO-lBJlOS-liS ##label BLO1
KEYWORDS glycoprotein; membrane protein
FEATURE
228-252 #domain transmembrane #status predicted #label TMH\
73,161 #binding_site carbohydrate (Asn) (covalent) #status
  predicted
SUMMARY #length 289 #molecular-weight 29735 #checksum 7298
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



PLDVYK 



YCTGQSGKCPLDTYKQOGTPCNEGFFCVSKGCTDPGIQCATYFGHGARSAPDACYTTLNSIGNIF 
10 X 20 30 40 50 60 



4. US-08-249-182-2 (1-6)
A24849 ADP,ATP carrier protein AAC1 - yeast (Saccharomyce

ENTRY A24849 #type complete
TITLE ADP,ATP carrier protein AAC1 - yeast (Saccharomyces
  cerevisiae)
ALTERNATE_NAMES ADP,ATP carrier protein PET9; ADP,ATP translocase; ADP,ATP
  translocator protein
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 31-Mar-1988 #sequence_revision 31-Mar-1988 #text_change
  0S-Apr-1994
ACCESSIONS A24849; C39654; S19035
REFERENCE A24849
#authors Adrian, G.S.; McCammon, M.T.; Montgomery, D.L.; Douglas, M.G.
#journal Mol. Cell. Biol. (1986) 6:626-634
#title Sequences required for delivery and localization of the
  ADP/ATP translocator to the mitochondrial inner membrane.
#cross-references MUID:87064348
#accession A24849
##molecule_type DNA
##residues 1-309 ##label ADR
##cross-references EMBL:M12514
REFERENCE A39654
#authors Hoyt, M.A.; Totis, L.; Roberts, B.T.
#journal Cell (1991) 66:507-517
#title Saccharomyces cerevisiae genes required for cell cycle arrest
  in response to loss of microtubule function.
#cross-references MUID:91330299
#accession C39654
##molecule_type DNA
##residues 265-309 ##label HOY
##cross-references GB:M64706
GENETICS
#gene AAC1
#map_position 2L
CLASSIFICATION #superfamily ADP,ATP carrier protein
KEYWORDS membrane protein; mitochondrion
SUMMARY #length 309 #molecular-weight 34120 #checksum 3531
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

YAKWFAGNLFSGGAAGGLSLLFVYSLDYARTRLAADARGSKSTS9RQFNGLLDVYKKTLKTDGLLGLYRGFV 
120 130 140 150 160 X 170 180 

PSVLGI IVYRGLYFGLYDSFKPVLLTGALEGSFV 
190 200 210 



5. US-08-249-182-2 (1-6)
JT0584 deoxyribonuclease I (EC 3.1.21.1) precursor - Stre

ENTRY JT0584 #type complete
TITLE deoxyribonuclease I (EC 3.1.21.1) precursor - Streptococcus
  "equisimilis"
ALTERNATE_NAMES streptodornase
ORGANISM #formal_name Streptococcus "equisimilis"
DATE 31-Mar-1992 #sequence_revision 31-Mar-1992 #text_change
  28-Apr-1993
ACCESSIONS JT0584
REFERENCE JT0584
#authors Wolinowska, R.; Ceglowski, P.; Kok, J.; Venema, G.
#journal Gene (1991) 106:115-119
#title Isolation, sequence and expression in Escherichia coli,
  Bacillus subtilis and Lactococcus lactis of the DNase
  (streptodornase)-encoding gene from Streptococcus
  equisimilis H46A.
#cross-references MUID:92039051
#contents Strain H46A
#accession JT0584
##molecule_type DNA
##residues 1-327 ##label WOL
##cross-references EMBL:X17241
GENETICS
#gene sdc
KEYWORDS hydrolase
FEATURE
1-35 #domain signal sequence #label SIGN
36-308 #protein deoxyribonuclease I #status predicted #label
  DEO
SUMMARY #length 327 #molecular-weight 36844 #checksum 1320
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
||| ||

YGEYKDYYTVIGESNID9SAFPKIYKTTERVYKGQGTSEKRVTVSDVVYNPLDGYKRSTGAYGVVTKDMIDN 
50 60 70 80 90 X 100 110 



SKGYREKWETNPEPSGWFRFYNRADNEEISEKEY 
120 130 140 



6. US-08-249-182-2 (1-6)
S23420 hypothetical protein - Staphylococcus epidermidis

ENTRY S23420 #type complete
TITLE hypothetical protein - Staphylococcus epidermidis
ORGANISM #formal_name Staphylococcus epidermidis
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
  22-Nov-1993
ACCESSIONS S23420
REFERENCE S23413
#authors Schnell, N.; Engelke, G.; Augustin, J.; Rosenstein, R.;
  Ungermann, V.; Goetz, F.; Entian, K.D.
#journal Eur. J. Biochem. (1992) 204:57-68
#title Analysis of genes involved in the biosynthesis of lantibiotic
  epidermin.
#cross-references MUID:92155237
#accession S23420
##status preliminary
##residues 1-461 ##label SCH
##cross-references EMBL:X62386
SUMMARY #length 461 #molecular-weight 51814 #checksum 7694
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 



SLAAPKVSGALALEIDKYSLKDQPETAIELFKKKGIEKEKYMDKKHYGNGKLDVYKLLKE 
410 420 430 440 450 X 460 



7. US-08-249-182-2 (1-6)
S16250 phytoene desaturase - Synechococcus sp.

ENTRY S16250 #type complete
TITLE phytoene desaturase - Synechococcus sp.
ORGANISM #formal_name Synechococcus sp.
DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
  21-Nov-1993
ACCESSIONS S16250
REFERENCE S16250
#authors Chamovitz, D.; Pecker, I.; Hirschberg, J.
#journal Plant Mol. Biol. (1991) 16:967-974
#title The molecular basis of resistance to the herbicide
  norflurazon.
#cross-references MUID:91322511
#accession S16250
##status preliminary
##residues 1-474 ##label CHA
##cross-references EMBL:X55289
SUMMARY #length 474 #molecular-weight 53295 #checksum 4126
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
|| |||

ELVFAPAKDHIGRSDEDILAATMAEIEKLFPQHFSGENPARLRKYKIVKTPLSVYKATPGRQ8YRPD8ASPI 
350 360 370 380 390 400 X 410 420 



ANFFLTGD YTMfiR YLASMEGA VLSGKLT AO A I I A 
430 440 450 



8. US-08-249-182-2 (1-6)
S35519 hypothetical protein - Caenorhabditis elegans

ENTRY S35519 #type complete
TITLE hypothetical protein - Caenorhabditis elegans
ORGANISM #formal_name Caenorhabditis elegans
DATE 09-Dec-1993; #sequence_revision 09-Dec-1993; #text_change
  09-Dec-1993
ACCESSIONS S35519
REFERENCE S35519
#authors Li, W.; Shaw, J.E.
#journal Nucleic Acids Res. (1993) 21:59-67
#title A variant Tc4 transposable element in the nematode C. elegans
  could encode a novel protein.
#accession S35519
##status preliminary
##residues 1-537 ##label LIA
##cross-references EMBL:L00665
SUMMARY #length 537 #molecular-weight 62032 #checksum 5804
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
|||||

VFIPSVPKKLYIMLDSWPAFKDHTTIKNLVPNGHDVVIRNIPEHTTGMI9PLDVYWNAPWKSLIKKFTAYAL 
360 370 380 390 400 410 X 420 430 



RTQTDYVIAQRNNAICMVSVLYHQISAEHFRPFL 
440 450 460 



9. US-08-249-182-2 (1-6)
UFECAQ fumarate hydratase (EC 4.2.1.2), iron-dependent

ENTRY UFECAQ #type complete
TITLE fumarate hydratase (EC 4.2.1.2), iron-dependent - Escherichia
  coli
ALTERNATE_NAMES fumarase
ORGANISM #formal_name Escherichia coli
DATE 17-Mar-1987 #sequence_revision 17-Mar-1987 #text_change
  03-Feb-1994
ACCESSIONS A03531; PX0048
REFERENCE A93515
#authors Miles, J.S.; Guest, J.R.
#journal Nucleic Acids Res. (1984) 12:3631-3642
#title Complete nucleotide sequence of the fumarase gene fumA, of
  Escherichia coli.
#cross-references MUID:84221385
#accession A03531
##molecule_type DNA
##residues 1-548 ##label MIL
REFERENCE A92783
#authors Guest, J.R.; Miles, J.S.; Roberts, R.E.; Woods, S.A.
#journal J. Gen. Microbiol. (1985) 131:2971-2984
#title The fumarase genes of Escherichia coli: location of the fumB
  gene and discovery of a new gene (fumC).
#cross-references MUID:86142617
#contents annotation; identification of the structural gene
REFERENCE PX0048
#authors Ueda, Y.; Yumoto, N.; Tokushige, M.; Fukui, K.;
  Ohya-Nishiguchi, H.
#title Purification and characterization of two types of fumarase
  from Escherichia coli.
#cross-references MUID:92011457
#contents strain U
#accession PX0048
##molecule_type protein
##residues 2-21 ##label UED
COMMENT In E. coli, three fumarate hydratase genes (fumA, fumB, and fumC)
  have been reported. This protein, the fumA-encoded fumarate
  hydratase, is an Fe-dependent 4Fe-4S hydratase.
GENETICS
#gene fumA
#map_position 35.5 min
CLASSIFICATION #superfamily iron-dependent fumarate hydratase
KEYWORDS 4Fe-4S; carbon-oxygen lyase; homodimer; hydro-lyase;
  iron-sulfur protein
FEATURE
2-548 #protein fumarate hydratase, iron-dependent #label MAT
SUMMARY #length 548 #molecular-weight 60298 #checksum 4355
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
||| ||

VLPTCQDTGTAIIVGKKG9RVWTGGGDEAALARGVYNTYIEDNLRYS8NAPLDHYKEVNTGTNLPAQIDLYA 
110 120 130 140 150 X 160 170 

VDGDEYKFLCIAKGGGSANKTYLYQETKALLTPG 
180 190 200 



10. US-08-249-182-2 (1-6)
JT0742 tartronate-semialdehyde synthase (EC 4.1.1.47) - E

ENTRY JT0742 #type complete
TITLE tartronate-semialdehyde synthase (EC 4.1.1.47) - Escherichia
  coli
ALTERNATE_NAMES glyoxylate carbo-ligase
ORGANISM #formal_name Escherichia coli
DATE 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change
  30-Sep-1993
ACCESSIONS JT0742; PN0053
REFERENCE JT0742
#authors Chang, Y.Y.; Wang, A.Y.; Cronan Jr., J.E.
#journal J. Biol. Chem. (1993) 268:3911-3919
#title Molecular cloning, DNA sequencing, and biochemical analyses
  of Escherichia coli glyoxylate carboligase.
#accession JT0742
##molecule_type DNA
##residues 1-593 ##label CHA
##cross-references GB:L03845
#accession PN0053
##molecule_type protein
##residues 2-31 ##label CH2
COMMENT This enzyme catalyzes the condensation of two molecules of
  glyoxylate to give tartronic semialdehyde.
GENETICS
#gene gcl
KEYWORDS carbon-carbon lyase; carboxy-lyase
FEATURE
2-593 #protein tartronate-semialdehyde synthase #status
SUMMARY #length 593 #molecular-weight 64731 #checksum 8186
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
|| |||

AVTVREAALVPRVLQSAFHLMRSGRPGPVLVDLPFDVQVAEIEFDPDMYEPLPVYKPAASRMQIEKAVEMLI 
130 140 150 160 170 180 X 190 200 

QAERPVIVAGGGVINADAAALLG8FAELTSVPVI 
210 220 230 



11. US-08-249-182-2 (1-6)
S03208 type III site-specific deoxyribonuclease (EC 3.1.2

ENTRY S03208 #type complete
TITLE type III site-specific deoxyribonuclease (EC 3.1.21.5) EcoP15
  chain mod - Escherichia coli
ALTERNATE_NAMES type III restriction enzyme EcoP15 chain mod
ORGANISM #formal_name Escherichia coli
DATE 07-Jun-1990 #sequence_revision 31-Dec-1991 #text_change
  28-Apr-1993
ACCESSIONS S03208
REFERENCE S01351
#authors Huembelin, M.; Suri, B.; Rao, D.N.; Hornby, D.P.; Eberle, H.;
  Pripfl, T.; Kenel, S.; Bickle, T.A.
#journal J. Mol. Biol. (1988) 200:23-29
#title Type III DNA restriction and modification systems EcoP1 and
  EcoP15. Nucleotide sequence of the EcoP1 operon, the EcoP15
  mod gene and some EcoP1 mod mutants.
#cross-references MUID:88245189
#accession S03208
##molecule_type DNA
##residues 1-645 ##label HUE
##cross-references EMBL:X06288
##note the amino acid sequence from Fig. 6 is inconsistent with
  the nucleotide sequence from Fig. 5 in having an
  additional Met-Asp at the amino end
GENETICS
#gene mod
KEYWORDS hydrolase
SUMMARY #length 645 #molecular-weight 74221 #checksum 4238
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
||| ||

EQLISEWKSNISDVKNLLVNIGEEFASKYTGNEL9EKYTSUFREHRSELNPLDRYKYIDKDGIYTGS8SVHN 
260 270 280 290 300 X 310 320 



PGKEGYRYDIIHPKTKKPCKQQPLMGYRFPLDTM 
330 340 350 



12. US-08-249-182-2 (1-6)
OYRTA1 guanylate cyclase (EC 4.6.1.2), soluble, alpha-1 c

ENTRY OYRTA1 #type complete
TITLE guanylate cyclase (EC 4.6.1.2), soluble, alpha-1 chain - rat
ALTERNATE_NAMES guanylate cyclase, soluble, 77K chain
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 31-Mar-1993 #sequence_revision 31-Mar-1993 #text_change
  31-Dec-1993
ACCESSIONS A38297
REFERENCE A38297
#authors Nakane, M.; Arai, K.; Saheki, S.; Kuno, T.; Buechler, W.;
  Murad, F.
#journal J. Biol. Chem. (1990) 265:16841-16845
#title Molecular cloning and expression of cDNAs coding for soluble
  guanylate cyclase from rat lung.
#cross-references MUID:91009100
#accession A38297
##molecule_type mRNA
##residues 1-690 ##label NAK
##cross-references GB:M36075
CLASSIFICATION #superfamily soluble guanylate cyclase; guanylate cyclase
  catalytic domain homology
KEYWORDS cGMP synthesis; heterodimer; phosphorus-oxygen lyase
FEATURE
424-673 #domain guanylate cyclase catalytic domain #status
  predicted #label CAT
SUMMARY #length 690 #molecular-weight 77566 #checksum 6429
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

G8I VGAKKFNEVTMLFSD I VGFT A I CSQCSPLSVI TMLNALYTRFDQSCGELDVYKVET I GDAYCVAGGLHR 
470 480 490 500 510 520 X 530 



ESDTHAVQIALMALKMMELSNEVMSPHGEPIKMR 
540 550 560 570 



13. US-08-249-182-2 (1-6)
OYBO77 guanylate cyclase (EC 4.6.1.2), soluble, alpha-1 c

ENTRY OYBO77 #type complete
TITLE guanylate cyclase (EC 4.6.1.2), soluble, alpha-1 chain -
  bovine
ALTERNATE_NAMES guanylate cyclase, soluble, 77K chain
ORGANISM #formal_name Bos primigenius taurus #common_name cattle
DATE 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change
  31-Dec-1993
ACCESSIONS S10713; A38767
REFERENCE S10713
#authors Koesling, D.; Harteneck, C.; Humbert, P.; Bosserhoff, A.;
  Frank, R.; Schultz, G.; Boehme, E.
#journal FEBS Lett. (1990) 266:128-132
#title The primary structure of the larger subunit of soluble
  guanylyl cyclase from bovine lung. Homology between the two
  subunits of the enzyme.
#cross-references MUID:90306336
#accession S10713
##molecule_type mRNA
##residues 1-691 ##label KOE1
##cross-references EMBL:X54014
#accession A38767
##molecule_type protein
##residues 118-133;226-232;2S6-293;319-330;412-417;557-571;629-637
  ##label KOE2
CLASSIFICATION #superfamily soluble guanylate cyclase; guanylate cyclase
  catalytic domain homology
KEYWORDS cGMP synthesis; heterodimer; phosphorus-oxygen lyase
SUMMARY #length 691 #molecular-weight 77532 #checksum 2903
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

GHAVQAKRFGNVTMLFSDIVGFTAICS8CSPLQVITMLNALYTRFDRQCGELDVYKVETIGDAYCVAGGLHK 
470 480 490 500 510 520 X 530 540 



ESDTHAVQIALMALKMMELSHEVVSPHGEPIKMR 
550 560 570 



14. US-08-249-182-2 (1-6)
S23098 guanylate cyclase (EC 4.6.1.2), soluble, 81K chain

ENTRY S23098 #type complete
TITLE guanylate cyclase (EC 4.6.1.2), soluble, 81K chain - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
  22-Nov-1993
ACCESSIONS S23098
REFERENCE S23097
#authors Giuili, G.; Scholl, U.; Bulle, F.; Guellaen, G.
#journal FEBS Lett. (1992) 304:83-88
#title Molecular cloning of the cDNAs coding for the two subunits of
  soluble guanylyl cyclase from human brain.
#accession S23098
##status preliminary
##residues 1-717 ##label GIU
##cross-references EMBL:X66534
SUMMARY #length 717 #molecular-weight 81377 #checksum 6372
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

G9VVQAKKFSNVTMLFSDIVGFTAICS3CSPLQVITMLNALYTRFD88CGELDVYKVETIAMPIVWLGGLHK 
470 480 490 500 510 X 520 530 



ESDTHA V8I ALHALKMMELSDE VMSPHGEP I KMR 
540 550 560 570 



15. US-08-249-182-2 (1-6)
A42163 Na+/myo-inositol cotransporter - dog

ENTRY A42163 #type complete
TITLE Na+/myo-inositol cotransporter - dog
ORGANISM #formal_name Canis lupus familiaris #common_name dog
DATE 03-May-1994 #sequence_revision 03-May-1994 #text_change
  03-May-1994
REFERENCE A42163
#authors Kwon, H.M.; Yamauchi, A.; Uchida, S.; Preston, A.S.;
  Garcia-Perez, A.; Burg, M.B.; Handler, J.S.
#journal J. Biol. Chem. (1992) 267:6297-6301
#title Cloning of the cDNA for a Na(+)/myo-inositol cotransporter, a
  hypertonicity stress protein.
#accession A42163
##status preliminary
##molecule_type mRNA
##residues 1-718 ##label KWO
##cross-references GB:M85068
SUMMARY #length 718 #molecular-weight 79545 #checksum 3177
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.09
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X    X
PLDVYK
 |||||

GSRAGCSNIAYPRLVMKLVPVGLRGLMHAVMIAALMSDLDSIFNSASTIFTIDVYKLIRRSASSRELMIVGR 
350 360 370 380 390 X 400 410 

I F VAFMVVI SI AWVP I I VEHflGGQMYL Y I QEVAD 
420 430 440 

> 0 < 

0| |0 IntelliGenetics

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_2s.res made by on Thu 22 Sep 94 10:53:06-PDT.

Query sequence being compared: US-08-249-182-2 (1-6)
Number of sequences searched: 36000
Number of scores above cutoff: 4368

Results of the initial comparison of US-08-249-182-2 (1-6) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (axis ticks 0, 10, 50, 100, 500, 1000, 5000, 10000, 50000, 100000) against SCORE (0 to 5) and STDEV (0 to 3); plotted points not recoverable from the scan]



PARAMETERS

Similarity matrix       Unitary     K-tuple              2
Mismatch penalty              1     Joining penalty     20
Gap penalty                1.00     Window size          5
Gap size penalty           0.05
Cutoff score                  0
Randomization group           0

Initial scores to save       40     Alignments to save  15
Optimized scores to save      0     Display context     50

SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
               1         3    0.86

Times:      CPU            Total Elapsed
            00:00:49.89    00:00:55.00

Number of residues:             12496420
Number of sequences searched:   36000
Number of scores above cutoff:  4368

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
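
The reported significance values are consistent with the initial score expressed in standard deviations above the mean score of the search. The sketch below rests on that assumption; it is not taken from FastDB documentation, and the function name is illustrative. With the printed statistics for this search (mean 1, standard deviation 0.86) it gives roughly 4.65 for a score of 5 and 3.49 for a score of 4, against the reported 4.63 and 3.47; the small difference would be expected if the printed mean is a rounded value.

```python
def significance(initial_score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean.

    Assumption: significance = (score - mean) / stdev, using the mean and
    standard deviation listed under SEARCH STATISTICS.
    """
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


if __name__ == "__main__":
    # Printed statistics for this search: mean 1, standard deviation 0.86.
    for score in (5, 4):
        print(score, round(significance(score, 1.0, 0.86), 2))
```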



A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                                 Init.  Opt.
     Sequence Name        Description                    Length  Score  Score  Sig.  Frame

               **** 4 standard deviations above mean ****

  1. YCX6_YEAST           HYPOTHETICAL 21.7 KD PROTEIN      190      5      5  4.63      0
  2. ADT1_YEAST           ADP,ATP CARRIER PROTEIN 1 (AD     309      5      5  4.63      0
  3. DRN1_STREQ           DEOXYRIBONUCLEASE PRECURSOR (     327      5      5  4.63      0
  4. EPIP_STAEP           EPIDERMIN PROCESSING SERINE P     461      5      5  4.63      0
  5. CRTI_SYNP7           PHYTOENE DEHYDROGENASE (EC 1.     474      5      5  4.63      0
  6. FUMA_ECOLI           FUMARATE HYDRATASE CLASS I, A     548      5      5  4.63      0
  7. GCL_ECOLI            GLYOXYLATE CARBOLIGASE (EC 4.     592      5      5  4.63      0
  8. T3MO_ECOLI           TYPE III RESTRICTION-MODIFICA     643      5      5  4.63      0
  9. CYG3_RAT             GUANYLATE CYCLASE SOLUBLE, AL     690      5      5  4.63      0
 10. CYG3_BOVIN           GUANYLATE CYCLASE SOLUBLE, AL     691      5      5  4.63      0
 11. CYG[?]_HUMAN         GUANYLATE CYCLASE SOLUBLE, AL     717      5      5  4.63      0
 12. NAMI_CANFA           SODIUM/MYO-INOSITOL COTRANSPO     718      5      5  4.63      0
 13. DPOL_SULSO           DNA POLYMERASE (EC 2.7.7.7).      882      5      5  4.63      0
 14. EBA1_PLAFC           ERYTHROCYTE-BINDING ANTIGEN E    1426      5      5  4.63      0
 15. PRP8_YEAST           PRE-MRNA SPLICING FACTOR PRP8    2413      5      5  4.63      0

               **** 3 standard deviations above mean ****

 16. DESR_DESGI           DESULFOREDOXIN (DX).               36      4      4  3.47      0
 17. APA1_COTJA           APOLIPOPROTEIN A-I (APO-AI) (      36      4      4  3.47      0
 18. LDH_PLAFA            L-LACTATE DEHYDROGENASE (EC 1      42      4      4  3.47      0
 19. [illegible]_BPT4     HYPOTHETICAL 5.1 KD PROTEIN I      45      4      4  3.47      0
 20. DIUH_ACHDO           DIURETIC HORMONE (DH).             46      4      4  3.47      0
 21. KCRB_[illegible]     CREATINE KINASE, B CHAIN (EC       52      4      4  3.47      0
 22. SAS3_BACME           SMALL, ACID-SOLUBLE SPORE PRO      65      4      4  3.47      0
 23. COX8_BOVIN           CYTOCHROME C OXIDASE POLYPEPT      70      4      4  3.47      0
 24. PSAX_SYNY3           PHOTOSYSTEM I IRON-SULFUR CEN      80      4      4  3.47      0
 25. PSAC_SYNP2           PHOTOSYSTEM I IRON-SULFUR CEN      80      4      4  3.47      0
 26. PSAC_CHLRE           PHOTOSYSTEM I IRON-SULFUR CEN      80      4      4  3.47      0
 27. [illegible]_PYRWO    HYPOTHETICAL 9.7 KD PROTEIN I      87      4      4  3.47      0
 28. VFUS_ORFNZ           10 KD FUSION PROTEIN.              89      4      4  3.47      0
 29. YPS1_SYNP2           HYPOTHETICAL 10.4 KD PROTEIN       92      4      4  3.47      0
 30. CYC_TRYBB            CYTOCHROME C (FRAGMENTS).          93      4      4  3.47      0
 31. CH10_ACYPS           10 KD CHAPERONIN (PROTEIN CPN      96      4      4  3.47      0
 32. [illegible]_CALSA    HYPOTHETICAL 10.7 KD PROTEIN       97      4      4  3.47      0
 33. VG45_BPML5           GENE 45 PROTEIN (GP45).            97      4      4  3.47      0
 34. CH10_ECOLI           10 KD CHAPERONIN (PROTEIN CPN      97      4      4  3.47      0
 35. ELIC_PHYCI           CINNAMOMIN.                        98      4      4  3.47      0
 36. CH10_BRUAB           10 KD CHAPERONIN (PROTEIN CPN      98      4      4  3.47      0
 37. CH10_MYCBO           10 KD CHAPERONIN (PROTEIN CPN      99      4      4  3.47      0
 38. CH10_MYCTU           10 KD CHAPERONIN (PROTEIN CPN     100      4      4  3.47      0
 39. CH10_[?]TRAL         10 KD CHAPERONIN (PROTEIN CPN     102      4      4  3.47      0
 40. FINC_HORSE           FIBRONECTIN (FN) (FRAGMENT).      103      4      4  3.47      0

1. US-08-249-182-2 (1-6) 

YCX6_YEAST HYPOTHETICAL 21.7 KD PROTEIN IN TUP1-ABP1 INTERGEN
ID YCX6_YEAST STANDARD; PRT; 190 AA.
AC P25651;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 21.7 KD PROTEIN IN TUP1-ABP1 INTERGENIC REGION.
GN YCR86W.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RA DUSTERHOFT A., ERDMANN D., HEGEMANN J., PHILIPPSEN P., SCHWEITZER B.;
RL SUBMITTED (MAR-1992) TO EMBL/GENBANK/DDBJ DATA BANKS.
DR EMBL; X59720; SCCHRIII.
DR PIR; S19501; S19501.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 190 AA; 21751 MW; 199156 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
II III 

NDPLT VYKNSVKSQ I DSADLLVANL VNENF VLSEKLDTK ATE I K8LQKS I DSLNAQVK 
X 10 20 30 40 50 
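
The identity figure printed with each alignment can be checked by hand. Assuming FastDB reports Residue Identity as matches divided by aligned positions (matches plus mismatches, with no gaps), 5 matches and 1 mismatch give 5/6, printed as 83%. The following is a minimal Python sketch of that arithmetic; the function name is illustrative and not part of FastDB, and whether the program truncates or rounds is an assumption (both give the same value for the cases in this report).

```python
def residue_identity(matches: int, mismatches: int) -> int:
    """Percent identity over aligned positions, truncated to a whole percent.

    Assumption: no gaps, so aligned positions = matches + mismatches.
    """
    aligned = matches + mismatches
    if aligned == 0:
        raise ValueError("no aligned positions")
    return (100 * matches) // aligned


# Values taken from the statistics blocks in this report.
print(residue_identity(5, 1))  # 6-residue query, one mismatch
print(residue_identity(4, 1))  # 5-residue query, one mismatch
print(residue_identity(5, 0))  # 5-residue query, exact match
```

The three calls correspond to the 83%, 80% and 100% identities listed for the hits in this report.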



2. US-08-249-182-2 (1-6) 

ADT1_YEAST ADP,ATP CARRIER PROTEIN 1 (ADP/ATP TRANSLOCASE 1)
ID ADT1_YEAST STANDARD; PRT; 309 AA.
AC P04710;
DT 13-AUG-1987 (REL. 05, CREATED)
DT 13-AUG-1987 (REL. 05, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE ADP,ATP CARRIER PROTEIN 1 (ADP/ATP TRANSLOCASE 1) (ADENINE NUCLEOTIDE
DE TRANSLOCATOR 1) (ANT 1).
GN AAC1 OR PET9.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RM 87064348
RA ADRIAN G.S., MCCAMMON M.T., MONTGOMERY D.L., DOUGLAS M.G.;
RL MOL. CELL. BIOL. 6:626-634(1986).
RN [2]
RP SEQUENCE OF 265-309 FROM N.A.
RC STRAIN=S288C;
RM 91330299
RA HOYT M.A., TOTIS L., ROBERTS B.T.;
RL CELL 66:507-517(1991).
CC -!- FUNCTION: CATALYZES THE EXCHANGE OF ADP AND ATP ACROSS THE
CC MITOCHONDRIAL INNER MEMBRANE.
CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN. MITOCHONDRIAL
CC INNER MEMBRANE.
CC -!- SUBUNIT: HOMODIMER.
CC -!- EACH CHAIN IS COMPOSED OF THREE HOMOLOGOUS DOMAINS.
CC -!- SIMILARITY: BELONGS TO THE MITOCHONDRIAL CARRIER FAMILY.
DR EMBL; M12514; SCPET9.
DR EMBL; M64706; SCBUB2Q.
DR PIR; A24849; A24849.
DR PROSITE; PS00215; MITOCH_CARRIER.
KW MITOCHONDRION; INNER MEMBRANE; DUPLICATION; TRANSMEMBRANE; TRANSPORT;
KW MULTIGENE FAMILY.
FT TRANSMEM  17  34 1 (POTENTIAL).
FT TRANSMEM  79  97 2 (POTENTIAL).
FT TRANSMEM 122 139 3 (POTENTIAL).
FT TRANSMEM 183 202 4 (POTENTIAL).
FT TRANSMEM 222 239 5 (POTENTIAL).
FT TRANSMEM 278 296 6 (POTENTIAL).
SQ SEQUENCE 309 AA; 34120 MW; 502547 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 
lllll 

YAKHFAGNLFSGGAAGGLSLLFVYSLDYARTRLAADARGSKSTSfiRQFNGLLDVYKKTLKTDGLLGLYRGFV 
120 130 140 150 160 X 170 180 



PSVLGIIVYRGLYFGLYDSFKPVLLTGALEGSFV 
190 200 210 



3. US-08-249-182-2 (1-6) 

DRN1_STREQ DEOXYRIBONUCLEASE PRECURSOR (EC 3.1.21.1) (STREPTO
ID DRN1_STREQ STANDARD; PRT; 327 AA.
AC P26295;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE DEOXYRIBONUCLEASE PRECURSOR (EC 3.1.21.1) (STREPTODORNASE) (DNASE).
GN SDC.
OS STREPTOCOCCUS EQUISIMILIS.
OC PROKARYOTA; FIRMICUTES; COCCI; STREPTOCOCCACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=H46A;
RM 92039051
RA WOLINOWSKA R., CEGLOWSKI P., KOK J., VENEMA G.;
RL GENE 106:115-119(1991).
CC -!- FUNCTION: MAY HAVE A ROLE IN S.EQUISIMILIS VIRULENCE.
CC -!- CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE TO 5'-
CC PHOSPHODINUCLEOTIDE AND 5'-PHOSPHOOLIGONUCLEOTIDE END-PRODUCTS.
DR EMBL; X17241; SESDC.
DR PIR; JT0584; JT0584.
KW ENDONUCLEASE; NUCLEASE; SIGNAL.
FT SIGNAL 1 24 OR 35 (POTENTIAL).
FT CHAIN 25 327 DEOXYRIBONUCLEASE.
SQ SEQUENCE 327 AA; 36844 MW; 583871 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 
III II 

YGEYKDYYTVIGESNID9SAFPKIYKTTERVYKGQGTSEKRVTVSDVVYNPLDGYKRSTGAYGVVTKDNIDM 
50 60 70 80 90 X 100 110 



SKGYREKUETNPEPSGUFRFYNRADNEEISEKEY 
120 130 140 



4. US-08-249-182-2 (1-6) 

EPIP_STAEP EPIDERMIN PROCESSING SERINE PROTEASE EPIP PRECURSO
ID EPIP_STAEP STANDARD; PRT; 461 AA.
AC P30199;
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE EPIDERMIN PROCESSING SERINE PROTEASE EPIP PRECURSOR (EC 3.4.21.-).
GN EPIP.
OS STAPHYLOCOCCUS EPIDERMIDIS.
OG PLASMID PTU32.
OC PROKARYOTA; FIRMICUTES; COCCI; MICROCOCCACEAE.
RP SEQUENCE FROM N.A.
RC STRAIN=TU 3298 / DSM 3095;
RM 92155237
RA SCHNELL N., ENGELKE G., AUGUSTIN J., ROSENSTEIN R., UNGERMANN V.,
RA GOETZ F., ENTIAN K.-D.;
RL EUR. J. BIOCHEM. 204:57-68(1992).
CC -!- FUNCTION: EPIP MIGHT BE THE PROTEASE WHICH CLEAVES THE MATURED
CC LANTIBIOTIC FROM THE MODIFIED PREPEPTIDE.
CC -!- SIMILARITY: BELONGS TO THE SUBTILASE PROTEASES FAMILY.
DR EMBL; X62386; SEEPIDNA.
DR PIR; S23420; S23420.
DR PROSITE; PS00136; SUBTILASE_ASP.
DR PROSITE; PS00137; SUBTILASE_HIS.
DR PROSITE; PS00138; SUBTILASE_SER.
KW HYDROLASE; SERINE PROTEASE; SIGNAL.
FT SIGNAL 1 ? POTENTIAL.
FT CHAIN ? 461 EPIDERMIN PROCESSING SERINE PROTEASE
FT EPIP.
FT ACT_SITE 149 149 CHARGE RELAY SYSTEM (BY SIMILARITY).
FT ACT_SITE 194 194 CHARGE RELAY SYSTEM (BY SIMILARITY).
FT ACT_SITE 402 402 CHARGE RELAY SYSTEM (BY SIMILARITY).
SQ SEQUENCE 461 AA; 51814 MW; 1103690 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 



SLAAPKVSGALALEIDKYQLKD8PETAIELFKKKGIEKEKYMDKKHYGNGKLDVYKLLKE 
410 420 430 440 450 X 460 



5. US-08-249-182-2 (1-6) 

CRTI_SYNP7 PHYTOENE DEHYDROGENASE (EC 1.3.-.-) (PHYTOENE DESA
ID CRTI_SYNP7 STANDARD; PRT; 474 AA.
AC P26294;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE PHYTOENE DEHYDROGENASE (EC 1.3.-.-) (PHYTOENE DESATURASE).
GN PDS.
OS SYNECHOCOCCUS SP. (STRAIN PCC 7942) (ANACYSTIS NIDULANS R2).
OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA;
OC CYANOBACTERIA (BLUE-GREEN ALGAE); CHROOCOCCALES.
RN [1]
RP SEQUENCE FROM N.A.
RM 91322511
RA CHAMOVITZ D., PECKER I., HIRSCHBERG J.;
RL PLANT MOL. BIOL. 16:967-974(1991).
CC -!- FUNCTION: THIS ENZYME CONVERTS PHYTOENE INTO ZETA-CAROTENE VIA THE
CC INTERMEDIARY OF PHYTOFLUENE BY THE SYMMETRICAL INTRODUCTION OF TWO
CC DOUBLE BONDS AT THE C-11 AND C-11' POSITIONS OF PHYTOENE.
CC -!- PATHWAY: CAROTENOID BIOSYNTHESIS.
CC -!- COFACTOR: NAD, NADP, OR FAD (PROBABLE).
CC -!- SUBCELLULAR LOCATION: MEMBRANE-ASSOCIATED (PROBABLE).
CC -!- ENZYME REGULATION: INHIBITED BY THE HERBICIDE NORFLURAZON IN A
CC NON-COMPETITIVE WAY.
CC -!- SIMILARITY: TO OTHER PLANTS OR CYANOBACTERIAL PHYTOENE
CC DESATURASES.
DR EMBL; X55289; SYPDSG.
DR PIR; S16250; S16250.
KW CAROTENOID BIOSYNTHESIS; OXIDOREDUCTASE; NAD; FAD;
FT NP_BIND 7 23 FAD OR NAD (POTENTIAL).
FT VARIANT 403 403 V -> G (CONFERS RESISTANCE TO THE
FT HERBICIDE NORFLURAZON; STRAIN NFZ4).
SQ SEQUENCE 474 AA; 53296 MW; 1092205 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
II III 

ELVFAP AKDU IGRSDED I L AATHAEI EKLFPQHFSGENPARLRKYKI VKTPLSV YKATPGR89YRPD8ASPI 
350 360 370 380 390 400 X 410 420 

ANFFLTGDYTHQRYLASNEGAVLSGKLTA8AIIA 
430 440 450 



6. US-08-249-182-2 (1-6) 

FUMA_ECOLI FUMARATE HYDRATASE CLASS I, AEROBIC (EC 4.2.1.2) (
ID FUMA_ECOLI STANDARD; PRT; 548 AA.
AC P00923;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE FUMARATE HYDRATASE CLASS I, AEROBIC (EC 4.2.1.2) (FUMARASE).
GN FUMA.
OS ESCHERICHIA COLI.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 84221385
RA MILES J.S., GUEST J.R.;
RL NUCLEIC ACIDS RES. 12:3631-3642(1984).
RN [2]
RP IDENTIFICATION OF THE STRUCTURAL GENE.
RM 86142617
RA GUEST J.R., MILES J.S., ROBERTS R.E., WOODS S.A.;
RL J. GEN. MICROBIOL. 131:2971-2984(1985).
RN [3]
RP BIOCHEMICAL ANALYSIS OF FUMA AND FUMC.
RM 88193096
RA WOODS S.A., SCHWARTZBACH S.D., GUEST J.R.;
RL BIOCHIM. BIOPHYS. ACTA 954:14-26(1988).
RN [4]
RP IRON-SULFUR CLUSTER.
RA FLINT D.H., EMPTAGE M.H., GUEST J.R.;
RL J. INORG. BIOCHEM. 36:306-306(1989).
CC -!- CATALYTIC ACTIVITY: L-MALATE = FUMARATE + H(2)O.
CC -!- SUBUNIT: HOMODIMER.
CC -!- PATHWAY: TRICARBOXYLIC ACID CYCLE.
CC -!- COFACTOR: IRON-SULFUR 4FE-4S CLUSTER.
CC -!- ENZYME REGULATION: SUBJECT TO AEROBIC RESPIRATORY CONTROL AND
CC CATABOLITE REPRESSION.
CC -!- FUMA ACCOUNTS FOR ABOUT 80% OF THE FUMARASE ACTIVITY WHEN
CC E.COLI GROWS AEROBICALLY.
CC -!- THIS IS A CLASS-I FUMARASE. CLASS-I FUMARASES ARE THERMOLABILE.
CC -!- SIMILARITY: 79% IDENTITY WITH E.COLI FUMB.
DR EMBL; X00522; ECFUMA.
DR PIR; A03531; UFECA8.
DR ECOGENE; EG10356; FUMA.
DR PROSITE; PS00163; FUMARATE_LYASES.
FT METAL 318 318 IRON-SULFUR (4FE-4S) (BY SIMILARITY).
FT ACT_SITE 397 397 POTENTIAL.
FT BINDING 463 463 CARBOXYL GROUP (POTENTIAL).
SQ SEQUENCE 548 AA; 60298 MW; 1466324 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
III II 

VLPTCQDTGTAIIVGKKG8RVWTGGGDEAALARGVYNTYIEDNLRYSQNAPLDNYKEVNTGTNLPAQIDLYA 
110 120 130 140 150 X 160 170 

VDGDEYKFLCIAKGGGSANKTYLYQETKALLTPG 
180 190 200 



7. US-08-249-182-2 (1-6) 

GCL_ECOLI GLYOXYLATE CARBOLIGASE (EC 4.1.1.47) (TARTRONATE-S
ID GCL_ECOLI STANDARD; PRT; 592 AA.
AC P30146;
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GLYOXYLATE CARBOLIGASE (EC 4.1.1.47) (TARTRONATE-SEMIALDEHYDE
DE SYNTHASE).
GN GCL.
OS ESCHERICHIA COLI.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-30.
RC STRAIN=K12;
RM 93179387
RA CHANG Y.-Y., WANG A.-Y., CRONAN J.E. JR.;
RL J. BIOL. CHEM. 268:3911-3919(1993).
CC -!- CATALYTIC ACTIVITY: 2 GLYOXYLATE = TARTRONATE SEMIALDEHYDE +
CC CO(2).
CC -!- COFACTOR: THIAMINE PYROPHOSPHATE FLAVOPROTEIN, AND MAGNESIUM ION.
CC -!- SUBUNIT: HOMOTETRAMER.
CC -!- PATHWAY: GLYOXYLATE CATABOLISM.
CC -!- INDUCTION: BY GLYOXYLATE.
CC -!- SIMILARITY: WITH OTHER ENZYMES WHICH REQUIRE TPP.
DR EMBL; L03845; ECGCL.
DR PIR; JT0742; JT0742.
DR ECOGENE; EG11583; GCL.
KW LYASE; FLAVOPROTEIN; THIAMINE PYROPHOSPHATE; FAD; MAGNESIUM.
FT INIT_MET 0 0
SQ SEQUENCE 592 AA; 64600 MW; 1701474 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
II III 

A VT VRE AAL VPR VLQQAFHLMRSGRPGPVLVDLPFDVQV AE I EFDPDHYEPLPVYKPAASRMQ I EKAVEHL I 
130 140 150 160 170 180 X 190 200 



QAERPVIVAGGGVINADAAALLQQFAELTSVPVI 
210 220 230 



8. US-08-249-182-2 (1-6) 

T3MO_ECOLI TYPE III RESTRICTION-MODIFICATION SYSTEM ECOP15 EN
ID T3MO_ECOLI STANDARD; PRT; 645 AA.
AC P12364;
DT 01-OCT-1989 (REL. 12, CREATED)
DT 01-OCT-1989 (REL. 12, LAST SEQUENCE UPDATE)
DT 01-MAR-1992 (REL. 21, LAST ANNOTATION UPDATE)
DE TYPE III RESTRICTION-MODIFICATION SYSTEM ECOP15 ENZYME MOD
DE (EC 2.1.1.72).
GN MOD.
OS ESCHERICHIA COLI.
OG PLASMID P15B.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 88245189
RA HUEMBELIN M., SURI B., RAO D.N., HORNBY D.P., EBERLE H., PRIPFL T.,
RA KENEL S., BICKLE T.A.;
RL J. MOL. BIOL. 200:23-29(1988).
CC -!- FUNCTION: THIS PROTEIN IS RESPONSIBLE FOR BINDING THE SYSTEM-
CC SPECIFIC DNA RECOGNITION SITE IN BOTH RESTRICTION AND THE
CC METHYLATION OF ONE OF THE ADENOSYL RESIDUES IN THE RECOGNITION
CC SITE 5'-CAGCAC-3'.
CC -!- CATALYTIC ACTIVITY: S-ADENOSYL-L-METHIONINE + DNA ADENINE =
CC S-ADENOSYL-L-HOMOCYSTEINE + DNA 6-METHYLAMINOPURINE.
CC -!- SUBUNIT: TYPE III RESTRICTION SYSTEM CONTAINS TWO DIFFERENT
CC SUBUNITS: THE PRODUCTS OF THE RES AND MOD GENES.
CC -!- SUBUNIT: TYPE III MODIFICATION ENZYME IS A TETRAMER OF A
CC SINGLE SUBUNIT, THE PRODUCT OF THE MOD GENE.
DR EMBL; X06288; ECP15BM0.
DR PIR; S03208; S03208.
DR REBASE; ECOP15I; RELEASE 9402.
DR PROSITE; PS00092; N6_MTASE.
KW TRANSFERASE; METHYLTRANSFERASE; DNA-BINDING; RESTRICTION SYSTEM;
KW PLASMID.
SQ SEQUENCE 645 AA; 74222 MW; 2156590 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
III II 

EQLISEUKSNISDVKNLLVNIGEEFASKYTGNELQEKYTQUFREHRSELUPLDRYKYIDKDGIYTGS9SVHN 
260 270 280 290 300 X 310 320 

PGKEGYRYDIIHPKTKKPCKQQPLMGYRFPLDTM 
330 340 350 



9. US-08-249-182-2 (1-6) 

CYG3_RAT GUANYLATE CYCLASE SOLUBLE, ALPHA-1 CHAIN (EC 4.6.1
ID CYG3_RAT STANDARD; PRT; 690 AA.
AC P19686;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GUANYLATE CYCLASE SOLUBLE, ALPHA-1 CHAIN (EC 4.6.1.2) (82 KD CHAIN).
OS RATTUS NORVEGICUS (RAT).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
RN [1]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RC TISSUE=LUNG;
RM 91009100
RA NAKANE M., ARAI K., SAHEKI S., KUNO T., BUECHLER W., MURAD F.;
RL J. BIOL. CHEM. 265:16841-16845(1990).
CC -!- CATALYTIC ACTIVITY: GTP = 3',5'-CYCLIC GMP + PYROPHOSPHATE.
CC -!- SUBUNIT: HETERODIMER OF AN ALPHA AND A BETA CHAIN.
CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
CC -!- THERE ARE TWO TYPES OF GUANYLATE CYCLASES: SOLUBLE FORMS AND
CC MEMBRANE-ASSOCIATED RECEPTOR FORMS.
CC -!- SIMILARITY: TO OTHER GUANYLATE CYCLASES.
DR EMBL; M57405; RNGCSB.
DR PIR; A38297; OYRTA1.
DR PROSITE; PS00452; GUANYLATE_CYCLASES.
KW LYASE; CGMP SYNTHESIS; MULTIGENE FAMILY.
SQ SEQUENCE 690 AA; 77566 MW; 2459503 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
lllll 

GQIV8AKKFNEVTMLFSDIVGFTAICSQCSPLQVITMLNALYTRFD88CGELDVYKVETIGDAYCVAGGLHR 
470 480 490 500 510 520 X 530 



ESDTHAVGIALMALKMHELSNEVHSPHGEPIKHR 
540 550 560 570 



10. US-08-249-182-2 (1-6) 

CYG3_BOVIN GUANYLATE CYCLASE SOLUBLE, ALPHA-1 CHAIN (EC 4.6.1
ID CYG3_BOVIN STANDARD; PRT; 691 AA.
AC P19687;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GUANYLATE CYCLASE SOLUBLE, ALPHA-1 CHAIN (EC 4.6.1.2) (73 KD CHAIN).
OS BOS TAURUS (BOVINE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; ARTIODACTYLA.
RN [1]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RC TISSUE=ADRENAL MEDULLA;
RM 90306336
RA KOESLING D., HARTENECK C., HUMBERT P., BOSSERHOFF A., FRANK R.,
RA SCHULTZ G., BOEHME E.;
RL FEBS LETT. 266:128-132(1990).
CC -!- CATALYTIC ACTIVITY: GTP = 3',5'-CYCLIC GMP + PYROPHOSPHATE.
CC -!- SUBUNIT: HETERODIMER OF AN ALPHA AND A BETA CHAIN.
CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
CC -!- THERE ARE TWO TYPES OF GUANYLATE CYCLASES: SOLUBLE FORMS AND
CC MEMBRANE-ASSOCIATED RECEPTOR FORMS.
CC -!- SIMILARITY: TO OTHER GUANYLATE CYCLASES.
DR EMBL; X54014; BTGUCY73.
DR PIR; S10713; OYB077.
DR PROSITE; PS00452; GUANYLATE_CYCLASES.
KW LYASE; CGMP SYNTHESIS; MULTIGENE FAMILY.
SQ SEQUENCE 691 AA; 77532 MW; 2500456 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1



v 



X X 
PLDVYK 
Hill 

GHAVQAKRFGNVTNLFSDIVGFTAICSQCSPLQVITHLNALYTRFDRQCGELDVYKVETIGDAYCVAGGLHK 
470 480 490 500 510 520 X 530 540 



ESDTHAVQIALHALKMMELSHEVVSPHGEPIKMR 
550 560 570 



11. US-08-249-182-2 (1-6)

CYG5_HUMAN GUANYLATE CYCLASE SOLUBLE, ALPHA-3 CHAIN (EC 4.6.1
ID CYG5_HUMAN STANDARD; PRT; 717 AA.
AC Q02108;
DT 01-JUL-1993 (REL. 26, CREATED)
DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GUANYLATE CYCLASE SOLUBLE, ALPHA-3 CHAIN (EC 4.6.1.2).
GN GUCSA3.
OS HOMO SAPIENS (HUMAN).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; PRIMATES.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=BRAIN;
RM 92316204
RA GIUILI G., SCHOLL U., BULLE F., GUELLAEN G.;
RL FEBS LETT. 304:83-88(1992).
CC -!- CATALYTIC ACTIVITY: GTP = 3',5'-CYCLIC GMP + PYROPHOSPHATE.
CC -!- SUBUNIT: HETERODIMER OF AN ALPHA AND A BETA CHAIN.
CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
CC -!- THERE ARE TWO TYPES OF GUANYLATE CYCLASES: SOLUBLE FORMS AND
CC MEMBRANE-ASSOCIATED RECEPTOR FORMS.
CC -!- SIMILARITY: TO OTHER GUANYLATE CYCLASES.
DR EMBL; X66534; HSGCSAA.
DR PIR; S23098; S23098.
DR MIM; 139396; TENTH EDITION.
DR PROSITE; PS00452; GUANYLATE_CYCLASES.
KW LYASE; CGMP SYNTHESIS; MULTIGENE FAMILY.
SQ SEQUENCE 717 AA; 81377 MW; 2740969 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 
Mill 

GQVVQ AKKFSNVTMLFSD I VGFTA ICSQCSPLQV ITMLNAL YTRFDQQCGELDV YKVET I AMP I VNLGGLHK 
470 480 490 500 510 X 520 530 



ESDTH A VQ I ALMALKMMELSDE VHSPHGEP I KMR 
540 550 560 570 



12. US-08-249-182-2 (1-6) 

NAMI_CANFA SODIUM/MYO-INOSITOL COTRANSPORTER (NA(+)/MYO-INOSI
ID NAMI_CANFA STANDARD; PRT; 718 AA.
AC P31637;
DT 01-JUL-1993 (REL. 26, CREATED)
DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE SODIUM/MYO-INOSITOL COTRANSPORTER (NA(+)/MYO-INOSITOL COTRANSPORTER).
GN SMIT.
OS CANIS FAMILIARIS (DOG).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; CARNIVORA.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=KIDNEY;
RM 92210609
RA KWON H.M., YAMAUCHI A., UCHIDA S., PRESTON A.S., GARCIA-PEREZ A.,
RA BURG M.B., HANDLER J.S.;
RL J. BIOL. CHEM. 267:6297-6301(1992).
CC -!- FUNCTION: PREVENTS INTRACELLULAR ACCUMULATION OF HIGH
CC CONCENTRATIONS OF MYO-INOSITOL (AN OSMOLYTE) THAT RESULT IN
CC IMPAIRMENT OF CELLULAR FUNCTION.
CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC -!- TISSUE SPECIFICITY: BRAIN AND KIDNEY.
CC -!- INDUCTION: MEDIUM HYPERTONICITY.
CC -!- SIMILARITY: BELONGS TO THE SODIUM:SOLUTE SYMPORTER FAMILY (SSF).
DR EMBL; M85068; CFSMIT.
DR PROSITE; PS00456; NA_SOLUT_SYMP_1.
DR PROSITE; PS00457; NA_SOLUT_SYMP_2.
KW TRANSPORT; TRANSMEMBRANE; SODIUM TRANSPORT; SYMPORT; GLYCOPROTEIN.
FT DOMAIN     1   9 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  10  29 POTENTIAL.
FT DOMAIN    30  38 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM  39  57 POTENTIAL.
FT DOMAIN    58  86 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM  87 110 POTENTIAL.
FT DOMAIN   111 123 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 124 144 POTENTIAL.
FT DOMAIN   145 157 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 158 183 POTENTIAL.
FT DOMAIN   184 186 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 187 205 POTENTIAL.
FT DOMAIN   206 303 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 304 324 POTENTIAL.
FT DOMAIN   325 353 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 354 376 POTENTIAL.
FT DOMAIN   377 406 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 407 430 POTENTIAL.
FT DOMAIN   431 443 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 444 462 POTENTIAL.
FT DOMAIN   463 510 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 511 532 POTENTIAL.
FT DOMAIN   533 695 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 696 716 POTENTIAL.
FT CARBOHYD  32  32 POTENTIAL.
FT SITE      24  24 IMPLICATED IN SODIUM COUPLING
FT                  (BY SIMILARITY).
FT SITE     285 285 IMPLICATED IN SODIUM COUPLING
FT                  (BY SIMILARITY).
SQ SEQUENCE 718 AA; 79545 MW; 2760723 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 



GSRAGCSNIAYPRLVMKLVPVGLRGLMMAVHIAALMSDLDSIFNSASTIFTLDVYKLIRRSASSRELHIVGR 
350 360 370 380 390 X 400 410 



IFVAFMVVISIAWVPIIVEMQGGQNYLYIQEVAD 
420 430 440 



13. US-08-249-182-2 (1-6)

DPOL_SULSO DNA POLYMERASE (EC 2.7.7.7).
ID DPOL_SULSO STANDARD; PRT; 882 AA.
AC P26811;
DT 01-AUG-1992 (REL. 23, CREATED)
DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)
DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE DNA POLYMERASE (EC 2.7.7.7).
GN POLS.
OS SULFOLOBUS SOLFATARICUS.
OC PROKARYOTA; MENDOSICUTES; ARCHAEBACTERIA; SULFOLOBALES.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=MT 4;
RM 92310966
RA PISANI F.M., MARTINO C., ROSSI M.;
RL NUCLEIC ACIDS RES. 20:2711-2716(1992).
CC -!- CATALYTIC ACTIVITY: N DEOXYNUCLEOSIDE TRIPHOSPHATE =
CC N PYROPHOSPHATE + DNA (N).
CC -!- SIMILARITY: BELONGS TO FAMILY B OF DNA POLYMERASES.
DR EMBL; X64466; SSPOLS.
DR PIR; S23019; S23019.
DR PROSITE; PS00116; DNA_POLYMERASE_B.
KW DNA-DIRECTED DNA POLYMERASE; DNA REPLICATION; DNA-BINDING.
SQ SEQUENCE 882 AA; 101333 MW; 42S8623 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
PLDVYK 
III II 

VKELNISINSPNDVKEIKRKIVDVVKGSYEKLKNKGYNLDELAFKVHLSKPLDAYKKNTP8HVKAALQLRPF 
740 750 760 770 780 X 790 800 



GVNVLPRDIIYYVKVRSKDGVKPVQLAKVTEIDA 
810 820 830 



14. US-08-249-182-2 (1-6) 

EBA1_PLAFC ERYTHROCYTE-BINDING ANTIGEN EBA-175.
ID EBA1_PLAFC STANDARD; PRT; 1426 AA.
AC P19214;
DT 01-NOV-1990 (REL. 16, CREATED)
DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)
DT 01-AUG-1991 (REL. 19, LAST ANNOTATION UPDATE)
DE ERYTHROCYTE-BINDING ANTIGEN EBA-175.
OS PLASMODIUM FALCIPARUM (ISOLATE CAMP / MALAYSIA).
OC EUKARYOTA; PROTOZOA; APICOMPLEXA; SPOROZOA; COCCIDIA; EUCOCCIDIIDA.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=LATE SCHIZONT;
RM 90377299
RA SIM B.K.L.;
RL MOL. BIOCHEM. PARASITOL. 41:293-296(1990).
DR EMBL; X52524; PFEBA175.
DR PIR; S11561; S11561.
KW ANTIGEN.
FT DOMAIN 159 1101 ESSENTIAL FOR BINDING TO
FT ERYTHROCYTES.
FT VARIANT 1028 1028 E -> V (IN STRAINS FCR3 AND ITG).



Query sequence being compared: US-08-249-182-3 (1-5)
Number of sequences searched: 42145
Number of scores above cutoff: 4652

Results of the initial comparison of US-08-249-182-3 (1-5) with:
Data bank : A-GeneSeq 15, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) plotted against SCORE, with a STDEV scale]



PARAMETERS 



Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
PLDVYK

Hill 

MKCNISIYFFASFFVLYFAKARNEYDIKENEKFLDVYKEKFNELDKKKYGNV8KTDKKIFTFIENKLDILNN 
10 20 30 X 40 SO 60 70 

SKFNKRHKSYGTPDNI 
80 



15. US-08-249-182-2 (1-6) 

PR08_YEAST PRE-MRNA SPLICING FACTOR PRP8.
ID PR08_YEAST STANDARD; PRT; 2413 AA.
AC P33334;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE PRE-MRNA SPLICING FACTOR PRP8.
GN PRP8 OR RNA8.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RA HODGES P.E., JACKSON S.P., BROWN J.D., BEGGS J.D.;
RL SUBMITTED (JUL-1993) TO EMBL/GENBANK/DDBJ DATA BANKS.
RN [2]
RP CHARACTERIZATION.
RM 88216580
RA JACKSON S.P., LOSSKY M., BEGGS J.D.;
RL MOL. CELL. BIOL. 8:1067-1075(1988).
CC -!- FUNCTION: INVOLVED IN PRE-MRNA SPLICING. U5 SNRNP PROTEIN.
CC APPEARS TO CONTACT THE PRE-MRNA DURING SPLICING.
CC -!- SUBCELLULAR LOCATION: NUCLEAR.
CC -!- SIMILARITY: TO C.ELEGANS PROTEIN C50C3.6.
DR EMBL; Z24732; SCPRP8GNA.
DR PIR; S34670; S34670.
KW MRNA PROCESSING; MRNA SPLICING; SPLICEOSOME; NUCLEAR PROTEIN.
SQ SEQUENCE 2413 AA; 279501 MW; 19891968 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.63
Residue Identity = 83% Matches = 5 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X 
PLDVYK 
II III 

MENYQNISPVYSVDPLEKITDAYLDQYLUYEADQRKLFPNMIKPSDSEIPPLLVYKUTSGINNLSEIUDVSR 
980 990 1000 1010 1020 X 1030 1040 



GQSAVLLETTLGEMAEKIDFTLLNRLLRLIVDPN 
1050 1060 1070 

> 0 < 

0| |0 IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_3a.res made by on Thu 22 Sep 94 10:29:31-PDT.



#title The DNA-binding protein HU from mesophilic and thermophilic
bacilli: gene cloning, overproduction and purification.
#cross-references MUID:92354934
#accession JC1206
##molecule_type DNA
##residues 1-90 ##label PAD
##cross-references GB:W73501
COMMENT This protein binds to both single-stranded and double-stranded DNA
and to RNA in stoichiometric amounts.
GENETICS
#gene hup
CLASSIFICATION #superfamily bacterial DNA-binding protein
KEYWORDS DNA binding; RNA binding
SUMMARY #length 90 #molecular-weight 9716 #checksum 4525
SEQUENCE

Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
YPAFK 
till 

DAVFDS I TEALRKGDKV9L 1 GFGNFEVRERA ARKGRNPOTGEEME I PASKVPAFKPGKALKDAVK 
30 40 50 60 70 X 80 90 
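
Each hit in this report places the short query against a window of the database sequence and counts identical residues. As a rough illustration only, and not FastDB's actual algorithm, the sketch below slides a query along a subject string without gaps and returns the best-scoring offset. The function name and the ten-residue subject fragment (taken from the alignment above, around the matched window) are illustrative assumptions.

```python
def best_ungapped_hit(query: str, subject: str) -> tuple[int, int]:
    """Return (offset, matches) for the best ungapped placement of query on subject.

    Ties are resolved in favour of the leftmost offset.
    """
    if len(query) > len(subject):
        raise ValueError("query longer than subject")
    best_offset, best_matches = 0, -1
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        matches = sum(q == s for q, s in zip(query, window))
        if matches > best_matches:
            best_offset, best_matches = offset, matches
    return best_offset, best_matches


# Query YPAFK against a fragment of the JC1206 hit shown above.
offset, matches = best_ungapped_hit("YPAFK", "SKVPAFKPGK")
print(offset, matches, len("YPAFK") - matches)
```

For this fragment the best window is VPAFK, with 4 matches and 1 mismatch, which agrees with the Matches and Mismatches values printed for the hit.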

> 0 < 

0| |0 IntelliGenetics

> 0 <

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249JJs.res made by on Thu 22 Sep 94 10:58:04-PDT.

Query sequence being compared: US-08-249-182-3 (1-5)
Number of sequences searched: 36000
Number of scores above cutoff: 3832

Results of the initial comparison of US-08-249-182-3 (1-5) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis); only the 500 to 100000 range of the plot was captured]



KAEKGW 
80 



14. US-08-249-182-3 (1-5) 

CCAI6 cytochrome c6 - Anabaena variabilis

ENTRY CCAI6 #type complete
TITLE cytochrome c6 - Anabaena variabilis
ALTERNATE_NAMES cytochrome c553; soluble cytochrome f
ORGANISM #formal_name Anabaena variabilis
DATE 17-Dec-1982 #sequence_revision 17-Dec-1982 #text_change 19-May-1994
ACCESSIONS A94488; A93182; A00105
REFERENCE A94488
   #authors Beecher, J.; Margoliash, E.
   #citation unpublished results, cited by Ulrich, E.L., Krogmann, D.W.,
   and Markley, J.L., J. Biol. Chem. 257, 9356-9364, 1982
   #accession A94488
   ##molecule_type protein
   ##residues 1-86 ##label BEE
REFERENCE A93182
   #authors Aitken, A.
   #journal Nature (1976) 263:793-796
   #title Protein evolution in cyanobacteria.
   #cross-references MUID:77056395
   #accession A93182
   ##molecule_type protein
   ##residues 1-22;30-39;56-81,'D',83,'D',85-86 ##label AIT
CLASSIFICATION #superfamily cytochrome c; cytochrome c homology
KEYWORDS electron transfer; heme; photosynthesis
FEATURE
   14,17 #binding_site heme (Cys) (covalent) #status predicted\
   18,58 #binding_site heme iron (His, Met) (axial ligands)
   #status predicted
SUMMARY #length 86 #molecular-weight 8973 #checksum 5450
SEQUENCE

Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
llll 

K I FSANCASCHAGGKNLGV AQKTLKK ADLEKYGAYSAHAI GAflVTNGKNANPAFKGRLKPEE I ZBV AAY VLG 
10 20 30 40 50 60 X 70 

KAEAEWK 
80 



15. US-08-249-182-3 (1-5) 

JC1206 DNA-binding protein HU - Bacillus caldotenax

ENTRY JC1206 #type complete
TITLE DNA-binding protein HU - Bacillus caldotenax
ORGANISM #formal_name Bacillus caldotenax
DATE 05-Mar-1993 #sequence_revision 05-Mar-1993 #text_change 23-Mar-1993
ACCESSIONS JC1206
REFERENCE JC1205
   #authors Padas, P.M.; Wilson, K.S.; Vorgias, C.E.
   #journal Gene (1992) 117:39-44



   ##molecule_type protein
   ##residues 1-82 ##label AMB
GENETICS
   #gene nirM
CLASSIFICATION #superfamily cytochrome c; cytochrome c homology
KEYWORDS electron transfer; heme; oxidative phosphorylation
FEATURE
   12,15 #binding_site heme (Cys) (covalent) #status predicted\
   16,61 #binding_site heme iron (His, Met) (axial ligands)
   #status predicted
SUMMARY #length 82 #molecular-weight 8612 #checksum 6641
SEQUENCE

Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
IIII 

QDGEALFKSKPCAACHSIDAKLVGPAFKEVAAKYAG8DGAADLLAGHIKNGS0GVMGPIPMPPNPVTEEEAK 
10 20 X 30 40 50 60 70 



ILAEUI 



13. US-08-249-182-3 (1-5) 

CCPB6 cytochrome c6 - Plectonema boryanum

ENTRY CCPB6 #type complete
TITLE cytochrome c6 - Plectonema boryanum
ALTERNATE_NAMES cytochrome c553; soluble cytochrome f
ORGANISM #formal_name Plectonema boryanum
DATE #sequence_revision 07-May-1981 #text_change 19-May-1994
ACCESSIONS A00109
REFERENCE A00109
   #authors Aitken, A.
   #journal Eur. J. Biochem. (1977) 78:273-279
   #title Purification and primary structure of cytochrome f from the
   cyanobacterium, Plectonema boryanum.
   #cross-references MUID:78023897
   #contents CCAP 1462/2
   #accession A00109
   ##molecule_type protein
   ##residues 1-85 ##label AIT
COMMENT Plectonema is a genus of filamentous blue-green algae.
CLASSIFICATION #superfamily cytochrome c; cytochrome c homology
KEYWORDS electron transfer; heme; photosynthesis
FEATURE
   14,17 #binding_site heme (Cys) (covalent) #status experimental\
   16,58 #binding_site heme iron (His, Met) (axial ligands)
   #status predicted
SUMMARY #length 85 #molecular-weight 8576 #checksum 3761
SEQUENCE

Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
IIII 

KVFNANCAACHASGGGfllNGAKTLKKNALTANGKDTVEAIVAfiVTNGKGANPAFKGRLSDDSIfiSVALYVLD 



   submitted to the EMBL Data Library, July
   #accession S35946
   ##status preliminary
   ##residues 1-43 ##label BAC
   ##cross-references EMBL:X74279
SUMMARY #length 43 #checksum 2542
SEQUENCE

Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
III) 

NLNDMRQFAEMHD I E I TV I DNDTRLPAFKDALRWNE VY YGFRR 
10 20 X 30 40 



11. US-08-249-182-3 (1-5) 

S10561 photosystem II 21K protein - barley

ENTRY S10561 #type complete

TITLE photosystem II 21K protein - barley

ORGANISM #formal_name Hordeum vulgare #common_name barley

DATE 02-Dec-1993; #sequence_revision 02-Dec-1993; #text_change

02-Dec-1993
ACCESSIONS S10561
REFERENCE S10561

#authors Morishige, D.T.; Anandan, S.; Jaing, J.T.; Thornber, J.P.

#journal FEBS Lett. (1990) 264:239-242

#title Amino-terminal sequence of the 21 kDa apoprotein of a minor

light-harvesting pigment-protein complex of the photosystem
II antenna (LHC IId/CP24).
#cross-references MUID:90292215
#accession S10561

##status preliminary
##residues 1-66 ##label MOR
SUMMARY #length 66 #molecular-weight 7169 #checksum 8176

SEQUENCE 



Initial Score = 4 Optimized Score = 4 Significance = 3.06
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 



AAAGKKSWIPAFKSDAEFINPSWLDGSLPGDFGFDPLGLGKDPAFLKWYGWAELIXXXWAMXA 
10 X 20 30 40 50 60 



12. US-08-249-182-3 (1-5) 

CCPS5S cytochrome c551 - Pseudomonas stutzeri (strain 221)

ENTRY CCPS5S #type complete

TITLE cytochrome c551 - Pseudomonas stutzeri (strain 221)

ORGANISM #formal_name Pseudomonas stutzeri

DATE #sequence_revision 13-Jul-1981 #text_change 19-May-1994

ACCESSIONS A00093
REFERENCE A90266

#authors Ambler, R.P.; Wynn, M.

#journal Biochem. J. (1973) 131:485-498

#title The amino acid sequences of cytochromes c-551 from three

species of Pseudomonas.
#cross-references MUID:73224976



H«tCJC7 



##status preliminary
##molecule_type protein
##residues 1-114 ##label SIR

##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;

NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
NCBIP:78509; NCBIP:78508; NCBIP:78503
##note sequence extracted from NCBI backbone

SUMMARY #length 114 #checksum 7335

SEQUENCE 



Initial Score = 5 Optimized Score = 5 Significance = 4.08 
Residue Identity = 100X Matches = 5 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 



X X 
YPAFK 
|||||

GPTFKGGQPLWITATKSPPFENINLYYDVPHNETIPEEVTXPNYL9AEVSYPAFKPXLDVYKHHVAAN 
50 60 70 80 90 100 110 



9. US-08-249-182-3 (1-5) 

S12888 DNA-binding protein II - Thermus aquaticus 

ENTRY S12888 #type complete 

TITLE DNA-binding protein II - Thermus aquaticus 

ORGANISM #formal_name Thermus aquaticus 

DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change

21-Nov-1993
ACCESSIONS S12888
REFERENCE S12888

#authors Zierer, R.; Choli, D.

#journal FEBS Lett. (1990) 273:59-62

#title The primary structure of DNA binding protein II from the

extreme thermophilic bacterium Thermus thermophilus.
#cross-references MUID:91032203
#accession S12888

##status preliminary
##residues 1-95 ##label ZIE
SUMMARY #length 95 #molecular-weight 10163 #checksum 68

SEQUENCE 



Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
|||||

DALLAKVEEALANGSKVQLTGFGTFEVRKRKARTGVKPGTKEKIKIPAT9YPAFKPGKALKDKVK 
40 50 60 70 80 X 90 



10. US-08-249-182-3 (1-5) 
S35946 araA protein - Escherichia coli (fragment)

ENTRY S35946 #type fragment

TITLE araA protein - Escherichia coli (fragment)

ORGANISM #formal_name Escherichia coli

DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change

22-Nov-1993
ACCESSIONS S35946
REFERENCE S35946

#authors Bachellier, S.; Saurin, W.; Perrin, D.; Hofnung, M.; Gilson, E.



X X 
YPAFK 
|||||

SLRTEENLAVVK8IPTERLLLETDAPHCEIKRTHASF6YLAKY8EVRDFEYPAFKSVKKNKLADKLNAEELY 
310 320 330 340 350 X 360 370 

MVKGRNEPCNME8VAIVVSEVKDVDLATLIDTT 
380 390 400 



7. US-08-249-182-3 (1-5) 

S39833 hypothetical protein YBL0511 - yeast (Saccharomyces cerevisiae)

ENTRY S39833 #type complete
TITLE hypothetical protein YBL0511 - yeast (Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 27-May-1994; #sequence_revision 27-May-1994; #text_change 27-May-1994
ACCESSIONS S39833
REFERENCE S39824
#authors Scherens, B.; el Bakkoury, M.; Vierendeels, F.; Dubois, E.;
Messenguy, F.
#journal Yeast (1993) 9:1355-1371
#title Sequencing and functional analysis of a 32 560 bp segment on
the left arm of yeast chromosome II. Identification of 26
open reading frames, including the KIP1 and SEC17 genes.
#accession S39833
##status preliminary
##residues 1-418 ##label SCH
##cross-references EMBL:Z23261
SUMMARY #length 418 #molecular-weight 47390 #checksum 9974

SEQUENCE



Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
|||||

SLRTEENLAVVK9IPTERLLLETDAPHCEIKRTHASF8YLAKY8EVRDFEYPAFKSVKKNKLADKLNAEELY 
310 320 330 340 350 X 360 370 

MVKGRNEPCNME8VAIVVSEVKDVDLATLIDTT
380 390 400 



US-08-249-182-3 (1-5) 

A42329 autotaxin - human (fragments)



ENTRY A42329 #type fragments

TITLE autotaxin - human (fragments)

ORGANISM #formal_name Homo sapiens #common_name man

DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change

08-May-1993
ACCESSIONS A42329
REFERENCE A42329

#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.

#journal J. Biol. Chem. (1992) 267:2524-2529

#title Identification, purification, and partial sequence analysis

of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337



|||||

MDARATIPEHIARTVILP8GYADDEVIYPAFKWLRDEQPLAMAHIEGYDPMWIATKHADVMQIGK9PGLFSN 
10 20 30 X 40 50 60 70 

AEGSEILYDG 
80 



5. US-08-249-182-3 (1-5) 

S27653 cytochrome P450-terpredoxin - Pseudomonas sp. 

ENTRY S27653 #type complete

TITLE cytochrome P450-terpredoxin - Pseudomonas sp.

ORGANISM #formal_name Pseudomonas sp.

DATE 25-Feb-1994; #sequence_revision 25-Feb-1994; #text_change

25-Feb-1994
ACCESSIONS S27653
REFERENCE S27651

#authors Peterson, J.A.; Lu, J.Y.; Geisselsoder, J.; Graham-Lorence,

S.; Carmona, C.; Witney, F.; Lorence, M.C.
#submission submitted to the EMBL Data Library, April 1992
#accession S27653

##status preliminary

##residues 1-428 ##label PET
##cross-references EMBL:M91440
SUMMARY #length 428 #molecular-weight 47922 #checksum 8787

SEQUENCE 

Initial Score = 5 Optimized Score = 5 Significance = 4.08 
Residue Identity = 100% Matches = 5 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
|||||

MDARATIPEHIARTVILPSGYADDEVIYPAFKHLRDE9PLAMAHIEGYDPMWIATKHADVH8IGKflPGLFSN 
10 20 30 X 40 50 60 70 

AEGSEILYDG 
80 



6. US-08-249-182-3 (1-5) 

S37334 hypothetical protein YBL0511 - yeast (Saccharomyces cerevisiae)

ENTRY S37334 #type complete
TITLE hypothetical protein YBL0511 - yeast (Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 31-Dec-1993
ACCESSIONS S37334
REFERENCE S37325
#authors Scherens, B.
#submission submitted to the EMBL Data Library, July 1993
#accession S37334
##molecule_type DNA
##residues 1-418 ##label SCH
##cross-references EMBL:Z23261
GENETICS
#map_position 2
SUMMARY #length 418 #molecular-weight 47390 #checksum 9974

SEQUENCE



Initial Score = 5 Optimized Score = 5 Significance = 4.08 
Residue Identity = 100% Matches = 5 Mismatches = 0



TITLE prismane protein - Desulfovibrio desulfuricans 

ORGANISM #formal_name Desulfovibrio desulfuricans

DATE 25-Feb-1994; #sequence_revision 25-Feb-1994; #text_change

25-Feb-1994
ACCESSIONS S24389
REFERENCE S24389

#authors Stokkermans, J.P.W.G.; van den Berg, W.A.M.; van Dongen,

W.M.A.M.; Veeger, C.
#journal Biochim. Biophys. Acta (1992) 1132:83-87
#title The primary structure of a protein containing a putative

[6Fe-6S] prismane cluster from Desulfovibrio desulfuricans
(ATCC 27774).
#cross-references MUID:92379097
#accession S24389

##status preliminary
##residues 1-545 ##label STO
##cross-references EMBL:Z11975
SUMMARY #length 545 #molecular-weight 58659 #checksum 9858

SEQUENCE 



Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
|||||

EITKVNIGVGSNPGILISGHDLRDLEMLLK9TEGTGVDVYTHSEMLPAHYYPAFKKYAKFKGNYGNAHHK8K 
230 240 250 260 270 X X 280 290 



EEFESFNGPVLLTTNCLVPPKDSYKDRVYTTGI 
300 310 320 



US-08-249-182-3 (1-5) 

A42971 cytochrome P-450terp, P-450terp - Pseudomonas sp.

ENTRY A42971 #type complete
TITLE cytochrome P-450terp, P-450terp - Pseudomonas sp.
ORGANISM #formal_name Pseudomonas sp.
DATE 04-Mar-1993; #sequence_revision 04-Mar-1993; #text_change 04-Mar-1993
ACCESSIONS A42971
REFERENCE A42971
#authors Peterson, J.A.; Lu, J.Y.; Geisselsoder, J.; Graham-Lorence,
S.; Carmona, C.; Witney, F.; Lorence, M.C.
#journal J. Biol. Chem. (1992) 267:14193-14203
#title Cytochrome P-450terp. Isolation and purification of the
protein and cloning and sequencing of its operon.
#cross-references MUID:92332528
#accession A42971
##status preliminary
##molecule_type nucleic acid; protein
##residues 1-428 ##label PET
##cross-references NCBIP:108469
##note sequence extracted from NCBI backbone
SUMMARY #length 428 #molecular-weight 47922 #checksum 8787

SEQUENCE



Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 



##residues 1-1096 ##label KAT
GENETICS
#gene pulA
KEYWORDS glycosidase; hydrolase
FEATURE
   1-19      #domain signal sequence #label SIG\
   20-1096   #protein alpha-dextrin endo-1,6-alpha-glucosidase #label MAT
SUMMARY #length 1096 #molecular-weight 119335 #checksum 1390

SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 
|||||

GENKPIVRLYYSHSSKVAADSNGEFSDKYVKLTPTTVN68VSMRFPHLASYPAFKLPDDVNVDELLSGDDGG 
210 220 230 240 250 260 X 270 280 



IAESDGILSLSHPGADRRRAGRYLCRRAEALSY
290 300 310 



2. US-08-249-182-3 (1-5) 

S29861 prismane protein - Desulfovibrio vulgaris

ENTRY S29861 #type complete

TITLE prismane protein - Desulfovibrio vulgaris

ORGANISM #formal_name Desulfovibrio vulgaris

DATE 08-Dec-1993; #sequence_revision 08-Dec-1993; #text_change

08-Dec-1993
ACCESSIONS S29861
REFERENCE S29861

#authors Stokkermans, J.P.W.G.; Pierik, A.J.; Wolbert, R.B.G.; Hagen,

W.R.; van Dongen, W.M.A.M.; Veeger, C.
#journal Eur. J. Biochem. (1992) 208:435-442
#title The primary structure of a protein containing a putative

[6Fe-6S] prismane cluster from Desulfovibrio vulgaris
(Hildenborough).
#accession S29861

##status preliminary
##residues 1-553 ##label STO
##cross-references EMBL:Z11707
SUMMARY #length 553 #molecular-weight 60163 #checksum 5194

SEQUENCE



Initial Score = 5 Optimized Score = 5 Significance = 4.08
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 

|||||

EIT9VNIGVGKNPGILISGHDLKDMAELLKQTEGTGVDVYTHGEHLPANYYPAFKKYPHFVGNYGGSWWQ0N 
230 240 250 260 270 X 280 290 



PEFESFNGPILLTTNCLVPLKKENTYLDRLYTT 
300 310 320 



3. US-08-249-182-3 (1-5) 

S24389 prismane protein - Desulfovibrio desulfuricans



 4. A42971   cytochrome P-450terp, P-450te    428     5     5   4.08    0
 5. S27653   cytochrome P450-terpredoxin -    428     5     5   4.08    0
 6. S37334   hypothetical protein YBL0511     418     5     5   4.08    0
 7. S39833   hypothetical protein YBL0511     418     5     5   4.08    0
 8. A42329   autotaxin - human (fragments)    114     5     5   4.08    0
 9. S12888   DNA-binding protein II - Ther     95     5     5   4.08    0



The list of other best scores is:

                                                    Init.  Opt.
Sequence Name   Description                  Length Score Score   Sig. Frame

**** 3 standard deviations above mean ****






10. S35946   araA protein - Escherichia co     43     4     4   3.06    0
11. S10561   photosystem II 21K protein -      66     4     4   3.06    0


12. 


CCPS5S 


LyuOLnropic cjji rseuuopionas 


AO A i 
DC 4 J 


I 3.06 


0 


13. CCPB6    cytochrome c6 - Plectonema bo     85     4     4   3.06    0


14. 


CCAI6 


cyLQunronc co nnaoaena van 


AA A i 
00 4 ' 


I 3.06 


0 


15. 


JC1206 


Lfwfl-Dinoing protein ho - oaci 


OA A 


f 3.06 


0 


16. 


JC1207 


LfNH~Dinoing protein nu oaci 


OA A 


I 3.06 


0 


17. 


DNBS2F 


utiH~Di noxng protein nu - oaci 


on a i 
70 4 J 


I 3.06 


0 


18. 


S24373 


UMH-Dinoing protein nosu tsyn 


00 A j 
7C 4 ' 


f 3.06 


0 


19. 


JC1208 


y»n oinuiny protein nu * oaci 


05 A i 
7C 4 ' 


» 3.06 


0 


20. 


JC1209 


uNH-DinDing protein nu oaci 


OO A 

7C *J ' 


1 3.06 


0 


21. 


S00015 


UNfl-Dinoing protein no - oaci 


OO A 

7C 4 J 


1 3.06 


0 


22. 


S28361 


nypotneticai protein lc.i - o 


1 AO A 

102 4 ' 


5 3.06 


0 


23. 


S06985 


nypotneticai protein mitni o 


i n*i a j 
1U0 4 J 


I 3.06 


0 


24. 9QECRP   hypothetical protein A-105 -     105     4     4   3.06    0


25. 


S16667 


nypotneticai protein ric - ri 


1 1 A A 

110 4 i 


f 3.06 


0 


26. 


B40785 


imrc protein - rice tungro oa 


1 1 A A 

110 4 * 


» 3.06 


0 


27. 


CCA I 53 


t-yLULnruFlc LO prctUrbur Ana 


111 A i 
ill 4 1 


I 3.06 


0 


28. S10101   nodulation protein ENOD2 - ga    112     4     4   3.06    0
29. S19492   hypothetical protein YCR078C     114     4     4   3.06    0
30. CCRF2C   cytochrome c2 precursor - Rho    137     4     4   3.06    0
31. A35720   hypothetical 16.1K protein (p    146     4     4   3.06    0
32. S16853   plastoquinol--plastocyanin re    160     4     4   3.06    0
33. JFBYA1   mating hormone alpha-1 - yeas    175     4     4   3.06    0
34. A35309   signal peptidase (EC 3.4.-.-)    179     4     4   3.06    0
35. A34229   signal peptidase (EC 3.4.-.-)    192     4     4   3.06    0
36. HCHCA    coagulogen precursor - Atlant    195     4     4   3.06    0
37. R5SP22   ribosomal protein L22 - spina    199     4     4   3.06    0
38. JS1243   coat protein 1 - barley yello    203     4     4   3.06    0
39. A48608   E1 glycoprotein (C-terminal)     207     4     4   3.06    0
40. S24593   hypothetical protein 3 - pota    208     4     4   3.06    0



1. US-08-249-182-3 (1-5) 

A26879 alpha-dextrin endo-1,6-alpha-glucosidase (EC 3.2.1

ENTRY A26879 #type complete

TITLE alpha-dextrin endo-1,6-alpha-glucosidase (EC 3.2.1.41)

precursor - Klebsiella pneumoniae
ALTERNATE_NAMES pullulanase

ORGANISM #formal_name Klebsiella pneumoniae

DATE 30-Jun-1988 #sequence_revision 30-Jun-1988 #text_change

28-Apr-1993
ACCESSIONS A26879
REFERENCE A26879

#authors Katsuragi, N.; Takizawa, N.; Murooka, Y.

#journal J. Bacteriol. (1987) 169:2301-2306

#title Entire nucleotide sequence of the pullulanase gene of

Klebsiella aerogenes W70.

#cross-references MUID:87194626

#contents K. aerogenes, strain W70

#accession A26879



[Histogram, lower part: vertical axis ticks 100, 50, 10, 5; horizontal axes SCORE (0 to 5) and STDEV (0 to 3)]



PARAMETERS

Similarity matrix          Unitary    K-tuple                  2
Mismatch penalty                 1    Joining penalty         20
Gap penalty                   1.00    Window size              5
Gap size penalty              0.05
Cutoff score                     0
Randomization group              0

Initial scores to save          40    Alignments to save      15
Optimized scores to save         0    Display context         50
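The relationship between the unitary similarity matrix and the reported "Residue Identity" can be illustrated with a small sketch. This is only a simplified, ungapped sliding-window identity count, not FastDB's k-tuple search; the function name `best_ungapped_match` is hypothetical, and the two target fragments are short excerpts copied from alignments printed elsewhere in these results.

```python
def best_ungapped_match(query: str, target: str):
    """Slide the query along the target without gaps.

    Returns (matches, start) for the first window with the most identical
    residues; start is 1-based. Each identity counts 1 and each mismatch 0,
    which is what a unitary matrix amounts to in this simplified view.
    """
    best = (-1, 0)
    for start in range(len(target) - len(query) + 1):
        window = target[start:start + len(query)]
        matches = sum(q == t for q, t in zip(query, window))
        if matches > best[0]:
            best = (matches, start + 1)
    return best


QUERY = "YPAFK"  # the five-residue query shown above every alignment

for fragment in ("DFEYPAFKSVKK", "SDCPAFKDGILK"):
    matches, start = best_ungapped_match(QUERY, fragment)
    identity = 100 * matches // len(QUERY)
    print(fragment, start, matches, f"{identity}%")
```

Under these assumptions the first fragment gives 5 matches (100%) and the second gives 4 matches with 1 mismatch (80%), in line with the values printed for the corresponding hits.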



SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           1       3         0.98

Times:     CPU            Total Elapsed
           00:01:23.90    00:01:51.00

Number of residues: 20816057
Number of sequences searched: 70848
Number of scores above cutoff: 3917

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.

The scores below are sorted by initial score.
Significance is calculated based on initial score.
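The Significance column appears to be the number of standard deviations by which the initial score exceeds the mean score of the search. The sketch below assumes that definition (it is not stated in the printout) and uses the mean of 1 and standard deviation of 0.98 reported in the statistics; the function name `significance` is hypothetical.

```python
def significance(score: float, mean: float, stdev: float) -> float:
    """Standard deviations by which a score lies above the mean."""
    return (score - mean) / stdev


MEAN = 1.0    # mean score reported in SEARCH STATISTICS
STDEV = 0.98  # standard deviation reported in SEARCH STATISTICS

for initial_score in (5, 4):
    print(initial_score, round(significance(initial_score, MEAN, STDEV), 2))
```

With these inputs the sketch gives 4.08 for an initial score of 5 and 3.06 for an initial score of 4, matching the values printed for the PIR hits.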



9 100% similar sequences to the query sequence were found:

                                                    Init.  Opt.
Sequence Name   Description                  Length Score Score   Sig. Frame

 1. A26879   alpha-dextrin endo-1,6-alpha-   1096     5     5   4.08    0
 2. S29861   prismane protein - Desulfovib    553     5     5   4.08    0



CC native PLRV coat protein, only the DNA sequence.

CC The modified sequence may be used to transform plants of the Solanaceae

CC family (e.g. potatoes, tobacco, tomato, pepper) to obtain transgenic

CC plants resistant to PLRV.

SQ Sequence 208 AA;

SQ 10 A; 26 R; 10 N; 7 D; 0 B; 3 C; 12 Q; 6 E; 0 Z; 20 G; 3 H;

SQ 8 I; 9 L; 12 K; 3 M; 8 F; 11 P; 23 S; 11 T; 2 W; 5 Y; 19 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

RRRRRRGGNRRSRRTGVPRGRGSSETFVFTKDNLVGNSQGSFTFGPSVSDCPAFKDGILKAYHEYKITSILL 
50 60 70 80 90 X 100 110 

QFVSEASSTSPGSIAYELDPHCKVSSLQSYVNK
120 130 140 150 

> 0 < 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_3p.res made by on Thu 22 Sep 94 10:09:58-PDT.



Query sequence being compared: US-08-249-182-3 (1-5)
Number of sequences searched: 70848
Number of scores above cutoff: 3917

Results of the initial comparison of US-08-249-182-3 (1-5) with:
Data bank : PIR 41, all entries

[Histogram, upper part: vertical axis NUMBER OF SEQUENCES, ticks 100000, 50000, 10000, 5000, 1000, 500]



PT translation initiation codon in different frame than altered 

PT sequence 

PS Disclosure; Fig 4; 32pp; English. 

CC A synthetic modified PLRV coat protein gene was designed to

CC incorporate nucleotide changes to facilitate the expression of the

CC gene in plants. The modifications make the non-plant gene more

CC "plant-like".

CC Two translation initiation sites (beginning at nucleotides 26 and 32)

CC at the 17 kD ORF in a different reading frame than the native PLRV coat

CC protein are altered (ATG -> ACG). A stronger stop codon (beginning at

CC nucleotide 625) is also provided in the modified PLRV DNA sequence

CC (TAG -> TAA). In addition, a TAG codon is placed behind the TAA

CC termination codon to create a tandem translation stop signal.

CC The modified sequence may be used to transform plants of the Solanaceae

CC family (e.g. potatoes, tobacco, tomato, pepper) to obtain transgenic

CC plants resistant to PLRV.

SQ Sequence 208 AA;

SQ 10 A; 26 R; 10 N; 7 D; 0 B; 3 C; 12 Q; 6 E; 0 Z; 20 G; 3 H;

SQ 8 I; 9 L; 12 K; 3 M; 8 F; 11 P; 23 S; 11 T; 2 W; 5 Y; 19 V;



Initial Score = 4 Optimized Score = 4 Significance = 3.86 
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
YPAFK 
||||

RRRRRRGGNRRSRRTGVPRGRGSSETFVFTKDNLVGNSQGSFTFGPSVSDCPAFKDGILKAYHEYKITSILL 
50 60 70 80 90 X 100 110 



QFVSEASSTSPGSIAYELDPHCKVSSLQSYVNK
120 130 140 150 



15. US-08-249-182-3 (1-5)

R32917 Modified PLRV coat protein.

ID R32917 standard; Protein; 208 AA.

AC R32917;

DT 29-JUN-1993 (first entry)

DE Modified PLRV coat protein.

KW Virus; resistance; translation initiation site; stop codon;

KW Solanaceae; transgenic; plant; mutagenesis.

OS Potato leafroll virus isolate Prosser LR7. 

PN EP-531273-A. 

PD 10-MAR-1993. 

PF 02-SEP-1992; 870141. 

PR 03-SEP-1991; US-753738. 

PA (MONS ) MONSANTO CO.

PI Hemenway CL, Lawson EC, Tumer NE, Weiss JD;

DR WPI; 93-078751/10.

DR N-PSDB; Q37630.

PT Modified DNA sequence encoding potato leaf roll virus coat

PT protein - comprises the DNA sequence having at least one internal

PT translation initiation codon in different frame than altered

PT sequence

PS Disclosure; Fig 3; 32pp; English.

CC The native PLRV coat protein DNA sequence was used to obtain the

CC novel PLRV coat protein DNA sequence with improved virus resistance

CC characteristics. Two translation initiation sites (beginning at

CC nucleotides 26 and 32) at the 17 kD ORF in a different reading frame

CC than the native PLRV coat protein are altered (ATG -> ACG).

CC A stronger stop codon (beginning at nucleotide 625) is also provided

CC in the modified PLRV DNA sequence (TAG -> TAA).

CC The mutagenesis does not alter the amino acid sequence of the 



FT Region 181..184

FT /note= "tryptic fragment"

FT Region 189..197

FT /note= "tryptic fragment"

FT Region 202..208

FT /note= "tryptic fragment"

PN EP-531273-A. 

PD 10-MAR-1993. 

PF 02-SEP-1992; 870141. 

PR 03-SEP-1991; US-753738. 

PA (MONS ) MONSANTO CO.

PI Hemenway CL, Lawson EC, Tumer NE, Weiss JD;

DR WPI; 93-078751/10.

DR N-PSDB; Q37629.

PT Modified DNA sequence encoding potato leaf roll virus coat

PT protein - comprises the DNA sequence having at least one internal

PT translation initiation codon in different frame than altered

PT sequence

PS Disclosure; Fig 1; 32pp; English.

CC The native PLRV coat protein DNA sequence was used to obtain the

CC novel PLRV coat protein DNA sequence with improved virus resistance

CC characteristics. Two translation initiation sites (beginning at

CC nucleotides 26 and 32) at the 17 kD ORF in a different reading frame

CC than the native PLRV coat protein are pref. altered. A stronger stop

CC codon is also provided in the modified PLRV DNA sequence, i.e. TAA TAG.

CC The modified sequence may be used to transform plants of the Solanaceae

CC family (e.g. potatoes, tobacco, tomato, pepper) to obtain transgenic

CC plants resistant to PLRV.

SQ Sequence 208 AA;

SQ 10 A; 26 R; 10 N; 7 D; 0 B; 3 C; 12 Q; 6 E; 0 Z; 20 G; 3 H;

SQ 8 I; 9 L; 12 K; 3 M; 8 F; 11 P; 23 S; 11 T; 2 W; 5 Y; 19 V;
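The SQ composition lines give a convenient consistency check on a record: the residue counts should add up to the stated sequence length. The sketch below assumes the composition of this 208 AA record with the OCR-damaged letters read as standard one-letter amino acid codes; the helper `parse_composition` is hypothetical and written only for this illustration.

```python
import re


def parse_composition(lines):
    """Return {residue: count} parsed from 'SQ 10 A; 26 R; ...' lines."""
    counts = {}
    for line in lines:
        # Each item is a count, whitespace, a one-letter code, then ';'.
        for number, residue in re.findall(r"(\d+)\s+([A-Z])\s*;", line):
            counts[residue] = int(number)
    return counts


sq_lines = [
    "SQ 10 A; 26 R; 10 N; 7 D; 0 B; 3 C; 12 Q; 6 E; 0 Z; 20 G; 3 H;",
    "SQ 8 I; 9 L; 12 K; 3 M; 8 F; 11 P; 23 S; 11 T; 2 W; 5 Y; 19 V;",
]

composition = parse_composition(sq_lines)
print(sum(composition.values()))  # total residues, to compare with "208 AA"
```

Here the counts total 208, which agrees with the "Sequence 208 AA" line and supports the reading of the damaged letters.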

Initial Score = 4 Optimized Score = 4 Significance = 3.86 
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

RRRRRRGGNRRSRRTGVPRGRGSSETFVFTKDNLVGNSQGSFTFGPSVSDCPAFKDGILKAYHEYKITSILL
50 60 70 80 90 X 100 110 

QFVSEASSTSPGSIAYELDPHCKVSSLQSYVNK
120 130 140 150 



14. US-08-249-182-3 (1-5) 

R32918 Synthetic modified PLRV coat protein.

ID R32918 standard; Protein; 208 AA.

AC R32918;

DT 29-JUN-1993 (first entry)

DE Synthetic modified PLRV coat protein.

KW Virus; resistance; translation initiation site; stop codon;

KW Solanaceae; transgenic; plant. 

OS Synthetic. 

PN EP-531273-A. 

PD 10-MAR-1993. 

PF 02-SEP-1992; 870141. 

PR 03-SEP-1991; US-753738. 

PA (MONS ) MONSANTO CO. 

PI Hemenway CL, Lawson EC, Tumer NE, Weiss JD;

DR WPI; 93-078751/10.

DR N-PSDB; Q37631.

PT Modified DNA sequence encoding potato leaf roll virus coat 



100 110 120 



12. US-08-249-182-3 (1-5)

R44505 PLRV integument protein. 

ID R44505 standard; Protein; 208 AA. 

AC R44505;

DT 21-JUN-1994 (first entry)

DE PLRV integument protein.

KW Primer; isolation; potato leaf roll virus; PLRV; integument protein;

KW transformation; plasmid; potato; electroporation; agrobacterium; 

KW microinjection; redifferentiation; resistance. 

OS Potato leaf roll virus. 

PN J05304847-A. 

PD 19-NOV-1993.

PF 30-APR-1992; 135561.

PR 30-APR-1992; JP-135561.

PA (HOKK-) HOKKAIDO GURIINBAIO KENKYUSHO KK.

DR WPI; 93-408237/51.

DR N-PSDB; Q53460.

PT Potatoes resistant against potato leaf roll virus - comprises 

PT transforming potato cells using recombinant vector combined DNA 

PS Disclosure; Page 7; 8pp; Japanese. 

CC This sequence is encoded by the potato leaf roll virus (PLRV) 

CC integument protein gene. The PLRV integument protein gene may be 

CC transformed into a microorganism which is then cultured. The cultured 

CC plasmid may be introduced into a potato cell by electroporation,

CC agrobacterium transformation or by microinjection. Expression of the 

CC integument protein causes redifferentiation of transformed cells. This 

CC is used in the production of potatoes resistant to PLRV. 

SQ Sequence 208 AA;

SQ 10 A; 26 R; 10 N; 7 D; 0 B; 3 C; 12 Q; 6 E; 0 Z; 20 G; 3 H;

SQ 8 I; 10 L; 12 K; 3 M; 8 F; 9 P; 24 S; 12 T; 2 W; 5 Y; 18 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86 
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

RRRRRRGGNRRSRRTGVPRGRGSSETFVFTKDNLVGNSQGSFTFGPSLSDCPAFKDGILKAYHEYKITSILL
50 60 70 80 90 X 100 110 

QFVSEASSTSSGSIAYELDPHCKVSSLQSYVNK
120 130 140 150 



13. US-08-249-182-3 (1-5) 

R32916 Native PLRV coat protein. 

ID R32916 standard; Protein; 208 AA. 

AC R32916; 

DT 29-JUN-1993 (first entry) 

DE Native PLRV coat protein. 

KW Virus; resistance; translation initiation site; stop codon;

KW Solanaceae; transgenic; plant.

OS Potato leafroll virus isolate Prosser LR7.

FH Key Location/Qualifiers

FT Region 32..42

FT /note= "tryptic fragment"

FT Region 77..91

FT /note= "tryptic fragment"

FT Region 141..151



PR 04-NOV-1987; US-117099.

PA (CALB-) California Biotechn.

PI Benson BJ, White RT, Schilling JW, Buckley DI, Scarborough RM;

DR WPI; 89-145617/22.

DR N-PSDB; N90107.

PT Human SP-18 and SP-5 derived peptide(s) -

PT with alveolar surfactant protein activity, used for treating

PT respiratory distress syndrome, pneumonia and bronchitis

PS Disclosure; Fig 9; .pp; English.

CC ASP proteins including the hSP-18- and hSP-5-derived peptides can be

CC used as a carrier or vehicle for delivery of other active and important

CC molecules to and/or through the lung to the blood vasculature.

SQ Sequence 187 AA;

SQ 11 A; 5 R; 7 N; 7 D; 0 B; 5 C; 8 Q; 11 E; 0 Z; 7 G; 11 H;

SQ 12 I; 14 L; 11 K; 6 M; 15 F; 7 P; 8 S; 10 T; 3 W; 8 Y; 21 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

TGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAM
10 20 30 40 50 X 60 70 

KDGELVIWDSVHPCYTVFHEQTETFSSLWSEYH
80 90 100 110 



11. US-08-249-182-3 (1-5)

P90731 Limulus polyphemus coagulogen.



ID P90731 standard; protein; 195 AA.

AC P90731;

DT 13-OCT-1989 (first entry)

DE Limulus polyphemus coagulogen.

KW Coagulogen; Limulus polyphemus; haematocytes; coagulation.

OS Limulus polyphemus.

PN J01160484-A.

PD 23-JUN-1989.

PF 17-DEC-1987; 317425.

PR 17-DEC-1987; JP-317425.

PA (GREC) Green Cross Corp.

DR WPI; 89-224282/31.

PT Coagulogen cDNA

PT - which originates from Limulus polyphemus and is present mainly

PT in haematocytes.

PS Claim 1; Fig 1; 15pp; Japanese.

CC Coagulogen from Limulus polyphemus (see corresp. N90363). Is present

CC mainly in haematocytes, and is involved in body humor coagulation.

SQ Sequence 195 AA;

SQ 9 A; 13 R; 5 N; 6 D; 0 B; 16 C; 8 Q; 15 E; 0 Z; 14 G; 4 H;

SQ 9 I; 11 L; 11 K; 1 M; 13 F; 10 P; 13 S; 14 T; 1 W; 5 Y; 17 V;



Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 



X X 
YPAFK 
||||

NVPTCLCEEPTLLGRKVIVSQETKDKIEEAVQAITDKDEISGRGFSIFGGHPAFKECGKYECRTVTSEDSRC 
30 40 50 60 70 X 80 90 



ID R15612 standard; Protein; 187 AA. 

AC R15612; 

DT 16-MAR-1992 (first entry)

DE SP-C from pC149SP-C insert.

KW Alveolar surfactant protein; CAT; fusion protein; SP-5.

OS Homo sapiens.

FH Key Location/Qualifiers

FT Protein 1..149

FT /label= CAT

FT Region 150..152

FT /label= linker

FT Protein 152..187

FT /label= SP-C

PN W09118015-A.

PD 28-NOV-1991.

PF 17-MAY-1991; U03490.

PR 17-MAY-1990; US-524360.

PA (CALB-) CALIF BIOTECHN INC.

PI Benson BJ, White RT, Schilling JW, Buckley DI, Scarborough RM;

DR WPI; 91-369185/50.

DR N-PSDB; Q15266.

PT New alveolar surfactant protein analogues - used for treating

PT respiratory distress syndrome, pneumonia and bronchitis

PS Disclosure; Fig 9; 67pp; English.

CC An amino acid sequence of a 187 residue fusion protein encoded

CC by pC149SP-C is a slight modification of the sequence shown in

CC Q15265. In plasmid pC149SP-C, the 149 amino acids of CAT are

CC joined to 35 amino acids of SP-5 through a linker of 3 amino acids.

CC The SP-5 comprises 18. 11 of the total fusion. 

CC To construct pC149SP-C, a portion of the CAT segment of pC210SP-C

CC extending from the DdeI site at nucleotide 523 (Q15265) to the

CC EcoRI site at nucleotide 728 was removed and replaced by a set of

CC two complementary oligonucleotides.

CC See also R15602-7, Q15262-63 and Q15265-66.

SQ Sequence 187 AA;

SQ 11 A; 5 R; 7 N; 7 D; 0 B; 5 C; 8 Q; 11 E; 0 Z; 7 G; 11 H;

SQ 12 I; 15 L; 11 K; 6 M; 15 F; 7 P; 8 S; 10 T; 3 W; 8 Y; 20 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86 
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

TGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAM
10 20 30 40 50 X 60 70 

KDGELVIWDSVHPCYTVFHEQTETFSSLWSEYH
80 90 100 110 



10. US-08-249-182-3 (1-5) 

P98407 Sequence of surfactant protein CAT-SP-5 fusion pro 

ID P98407 standard; protein; 187 AA. 
AC P98407; 

DT 19-JAN-1991 (first entry) 

DE Sequence of surfactant protein CAT-SP-5 fusion protein encoded by 
DE pC149SP-C 

KW Alveolar surfactant protein; respiratory distress syndrome; 
KW pneumonia; bronchitis; therapy.
OS Homo sapiens. 
PN W08904326-A. 
PD 18-MAY-1989. 



Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
|| ||

SLTKPRPVILDPADPTGNLGGGDPKGWRQLAQEAEAWLNYPCFKNWDGSPVSSWILLAESNSTDDETDDPRT
10 20 30 40 X SO 60 70 

YQKYGYIGTHEYPHFSHRPSTL 
80 90 



8. US-08-249-182-3 (1-5)

R05445 CAT-A4-751i hybrid protein.

ID R05445 standard; protein; 132 AA.

AC R05445;

DT 30-JUL-1990 (first entry)

DE CAT-A4-751i hybrid protein.

KW CAT; hybrid protein; A4-751i protein.

OS Synthetic

FH Key Location/Qualifiers

FT Peptide 1..76

FT /label=CAT

FT Peptide 77..132

FT /label=A4-751i

PN W09001540-A.

PD 22-FEB-1990.

PF 09-AUG-1989; U03417.

PR 11-AUG-1988; US-231224.

PA (CALB-) Calif Biotechn Inc.

PI Hilliker S, White R;

DR WPI; 90-083499/11.

DR N-PSDB; Q03563.

PT Heterologous protein expression on prokaryotic host -

PT using 3' truncated chloramphenicol acetyl transferase gene to

PT stably express hybrid protein.

PS Example; Fig 4A; 67pp; English.

CC The N-terminus of the CAT protein joined to the synthetic A4-751i gene

CC product preceded by a chemical cleavage site (hydroxylamine)

CC encoded by Asn-Gly.

CC See also 003557 to 805366! and 804767. 

SQ Sequence 132 AA;

SQ 11 A; 5 R; 7 N; 4 D; 0 B; 7 C; 6 Q; 10 E; 0 Z; 9 G; 5 H;

SQ 7 I; 4 L; 9 K; 4 M; 10 F; 4 P; 5 S; 10 T; 2 W; 6 Y; 7 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0 

X X 
YPAFK 
||||

TGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFNGEV
10 20 30 40 50 X 60 70 

CSEQAETGPCRAMISRWYFDVTEGKCAPFFYGG
80 90 100 110 



9. US-08-249-182-3 (1-5) 

R15612 SP-C from pC149SP-C insert.



PA (YEDA ) YEDA RES & DEV CO LTD.

PI Revel M, Chebath J;

DR WPI; 87-095196/14.

DR N-PSDB; N70146.

PT Recombinant enzyme having (2'-5') oligo A synthetase activity -

PT used for monitoring the response of a patient to an interferon

PS Example; Fig IB; 90pp; English.

CC A partial cDNA clone (E1) for the OAS mRNA from human SV80 cells was

CC first obtd. through its ability to select by hybridisation a mRNA

CC producing OAS activity upon translation in Xenopus laevis oocytes.

CC The E1 cDNA insert hybridises to 3 RNA species of 1.6, 1.8 and 3.6 kb

CC which are coinduced by IFN in SV80 cells. cDNA clones for the 1.6

CC and 1.8 RNAs have been isolated and sequenced.

SQ Sequence 110 AA;

SQ 7 A; 6 R; 5 N; 5 D; 0 B; 2 C; 5 Q; 5 E; 0 Z; 6 G; 1 H;

SQ 7 I; 13 L; 6 K; 0 M; 4 F; 13 P; 5 S; 4 T; 5 W; 6 Y; 5 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86 
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X 
YPAFK 
|| ||

KNPIIEKYLRRQLTKPRPVILDPADPTGNLGGGDPKGWRQLAQEAEAWLNYPCFKNWDGSPVSSWILLVRPP
30 40 50 60 70 X 80 90 

ASSLPFIPAPLHEA 
100 110 



7. US-08-249-182-3 (1-5) 

P71705 Partial (2'-5') oligo A synthetase sequence.

ID P71705 standard; Protein; 111 AA.

AC P71705;

DT 18-APR-1991 (first entry)

DE Partial (2'-5') oligo A synthetase sequence.

KW In vivo interferon assay; OAS.

OS Homo sapiens.

FH Key Location/Qualifiers

FT Modified_site 61

FT /label= glycosylated Asn 

PN EP-217102-A. 

PD 08-APR-1987. 

PF 21-AUG-1986; 111585.

PR 28-AUG-1985; IL-076233.

PR 08-APR-1986; IL-078445.

PA (YEDA ) YEDA RES & DEV CO LTD.

PI Revel M, Chebath J;

DR WPI; 87-095196/14.

DR N-PSDB; N71375. 

PT Recombinant enzyme having (2'-5') oligo A synthetase activity - 

PT used for monitoring the response of a patient to an interferon 

PS Claim 3; Fig 7B; 90pp; English. 

CC A partial cDNA clone (El) for the OAS mRNA from human SV80 cells was 

CC first obtd. through its ability to select by hybridisation a mRNA 

CC producing OAS activity upon translation in Xenopus laevis oocytes. 

CC The El cDNA insert hybridises to 3 RNA species of 1.6, 1.8 and 3.6 kb

CC which are coinduced by IFN in SV80 cells. cDNA clones for the 1.6

CC and 1.8 RNAs have been isolated and sequenced. This amino acid 

CC sequence is deduced from the 1.8E clone which comprises the last 5 

CC codons of exon 6 and all of exons 7a and 8. 

SQ Sequence 111 AA;

SQ 8 A; 4 R; 4 N; 9 D; 0 B; 2 C; 6 Q; 7 E; 0 Z; 8 G; 3 H;



10 20 30 40 50 X 60 70 



5. US-08-249-182-3 (1-5) 

R05446 CAT-GLP-1 hybrid protein. 

ID R05446 standard; protein; 104 AA. 

AC R05446; 

DT 30-JUL-1990 (first entry) 

DE CAT-GLP-1 hybrid protein. 

KW CAT; hybrid protein; GLP-1 protein.

OS Synthetic 

FH Key Location/Qualifiers 

FT Peptide 1..64 

FT /label=CAT 

FT Peptide 65..105

FT /label=GLP-1 (7-37)

PN WO9001540-A.

PD 22-FEB-1990.

PF 09-AUG-1989; U03417.

PR 11-AUG-1988; US-231224.

PA (CALB-) Calif Biotechn Inc. 

PI Hilliker S, White R; 

DR WPI; 90-083499/11. 

DR N-PSDB; Q04767.

PT Heterologous protein expression on prokaryotic host -

PT using 3' truncated chloramphenicol acetyl transferase gene to

PT stably express hybrid protein.

PS Example; Fig 4B; 67pp; English.

CC First 73 amino acids of the CAT protein joined in-frame to

CC the synthetic GLP-1 gene product preceded by a Met codon.

CC (Cyanogen bromide cleavage site.)

CC See also Q03557 to Q05366.

SQ Sequence 104 AA; 

SQ 10 A; 3 R; 3 N; 3 D; 0 B; 1 C; 6 Q; 7 E; 0 Z; 4 G; 6 H;

SQ 6 I; 6 L; 10 K; 3 M; 8 F; 2 P; 5 S; 9 T; 2 W; 4 Y; 6 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
||||

TGYTTVDISQUHRKEHFEAFSSVA9CTYNSTVQLDITAFLKTVKKNKHKFYPAFIHILARLHNAHPEFNHAE 
10 20 30 40 50 X 60 70 

GTFTSDVSSYLEGSAAKEFIAHLVKGR 
80 90 100 



6. US-08-249-182-3 (1-5) 

P70092 Sequence encoded by (2'-5') oligo A synthetase (OA

ID P70092 standard; Protein; 110 AA.

AC P70092;

DT 18-APR-1991 (first entry)

DE Sequence encoded by (2'-5') oligo A synthetase (OAS) El cDNA

DE clone 174-3.

KW In vivo interferon assay.

OS Homo sapiens.

PN EP-217102-A. 

PD 08-APR-1987. 

PF 21-AUG-1986; 111585. 

PR 28-AUG-1985; IL-076233. 



CC tolerance. The peptides have mol.wts. of 5 kD, 9 kD, 11 kD, 22 kD,

CC 24 kD, 30 kD, 36 kD, 60 kD and 68 kD. The N-terminals of the 30 kD,

CC 11 kD and 9 kD peptides are given in R31179-81 respectively.

SQ Sequence 20 AA;

SQ 3 A; 0 R; 1 N; 0 D; 0 B; 1 C; 1 Q; 0 E; 0 Z; 3 G; 0 H;

SQ 2 I; 1 L; 0 K; 0 M; 2 F; 4 P; 0 S; 0 T; 0 W; 1 Y; 1 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
||||

AIFCGGVNPALGPPIYPAFG 
10 X 20 



4. US-08-249-182-3 (1-5) 

R32366 CAT 1-73 peptide. 



ID R32366 standard; Protein; 73 AA. 

AC R32366;

DT 18-JUN-1993 (first entry) 

DE CAT 1-73 peptide. 

KW Human; proinsulin; vector; pUC19; pPINS; CAT; pUC-CAT-proinsulin;

KW insulin analogue; type I; type II; diabetes. 

OS Synthetic. 

PN WO9303174-A.

PD 18-FEB-1993.

PF 31-JUL-1992; U06451.

PR 08-AUG-1991; US-741938.

PR 30-JUL-1992; US-918953.

PA (PFIZ ) PFIZER INC.

PA (SCIO-) SCIOS INC.

PI Andy RJ, Larson ER;

DR WPI; 93-076530/09.

DR N-PSDB; Q37002.

PT New hepato selective and peripheral selective human insulin

PT analogues - and their corresp. DNA, for treatment of type I and

PT type II diabetes 

PS Disclosure; Fig 2a; 58pp; English. 

CC The sequence given represents amino acids 1-73 of CAT. This sequence 

CC was used in the construction of proinsulin analogues. The sequences

CC in Q36996-7001 are oligonucleotides which can be combined to form a

CC gene which codes for human proinsulin. The resulting cDNA coding for

CC proinsulin was inserted into plasmid vector pUC19 and digested with

CC KpnI and HindIII. This resulted in the formation of the vector pPINS.

CC This CAT fragment was inserted into pPINS to give a plasmid which

CC contained DNA sequences which coded for amino acids 1-73 of CAT, an 8

CC amino acid linker sequence and human proinsulin (see Q37003). This

CC plasmid, pUC-CAT-proinsulin, could be used in the formation of insulin

CC analogues which may be used in the treatment of types I and II

CC diabetes.

SQ Sequence 73 AA;

SQ 6 A; 2 R; 3 N; 2 D; 0 B; 1 C; 5 Q; 4 E; 0 Z; 1 G; 5 H;

SQ 5 I; 4 L; 8 K; 2 M; 6 F; 2 P; 2 S; 7 T; 1 W; 3 Y; 4 V;

Initial Score = 4 Optimized Score = 4 Significance = 3.86
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
||||



2. US-08-249-182-3 (1-5)

P82507 Pullulanase protein.

ID P82507 standard; protein; 1096 AA.

AC P82507;

DT 01-NOV-1990 (first entry)

DE Pullulanase protein.

KW Pullulanase; starch; alcohol prodn.

OS Klebsiella aerogenes.

PN J63245676-A.

PD 12-OCT-1988.

PF 31-MAR-1987; 078355.

PR 31-MAR-1987; JP-078355.

PA (ELED) Denki Kagaku Kogyo KK; (SUNR) Suntory Ltd. 

DR WPI; 88-333488/47. 

DR N-PSDB; N81341. 

PT Gene encoding pullulanase - derived from recombinant plasmid pHPl contg.

PT gene from Klebsiella genus.

PS Disclosure; p; Japanese.

CC The pullulanase protein cleaves alpha-1,6-glucoside bonds of starch and

CC is effective in decomposition of branched starch. It is used in the

CC prodn. of maltose and glucose from starch, and of alcohol from starch

CC via glucose. Amino acid residues 1-19 can be deleted.

SQ Sequence 1096 AA;

SQ 108 A; 63 R; 42 N; 93 D; 0 B; 10 C; 51 Q; 42 E; 0 Z; 89 G; 22 H;

SQ 38 I; 86 L; 42 K; 21 M; 31 F; 52 P; 96 S; 70 T; 18 W; 41 Y; 81 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.82
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||

GENKPIVRLYYSHSSKVAADSNGEFSDKYVKLTPTTVN88VSMRFPHLASYPAFKLPDDVNVDELLQGDDGG 
210 220 230 240 250 260 X 270 280 

IAESDGILSLSHPGADRRRAGRYLCRRAEALSY 
290 300 310 



3. US-08-249-182-3 (1-5) 

R31181 N-terminal of 9 kD antifreeze polypeptide.

ID R31181 standard; peptide; 20 AA.

AC R31181;

DT 14-MAY-1993 (first entry)

DE N-terminal of 9 kD antifreeze polypeptide.

KW Frost tolerance; plant; ice crystal; cryopreservation; winter rye.

OS Secale cereale cv. Musketeer.

PN WO9222581-A.

PD 23-DEC-1992. 

PF 12-JUN-1992; CA0255. 

PR 13-JUN-1991; GB-012774. 

PR 13-DEC-1991; GB-026485. 

PA (UYHA-) UNIV WATERLOO. 

PI Griffith N; 

DR WPI; 93-018083/02. 

PT Polypeptide(s) responsible for frost tolerance in plants - used 

PT in cryo preservation of biological tissues and for improving

PT quality of frozen foods

PS Disclosure; Page 25; 52pp; English.

CC Antifreeze polypeptides common to frost tolerant plants were

CC isolated. The peptides are located in extracellular spaces of plant

CC cells to control ice crystal growth in the intercellular plant space,



Similarity matrix      Unitary     K-tuple            2
Mismatch penalty       1           Joining penalty    20
Gap penalty            1.00        Window size        5
Gap size penalty       0.05
Cutoff score           0
Randomization group    0

Initial scores to save     40      Alignments to save    15
Optimized scores to save    0      Display context       50




SEARCH STATISTICS

Scores:       Mean           Median         Standard Deviation
              0              1              1.04

Times:        CPU            Total Elapsed
              00:00:24.91    00:00:25.00

Number of residues:              5287517
Number of sequences searched:    42145
Number of scores above cutoff:   4652





Cut-off raised to 2. 
Cut-off raised to 3. 

The scores below are sorted by initial score.
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was found:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

 1. R37445  Autotaxin peptide ATX 20.           5     5     5   4.82     0

A 100% similar sequence to the query sequence was found:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

 2. P82507  Pullulanase protein.             1096     5     5   4.82     0

The list of other best scores is:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

            **** 3 standard deviations above mean ****

 3. R31181  N-terminal of 9 kD antifreeze      20     4     4   3.86     0
 4. R32366  CAT 1-73 peptide.                  73     4     4   3.86     0
 5. R05446  CAT-GLP-1 hybrid protein.         104     4     4   3.86     0
 6. P70092  Sequence encoded by (2'-5') o     110     4     4   3.86     0
 7. P71705  Partial (2'-5') oligo A synth     111     4     4   3.86     0
 8. R05445  CAT-A4-75H hybrid protein.        132     4     4   3.86     0
 9. R15612  SP-C from pC149SP-C insert.       187     4     4   3.86     0
10. P98407  Sequence of surfactant protei     187     4     4   3.86     0
11. P90731  Limulus polyphemus coagulogen     195     4     4   3.86     0
12. R4450S  PLRV integument protein.          208     4     4   3.86     0
13. R32916  Native PLRV coat protein.         208     4     4   3.86     0
14. R32918  Synthetic modified PLRV coat      208     4     4   3.86     0
15. R32917  Modified PLRV coat protein.       208     4     4   3.86     0
16. R13586  PLRV capsid protein.              208     4     4   3.86     0
17. R05101  Potato leaf roll virus 23K co     208     4     4   3.86     0
18. R0542S  Amino acid sequence for a CA      240     4     4   3.86     0
19. P92070  Sequence of chloramphenicol a     241     4     4   3.86     0

RA HAY J.M., JONES M.C., BLAKEBROUGH M.L., DASGUPTA I., DAVIES J.W.,

RA HULL R.;

RL NUCLEIC ACIDS RES. 19:2615-2621(1991). 

DR EMBL; X57924; RTBVPHIL. 

DR PIR; S16667; S16667. 

KW HYPOTHETICAL PROTEIN.

SQ SEQUENCE 110 AA; 11910 MW; 64722 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0



X X 
YPAFK 



MSADVPTFKEALEKFKNLESDTAGKDKFNUVFTLENIKSAADVNLASKGLVQLYALQEI 
X 10 20 30 40 50 



15. US-08-249-182-3 (1-5) 

YP12_RTBV HYPOTHETICAL P12 PROTEIN (ORF 2).

ID YP12_RTBV STANDARD; PRT; 110 AA.

AC P27529;

DT 01-AUG-1992 (REL. 23, CREATED)

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE) 

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE) 

DE HYPOTHETICAL P12 PROTEIN (ORF 2). 

OS RICE TUNGRO BACILLIFORM VIRUS (RTBV).

OC VIRIDAE; NOT YET CLASSIFIED.

RN [1]

RP SEQUENCE FROM N.A.

RN 92024093

RA QU R.D., BHATTACHARYYA M., LACO G.S., DE KOCHKO A., RAO B.L.,

RA KANIEWSKA M.B., ELMER J.S., ROCHESTER D.E., SMITH C.E.,

RA BEACHY R.N.;

RL VIROLOGY 185:354-364(1991).

DR EMBL; M65026; LERTUORFS.

DR PIR; B40785; B40785.

KW HYPOTHETICAL PROTEIN.

SQ SEQUENCE 110 AA; 11924 MW; 64761 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|| ||

MSADYPTFKEALEKFKNLESDTAGKDKFNHVFTLENIKTAADVNLASKGLVQLYALQEI
X 10 20 30 40 50

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_4a.res made by on Thu 22 Sep 94 10:12:35-PDT.

Query sequence being compared: US-08-249-182-4 (1-5)
Number of sequences searched: 42145
Number of scores above cutoff: 3812

Results of the initial comparison of US-08-249-182-4 (1-5) with:



uaia oam ; H-ueneseq n> an entries 



[Histogram: NUMBER OF SEQUENCES (axis marked 0 to 100000) against SCORE (0 to 5) and STDEV (1 to 4)]



PARAMETERS 



Similarity matrix      Unitary     K-tuple            2
Mismatch penalty       1           Joining penalty    20
Gap penalty            1.00        Window size        5
Gap size penalty       0.05
Cutoff score           0
Randomization group    0

Initial scores to save     40      Alignments to save    15
Optimized scores to save    0      Display context       50



SEARCH STATISTICS 



Scores:       Mean           Median         Standard Deviation
              0              1              1.12

Times:        CPU            Total Elapsed
              00:00:26.98    00:00:29.00

Number of residues:              5287517
Number of sequences searched:    42145
Number of scores above cutoff:   3812

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
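
The significance values in this report look consistent with a plain z-score: the initial score minus the mean score, divided by the standard deviation. That reading is an assumption drawn from the printed statistics, not a statement of FastDB's documented formula. A minimal Python sketch under that assumption, using the rounded mean (0) and standard deviation (1.12) printed for this search; the function name `significance` is illustrative only:

```python
def significance(score: float, mean: float, std_dev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / std_dev


# Rounded statistics as printed in the SEARCH STATISTICS block of this search.
MEAN = 0.0
STD_DEV = 1.12

for initial_score in (4, 5):
    z = significance(initial_score, MEAN, STD_DEV)
    print(f"initial score {initial_score}: {z:.2f} standard deviations above the mean")
```

With the rounded inputs this gives values within a few hundredths of the reported 3.58 and 4.48; the small gap would be expected if the program computed significance from unrounded statistics.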

A 100% identical sequence to the query sequence was found:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

 1. R37446  Autotaxin peptide ATX 24.           5     5     5   4.48     0

5 100% similar sequences to the query sequence were found:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

 2. R42848  VIP receptor protein.             459     5     5   4.48     0
 3. R26150  A5B7 gH-2 antibody grafted he     146     5     5   4.48     0
 4. R20793  CDR-grafted, humanised heavy      146     5     5   4.48     0
 5. R41469  MAb 25D2 humanised heavy chai     140     5     5   4.48     0
 6. R40179  Humanised antibody CMX5-3 hea     135     5     5   4.48     0

The list of other best scores is:

                                                  Init.  Opt.
Sequence Name Description                  Length Score Score   Sig. Frame

            **** 3 standard deviations above mean ****

 7. R41113  HCV peptide XIV or HCV8 (aa 1      22     4     4   3.58     0
 8. R41186  HCV NS4 protein HCV7/8.            26     4     4   3.58     0
 9. P90810  Synthetic HTLV-1 peptide anti      26     4     4   3.58     0
10. R40053  Hib OMP P1 peptide HIBP1-1 (1      30     4     4   3.58     0
11. R13186  Peptide VII immunoreactive wi      38     4     4   3.58     0
12. R06408  HTLV-1 corresponding peptide       38     4     4   3.58     0
13. R20772  Peptide V based on immunoreac      40     4     4   3.58     0
14. R13185  Peptide (V) immunoreactive wi      40     4     4   3.58     0
15. R13591  HTLV-I env precursor epitope       40     4     4   3.58     0
16. R33871  Polypeptide p1684 comprising       67     4     4   3.58     0
17. R13345  P1684 HCV antigen (1684-1750)      67     4     4   3.58     0
18. R10026  ompA Signal peptide used to c      78     4     4   3.58     0
19. R13557  HCV C-100 protein immunodomin      90     4     4   3.58     0
20. R13962  Putidaredoxin.                    107     4     4   3.58     0
21. R33872  Polypeptide p1689 comprising      117     4     4   3.58     0
22. P92019  Sequence of the polypeptide e     117     4     4   3.58     0
23. P90136  Sequence of hepatitis C virus     117     4     4   3.58     0
24. R13354  P1689 HCV antigen (1689-1805)     118     4     4   3.58     0
25. P92018  Sequence of the polypeptide e     128     4     4   3.58     0
26.         Sequence of hepatitis C virus             4     4   3.58     0
27. R25113  Non-A, Non-B Hepatitis Virus      134     4     4   3.58     0
28. R38093  nodC N-terminal portion.          153     4     4   3.58     0
29. P93319  Amino acid sequence of swine      193     4     4   3.58     0
30. P93318  Amino acid sequence of swine      193     4     4   3.58     0
31. R10650  Adenylate kinase.                 194     4     4   3.58     0
32. P90745  Recombinant human cardiac myo     195     4     4   3.58     0
33. P91391  Human ventricular myosin ligh     195     4     4   3.58     0
34. R43885  Consensus sequence of C-termi     199     4     4   3.58     0
35. R43879  C-terminal portion of FIPV sp     200     4     4   3.58     0
36. R13897  Nitrite hydratase alpha subun     207     4     4   3.58     0
37. R11717  ENV93/HTLV-1-II fusion protei     217     4     4   3.58     0
38. R33714  A29379.                           219     4     4   3.58     0
39. R10102  gp46/p21E env fusion protein      219     4     4   3.58     0
40. P71662  Sequence encoded by adult T-c     228     4     4   3.58     0



1. US-08-249-182-4 (1-5) 

R37446 Autotaxin peptide ATX 24. 

ID R37446 standard; peptide; 5 AA. 

AC R37446; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 24. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke K.

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer

PT diagnosis and therapy

PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 24. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer

CC therapy.

SQ Sequence 5 AA;

SQ 1 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 0 I; 0 L; 0 K; 0 M; 0 F; 0 P; 1 S; 0 T; 0 W; 0 Y; 1 V;
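
The two SQ composition lines in each record list residue counts in a fixed order (A R N D B C Q E Z G H, then I L K M F P S T W Y V), and the counts add up to the stated sequence length. As a hedged illustration, the Python sketch below recomputes such lines from a sequence string. The helper names `composition` and `format_sq_lines` are hypothetical, and the "Others" line for unlisted symbols such as X is modelled on how some records in this report print it, not on any documented FastDB or GeneSeq routine.

```python
from collections import Counter

# Residue order as it appears on the SQ composition lines of the report.
RESIDUE_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence: str) -> dict:
    """Count each listed residue code in the sequence (zero if absent)."""
    counts = Counter(sequence.upper())
    return {residue: counts.get(residue, 0) for residue in RESIDUE_ORDER}


def format_sq_lines(sequence: str) -> list:
    """Render the counts as two SQ lines, plus an 'Others' line if needed."""
    comp = composition(sequence)
    items = [f"{comp[residue]} {residue};" for residue in RESIDUE_ORDER]
    lines = ["SQ " + " ".join(items[:11]), "SQ " + " ".join(items[11:])]
    others = len(sequence) - sum(comp.values())
    if others:
        lines.append(f"SQ {others} Others;")
    return lines


for line in format_sq_lines("QAEVS"):
    print(line)
```

A check of this kind is also a convenient way to validate a transcribed composition line: if the counts do not sum to the length given on the `SQ Sequence n AA;` line, a digit or residue letter has probably been misread.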



Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
QAEVS 
X X 



2. US-08-249-182-4 (1-5) 

R42848 VIP receptor protein. 



ID R42848 standard; Protein; 459 AA. 
AC R42848;



u\ iij-nHf-iY7*t wirsir entry; 

DE VIP receptor protein. 

KW vasoactive intestinal polypeptide receptor; VIP; rat; binding; 

KW adenylate cyclase activity; stimulus. 

OS Rattus rattus. 

PN J05255394-A. 

PD 05-OCT-1993.

PF 13-FEB-1992; 026607.

PR 13-FEB-1992; JP-026607.

PA (OSAB-) ZH OSAKA BIOSCIENCE KENKYUSHO.

DR WPI; 93-348480/44.

DR N-PSDB; Q50349.

PT Vasoactive intestinal polypeptide - prepd. in large amt. by

PT culturing microbe transformed by new DNA coding polypeptide

PS Claim 2; Page 6; 14pp; Japanese.

CC The sequence can be used to produce large amounts of the VIP

CC receptor peptide, by culturing a microorganism transformed by

CC the sequence.

SQ Sequence 459 AA;

SQ 32 A; 22 R; 14 N; 11 D; 0 B; 17 C; 22 Q; 20 E; 0 Z; 27 G; 11 H;

SQ 31 I; 52 L; 15 K; 10 M; 30 F; 21 P; 40 S; 20 T; 15 W; 16 Y; 33 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

RKHRRUHLflGVLGWSSKSGHPMGGSNGATCSTQVSMLTRVSPSARRSSSFSAEVSLV 
410 420 430 440 450 X X 



3. US-08-249-182-4 (1-5) 

R26150 A5B7 gH-2 antibody grafted heavy chain. 



ID R26150 standard; Protein; 146 AA.

AC R26150;

DT 03-FEB-1993 (first entry)

DE A5B7 gH-2 antibody grafted heavy chain.

KW humanised antibody; chimaeric; carcino-embryonic antigen; therapy;

KW diagnosis; carcinoids; CDR; IgG; human; murine; ss.

OS Chimaeric.

FH Key Location/Qualifiers

FT Region 26..35

FT /note= "grafted murine CDR1"

FT Region 50..65

FT /note= "grafted murine CDR2"

FT Region 95..102

FT /note= "grafted murine CDR3"

PN WO9201059-A.

PD 23-JAN-1992.

PF 05-JUL-1991; G01108.

PR 05-JUL-1990; GB-014932.

PR 21-DEC-1990; WO-G02017.

PR 05-JUL-1991; WO-G01108.

PA (CELL-) CELLTECH LTD.

PI Adair JR, Bodmer MW, Mountain A, Owens RJ;

DR WPI; 92-284316/34.

DR N-PSDB; Q27354.

PT Humanised antibody molecules - comprising murine and human regions,

PT specific for carcino-embryonic antigen, useful for diagnosis and

PT therapy

PS Example 4; Figure 10; 71pp; English.

CC This sequence is CDR-grafted A5B7 human antibody having

CC murine CDRs at amino acids 26-35 (CDR1), 50-65 (CDR2), and 95-102


CC (CDR3) and additional murine framework residues at ir /Ct

CC 73, 76, and 93. The LAY framework was chosen when making the coding

CC construct (Q27354) as it shows the highest homology to A5B7. The

CC antibody has specificity for carcinoembryonic antigen, produced by

CC tumours, and the Ab is thus useful in both therapy and diagnosis of

CC certain carcinomas.

SQ Sequence 146 AA;

SQ 7 A; 7 R; 4 N; 4 D; 0 B; 2 C; 6 Q; 6 E; 0 Z; 18 G; 1 H;

SQ 3 I; 14 L; 6 K; 3 M; 9 F; 3 P; 16 S; 13 T; 5 W; 9 Y; 10 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

SAPCKGLEHLGFIGNKANGYTTEYSASVKGRFTISRDKSKSTLYLQMNGLflAEVSAIYYCTRDRGLRFYFDY 
60 70 80 90 100 110 X 120 

WG8GTLVTVSSASTKGP 
130 140 



4. US-08-249-182-4 (1-5) 

R20793 CDR-grafted, humanised heavy chain gH1.

ID R20793 standard; Protein; 146 AA.

AC R20793;

DT 19-MAY-1992 (first entry)

DE CDR-grafted, humanised heavy chain gH1.

KW murine monoclonal antibody; MAb; A5B7; humanised antibody; CEA;

KW complementarity determining region.

OS Homo sapiens.

OS Mus musculus.

FH Key Location/Qualifiers 

FT Peptide 1..19 

FT /label= signal

FT Protein 20..146

FT /label= VH

FT /note= "human LAY framework with A5B7 CDRs"

FT Region 45..54

FT /label= CDR1

FT /note= "murine residues"

FT Region 69..87

FT /label= CDR2

FT /note= "murine residues"

FT Region 120..129

FT /label= CDR3

FT /note= "murine residues"

FT Misc_difference 20

FT /note= "murine residue"

FT Misc_difference 67..68

FT /note= "murine residues"

FT Misc_difference 94..95

FT /note= "murine residues"

FT Misc_difference 98

FT /note= "murine residue"

FT Misc_difference 118

FT /note= "murine residue"

PN WO9201059-A.

PD 23-JAN-1992.

PF 05-JUL-1991; G01108.

PR 05-JUL-1990; GB-014932.

PR 21-DEC-1990; WO-G02017.

PR 05-JUL-1991; WO-G01108.

PA (CELL-) CELLTECH LTD.

PI Adair JR, Bodmer MW, Mountain A, Owens RJ;

DR WPI; 92-056874/07.

DR N-PSDB; Q20987.

PT New CDR-grafted anti carcinoembryonic antigen antibodies - useful

PT in therapy and diagnosis of carcinoma

PS Claim 14; Fig 10; 70pp; English.

CC This heavy chain sequence comprises a human framework (i.e. the LAY 

CC region) which contains murine sequences (from the murine anti-CEA

CC A5B7 MAb) in the CDRs and at other positions predicted to be

CC important for antigen-binding and at which human and A5B7 sequences

CC differ. (See Q20984 for A5B7 heavy chain coding sequence).

SQ Sequence 146 AA;

SQ 7 A; 7 R; 4 N; 4 D; 0 B; 2 C; 6 Q; 6 E; 0 Z; 18 G; 1 H;

SQ 3 I; 14 L; 6 K; 3 M; 9 F; 3 P; 16 S; 13 T; 5 W; 9 Y; 10 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

Q APGKGLEWLGF I GNK ANG YTTE YSASVKGRFT I SRDKSKSTLYLQMNGLSAEVSA I YYCTRDRGLRF YFDY 
60 70 80 90 100 110 X 120 



WG9GTLVTVSSASTKGP 
130 140 



5. US-08-249-182-4 (1-5) 

R41469 MAb 25D2 humanised heavy chain. 

ID R41469 standard; Protein; 140 AA. 

AC R41469; 

DT 03-MAR-1994 (first entry) 

DE MAb 25D2 humanised heavy chain. 

KW Heavy; VH; light; VL; chain; variable region; antihuman; interleukin-4;

KW IL-4; monoclonal antibody; MAb; 25D2; single chain binding protein;

KW complementarity determining region; CDR; humanised; Fv region; BABS;

KW antagonist; polymerase chain reaction; PCR; primer; amplify; gamma4;

KW pSV.SPORT.

OS Rattus rattus. 

OS Homo sapiens. 

FH Key Location/Qualifiers 

FT Peptide 1..19 

FT /note= "Leader sequence" 

FT Protein 20.. 140 

FT /note= "25D2 H chain" 

PN WO9317106-A.

PD 02-SEP-1993.

PF 18-FEB-1993; U01301.

PR 19-FEB-1992; US-841659.

PA (SCHE ) SCHERING CORP.

PI Abrams JS, Dalie B, Le HV, Miller K, Murgolo NJ;

PI Nguyen H, Pearce M, Tindall S, Zavodny PJ;

DR WPI; 93-288412/36. 

DR N-PSDB; Q48350. 

PT Monoclonal antibodies against human interleukin-4 corresp. DNA 

PT and CDRs - are useful for detection of interleukin-4 and treatment 

PT of related diseases 

PS Example 9; Page 89-90; 114pp; English.

CC This sequence represents the humanised heavy (H) chain of the antihuman

CC interleukin-4 (IL-4) monoclonal antibody (MAb) 25D2. The 25D2 H

CC chain coding region was cloned in three fragments using the primers

CC given in Q48351-60. The amplified fragments were designed to

CC contain silent restriction sites, however several codons had to

CC be changed to incorporate further restriction sites. The primers

CC given in Q48361-66 were used to amplify the entire H chain variable

CC region (VH) of an unrelated humanised antibody. The amplified fragments

CC were then cloned into pSV.Sport which already contained the 25D2 H

CC chain fragments. The primers given in Q48367-72 were used in

CC further manipulations to amplify a human gamma4 constant region cDNA

CC which was used to replace the genomic DNA. The humanised MAb is an

CC IL-4 antagonist. It may be used in a pharmaceutical composition for

CC detecting, measuring and immunopurifying human IL-4 and blocking IL-4

CC activity in IL-4-related diseases.

SQ Sequence 140 AA;

SQ 9 A; 7 R; 4 N; 5 D; 0 B; 3 C; 6 Q; 4 E; 0 Z; 16 G; 2 H;

SQ 6 I; 12 L; 2 K; 3 M; 7 F; 4 P; 18 S; 8 T; 5 W; 9 Y; 10 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

VRQAPGKGLEHVASISISGDNTYYPDSVRGRFTISRNDSKNTLYL8HNGL9AEVSAIYYCARDPYYFSGHYF 
60 70 80 90 100 X 110 120 

DFWGSGTLVTVSS 
130 140 



6. US-08-249-182-4 (1-5) 

R40179 Humanised antibody CMX5-3 heavy chain variable reg 

ID R40179 standard; Protein; 135 AA.

AC R40179; 

DT 14-FEB-1994 (first entry) 

DE Humanised antibody CMX5-3 heavy chain variable region. 

KW Primer; polymerase chain reaction; amplify; PCR; human; kappa; L; 

KW constant region; heavy; H; chain; pUC19; humanised; antibody; 

KW light; REI; VL3 fragment; CMX5-1; CMX5-3.

OS Synthetic. 

FH Key Location/Qualifiers 

FT Peptide 1..19 

FT /note= "Secretory leader peptide" 

FT Protein 20.. 135 

FT /note= "CMX5-3 heavy chain variable region"

PN WO9316184-A.

PD 19-AUG-1993.

PF 04-FEB-1993; U00759.

PR 06-FEB-1992; US-832842.

PA (SCHE ) SCHERING CORP.

PI Abrams JS, Chou C, Jenh C, Murgolo NJ, Petro ME;

PI Silver JE, Tindall S, Windsor WT, Zavodny PJ;

DR WPI; 93-272888/34.

PT Humanised monoclonal antibody - comprises variable animal region

PT and constant human region, binds to human interleukin-5

PS Example; Page 91-92; 118pp; English. 

CC The sequences given in R40179-80 represent the variable regions of 

CC the heavy and light chains of the humanised antibody CMX5-3 

CC respectively. These sequences were based on the humanised antibody 

CC CMX5-1. These sequences were generated using the primer sequences 

CC given in Q48068-71. These primers were based on sequences derived

CC from antibody JES1-39D10 and human LAY VH framework sequences. The 

CC amplification products were used to replace the VH1 and VH3 fragments 

CC of CMX5-1 H chain cDNA in pSV. Sport (see also R40175). 

SQ Sequence 135 AA; 

SQ 9 A; 5 R; 7 N; 4 D; 0 B; 3 C; 6 Q; 5 E; 0 Z; 15 G; 1 H;



/ ii ID L r J T\$ C Hi * M £ V r if D i 0 lr 3 W* / Tr iU Vf 



Initial Score = 5 Optimized Score = 5 Significance = 4.48
Residue Identity = 100% Matches = 5 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

WIRQAPGKGLEWVALIWSNGDTDYNSAIKSRFTISRNDSKNTLYLQHNGLfiAEVSAIYFCAREYYGYFDYHG 
60 70 80 90 100 X 110 120 

QGTLVTVSS 
130 



7. US-08-249-182-4 (1-5) 

R41113 HCV peptide XIV or HCV8 (aa 1730-1749).

ID R41113 standard; peptide; 22 AA.

AC R41113;

DT 22-MAR-1994 (first entry)

DE HCV peptide XIV or HCV8 (aa 1730-1749).

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine. 

OS Synthetic. 

FH Key Location/Qualifiers

FT Modified_site 1

FT /note= "the N-terminal comprises (A)-(B)-(X)-Y; where

FT B= biotin;

FT X= biotinylation cpd. incorporated

FT during synthesis;

FT Y= bond or linking gp(s). which

FT minimises steric hindrance,

FT where Y is not a bond it is pref. 1-10

FT residues of (same or different) glycine,

FT beta-alanine, 4-aminobutyric acid,

FT 5-aminovaleric acid or 6-aminohexanoic acid;

FT parenthesis around B and X indicate opt. presence

FT at the specified positions but B or X must be

FT present in at least one of the positions shown,

FT B interacts with the peptide to give a cpd.

FT with greater diagnostic sensitivity;

FT A (optional)= one or more amino acids, NH2 or

FT gp. which modifies the N-terminus;

FT Z= one or more amino acids, OH, NH2, or a

FT linkage involving either of these 2 gps."

FT Modified_site 22

FT /note= "the C-terminal comprises Y-(X)-Z"

PN WO9318054-A.

PD 16-SEP-1993.

PF 08-MAR-1993; E00517.

PR 06-MAR-1992; EP-400598.

PA (INNO-) INNOGENETICS NV SA.

PI De LEYS R;

DR WPI; 93-303397/38.

PT New biotinylated peptide(s) corresp. to immuno-dominant

PT epitope(s) - with increased antigenicity, useful in antibodies

PT detection and vaccines against hepatitis C, HIV and HTLV

PS Claim 4; Page 90-98; 133pp; English.

CC Peptide compsns. comprise at least one and pref. a combination of

CC two, three, four or more biotinylated peptides chosen from the

CC sequences given in R41058-R41166. The peptides represent

CC immunologically important regions of viral proteins and are

CC prepd. by solid phase peptide synthesis. The compsns. are

CC useful for the detection of antibodies to HCV, and/or HIV,

CC and/or HTLV-I or II.

SQ Sequence 22 AA;

SQ 5 A; 1 R; 0 N; 0 D; 0 B; 0 C; 3 Q; 1 E; 0 Z; 1 G; 0 H;

SQ 1 I; 3 L; 1 K; 0 M; 0 F; 1 P; 1 S; 1 T; 0 W; 0 Y; 1 V;

SQ 2 Others;

Initial Score = 4 Optimized Score = 4 Significance = 3.58
Residue Identity = 80% Matches = 4 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
||||

XQKALGLLQTASRQAEVIAPAX
10 X 20 
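
The Matches, Mismatches and Residue Identity figures reported for each hit can be reproduced for this ungapped case by sliding the five-residue query along the database sequence and counting identical positions. The Python sketch below does only that. It is an illustration and not FastDB's actual search procedure, which the parameter block indicates is k-tuple based; the function name `best_ungapped_match` is hypothetical, and the target string is the 22-residue sequence of this record with the OCR spacing removed.

```python
def best_ungapped_match(query: str, target: str) -> tuple:
    """Return (matches, offset) for the best ungapped placement of query on target.

    offset is the 0-based position in target where the query starts.
    The first placement reaching the highest match count is kept.
    """
    if len(query) > len(target):
        raise ValueError("query is longer than target")
    best_matches, best_offset = -1, -1
    for offset in range(len(target) - len(query) + 1):
        window = target[offset:offset + len(query)]
        matches = sum(q == t for q, t in zip(query, window))
        if matches > best_matches:
            best_matches, best_offset = matches, offset
    return best_matches, best_offset


query = "QAEVS"
target = "XQKALGLLQTASRQAEVIAPAX"

matches, offset = best_ungapped_match(query, target)
mismatches = len(query) - matches
identity = 100.0 * matches / len(query)
print(f"matches={matches} mismatches={mismatches} "
      f"identity={identity:.0f}% at residues {offset + 1}-{offset + len(query)}")
```

For this record the best placement aligns QAEVS against QAEVI, giving 4 matches and 1 mismatch out of 5 positions, which agrees with the 80% residue identity shown above.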



8. US-08-249-182-4 (1-5)

R41186 HCV NS4 protein HCV7/8.

ID R41186 standard; peptide; 26 AA.

AC R41186;

DT 22-MAR-1994 (first entry)

DE HCV NS4 protein HCV7/8.

KW Human immunodeficiency virus; HIV; hepatitis C virus; HCV;

KW non-A non-B hepatitis; NANBH; human T-cell lymphotropic virus; HTLV;

KW epitope; antibody; biotin; diagnosis; detection; vaccine.

OS Synthetic. 

FH Key Location/Qualifiers 

FT Region 10..16

FT /label= epitope_4 

PN W09318054-A. 

PD 16-SEP-1993. 

PF 08-HAR-1993; E00517. 

PR 06-HAR-1992J EP-400598. 

PA (INN0-) INNOGENET ICS NV SA. 

PI De LEYS R; 

DR HPI; 93-303397/38. 

PT Net) biotinylated peptide(s) corresp. to innuno-doninant 

PT epitope(s) - with increased antigenicityr useful in antibodies 

PT detection and vaccines against hepatitis C> HIV and HTLV 

PS Disclosure; Page 79; 133pp; English. 

CC Peptide conpsns. conprise at least one and pref. a combination of 

CC tud three i four or nore biotinylated peptides chosen fron the 

CC sequences given in R41058-R41166. 

CC The peptides nay be hybrids consisting of conbinations of the core 

CC epitopes of the HCV core (R41171-R41180) , HCV NS4 (R41181-R41186) or 

CC the HCV NS5 (R41 187-R41 193) region separated by Gly and/or Ser residues. 

CC Pref. hybrid peptides are given in R41161-R41163. 

CC The peptides represent innunologically inportant regions of viral 

CC proteins and are prepd. by solid phase peptide synthesis. The conpsns. 

CC are useful for the detection of antibodies to HCVr and/or HIVi 

CC and/or HTLV-I or II. 

SQ Sequence 26 AA; 

SQ 6 A; l R; 0 N; 0 D; 0 B; 0 C; 4 Q; 2 E; o z; l G; 0 H; 

S9 l l; 4 L; 2 K; 0 M; l F; l P; l s; l T; o H; 0 Y; l v; 

Initial Score = 4 Optinized Score = 4 Significance = 3.58 
Residue Identity - 80% Hatches = 4 Hi snatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
QAEVS 

mi 

LAEQFKSKALGLLQTASRSAEVI APA 
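
The Matches, Mismatches and Residue Identity figures printed with each hit can be recomputed from the aligned residues. The following is a minimal Python sketch, assuming an ungapped alignment scored residue by residue for identity; the helper name `alignment_stats` is hypothetical and is not part of FastDB.

```python
def alignment_stats(query: str, target_segment: str) -> dict:
    """Count identities for an ungapped alignment of two equal-length segments."""
    if len(query) != len(target_segment):
        raise ValueError("an ungapped alignment needs equal-length segments")
    pairs = list(zip(query, target_segment))
    matches = sum(q == t for q, t in pairs)
    return {
        "matches": matches,
        "mismatches": len(pairs) - matches,
        "identity_percent": 100.0 * matches / len(pairs),
        # One '|' under each identical pair, as in the printed alignments.
        "bars": "".join("|" if q == t else " " for q, t in pairs),
    }


# Query QAEVS against the five target residues it is aligned with (QAEVI).
stats = alignment_stats("QAEVS", "QAEVI")
```

For this hit the sketch gives 4 matches, 1 mismatch and 80% identity, which agrees with the printed statistics.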



9. US-08-249-182-4 (1-5)

P90810 Synthetic HTLV-1 peptide antigen.

ID P90810 standard; peptide; 26 BP.
AC P90810;
DT 07-FEB-1990 (first entry)
DE Synthetic HTLV-1 peptide antigen.
KW HTLV-1; antigen; vaccine; env gene.
FH Key Location/Qualifiers
FT Region 1
FT Region 25
FT Region 26
PN WO8908664-A.
PD 21-SEP-1989.
PF 10-MAR-1989; SE0126.
PR 10-MAR-1988; US-166205.
PA (VIRO-) Virovahl S.A.
PI Svennerholm B, Rymo L, Jeansson S, Horal P;
DR WPI; 89-292495/40.
PT Synthetic HTLV-1 peptide antigens - used for detection of HTLV-1
PT infection or in vaccines to elicit prodn. of antibodies.
PS Claim 3; page 25; 34pp; English.
CC The synthetic peptide sequence corresponds to an immunodominant region
CC of the envelope glycoprotein encoded by bps 6018-6086 of the env gene of
CC HTLV-1. AA1 = H or an amino acid added to facilitate coupling to a
CC carrier protein; AA25 = absent or Cys; and AA26 = OH or NH2. The
CC Ag peptide is used for detection of HTLV-1 infection or in vaccines.
CC See also P90809, P90811, and P90812.
SQ Sequence 26 AA;
SQ 1 A; 0 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 0 G; 2 H;
SQ 3 I; 2 L; 0 K; 0 M; 1 F; 2 P; 3 S; 1 T; 1 W; 0 Y; 1 V;
SQ 3 Others;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

          X   X
          QAEVS
          || ||
XWTHCFDPQIQAIVSSPCHNSLILXX
        10    X   20



10. US-08-249-182-4 (1-5)

R40053 Hib OMP P1 peptide HIBP1-1 (1-29).

ID R40053 standard; peptide; 30 AA.
AC R40053;
DT 04-FEB-1994 (first entry)
DE Hib OMP P1 peptide HIBP1-1 (1-29).
KW Haemophilus influenzae; type b; Hib; outer membrane protein; P1; P2;
KW P6; vaccine; antibody; detection; lipoglycopeptide conjugate;
KW immunogen.
OS Synthetic.
FH Key Location/Qualifiers
FT Misc_difference 30
FT /note= "May be absent"
PN WO9315205-A.
PD 05-AUG-1993.
PF 03-FEB-1993; CA0041.
PR 03-FEB-1992; GB-002219.
PA (CONN-) CONNAUGHT LAB LTD.
ri unong r? tvanoit ruem rmi aia
DR WPI; 93-258681/32.
PT Synthetic Haemophilus influenzae conjugate vaccine - comprising
PT T-helper cell determinants and B-cell epitope(s) linked to
PT synthetic oligo: saccharide(s)
PS Table 1; Page 47; 99pp; English.
CC The sequences given in R40053-101 are peptide fragments derived from
CC the Haemophilus influenzae type b (Hib) outer membrane proteins P1,
CC P2 and P6. These peptides may be used in a vaccine against Hib
CC infection and antibodies against these peptides may be used in test
CC kits to detect H. influenzae in a sample. The vaccine may further
CC comprise a immunogenic or immunostimulatory molecule or the peptides
CC may be modified with lipids, or linked to synthetic PRP as synthetic
CC lipoglycopeptide conjugates to produce alternative vaccines.
SQ Sequence 30 AA;
SQ 9 A; 1 R; 1 N; 1 D; 0 B; 1 C; 1 Q; 2 E; 0 Z; 3 G; 0 H;
SQ 1 I; 2 L; 0 K; 0 M; 1 F; 0 P; 3 S; 1 T; 0 W; 1 Y; 2 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

    X   X
    QAEVS
     ||||
AAFQLAEVSTSGLGRAYAGEAAIADNASVC
    X   10        20        30
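
The SQ lines of each record give the residue composition, which makes a convenient check on a transcribed sequence. The sketch below recounts the composition of the 30-residue peptide shown in the alignment above; it is an illustrative check in Python, not something the search program itself performs.

```python
from collections import Counter


def composition(sequence: str) -> Counter:
    """Return the number of occurrences of each one-letter residue code."""
    return Counter(sequence.upper())


# Target sequence of record R40053 as printed in its alignment.
seq = "AAFQLAEVSTSGLGRAYAGEAAIADNASVC"
counts = composition(seq)

# A few of the counts to compare with the SQ lines: 9 A, 3 G, 3 S, 2 V, 0 K.
checked = {residue: counts[residue] for residue in "AGSVK"}
```

A mismatch between such a recount and the SQ lines usually points to a misread character in the sequence.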



11. US-08-249-182-4 (1-5)

R13186 Peptide VII immunoreactive with anti-HTLV antibodi

ID R13186 standard; Protein; 38 AA.
AC R13186;
DT 09-OCT-1991 (first entry)
DE Peptide VII immunoreactive with anti-HTLV antibodies.
KW human T-cell leukaemia virus; AIDS; ATL; detection;
KW envelope protein gp61; acquired immunodeficiency syndrome.
OS Synthetic.
PN EP-439077-A.
PD 31-JUL-1991.
PF 18-JAN-1991; 100616.
PR 24-JAN-1990; US-469291.
PA (UNBI-) UNITED BIOMEDICAL.
PI Wang CY;
DR WPI; 91-224505/31.
PT Peptide compsns. corresp. to envelope fragments of HTLV-1,2 - for
PT detecting antibodies to these viruses and diagnosing HIV and
PT adult T-cell leukaemia infections
PS Claim 1; Page 17; 27pp; English.
CC This peptide is one of 16 peptides useful for detecting antibodies to
CC HTLV or HIV viruses. The peptides correspond to partial sequences of
CC the HTLV virus designated gp21 and gp64, both part of gp61, which
CC defines the envelope protein of the HTLV-I or HTLV-II virus. The
CC peptides can be amidated at the C-terminal. This particular peptide
CC is used in a composition with at least two of the other peptides of
CC the invention. See R13184-R13193 and R13861-6.
SQ Sequence 38 AA;
SQ 2 A; 3 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 1 G; 1 H;
SQ 3 I; 4 L; 0 K; 0 M; 2 F; 6 P; 7 S; 1 T; 0 W; 0 Y; 2 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

      X   X
      QAEVS
      || ||
CFDPQIQAIVSSPCHNSLILPPFSLSPVPTLGSRSRRA
      X 10        20        30
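
The annotation part of these records is line-coded: each line starts with a two-letter code (ID, AC, DT, DE, KW, PN, PT, CC, SQ and so on) followed by free text, and a field that runs over several lines repeats its code. A minimal Python sketch of collecting such lines into fields follows; the function name `parse_line_coded_record` is hypothetical, and the sketch assumes it is given annotation lines only, not the statistics or alignment lines.

```python
def parse_line_coded_record(lines):
    """Group the text of lines that share a two-letter code, in order of appearance."""
    fields = {}
    for line in lines:
        code, _, text = line.strip().partition(" ")
        # Keep only lines that begin with a two-letter upper-case code.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields.setdefault(code, []).append(text.strip())
    # Continuation lines of one field are joined with single spaces.
    return {code: " ".join(parts) for code, parts in fields.items()}


example = [
    "ID R13186 standard; Protein; 38 AA.",
    "AC R13186;",
    "PT detecting antibodies to these viruses and diagnosing HIV and",
    "PT adult T-cell leukaemia infections",
]
parsed = parse_line_coded_record(example)
```

Joining continuation lines this way turns the wrapped PT and CC text back into single strings that can be searched or compared between records.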



12. US-08-249-182-4 (1-5)

R06408 HTLV-1 corresponding peptide (VI).

ID R06408 standard; protein; 38 AA.
AC R06408;
DT 21-DEC-1990 (first entry)
DE HTLV-1 corresponding peptide (VI).
KW HTLV-1; HIV; antibodies; vaccines; polymers;
OS Synthetic.
PN WO9008162-A.
PD 26-JUL-1990.
PF 16-JAN-1990; U00260.
PR 13-JAN-1989; US-297635.
PA (UNBI-) UNITED BIOMED INC.
PI Wang CY;
DR WPI; 90-254015/33.
PT Synthetic peptide(s) corresponding to HTLV-1 and opt. HIV - used
PT for detection of antibodies, in vaccines and for development of
PT antibodies
PS Claim 1 (VI); Page 38; 52pp; English.
CC Peptides having specific immunoreactivity to antibodies to HTLV-1
CC comprise this sequence on its own, or an analogue of it in which
CC amino acids may be added, deleted or substd., or segments, mixts.,
CC conjugates or polymers of the peptides represented in R06403-08.
CC The peptides are safe, sensitive and specific in the detection of
CC antibodies. This peptide corresponds to a partial segment of the
CC amino acid sequence of the HTLV-1 virus gp.21 or gp.46 and are
CC prepared by solid phase synthesis.
SQ Sequence 38 AA;
SQ 2 A; 3 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 1 G; 1 H;
SQ 3 I; 4 L; 0 K; 0 M; 2 F; 6 P; 7 S; 1 T; 0 W; 0 Y; 2 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

      X   X
      QAEVS
      || ||
CFDPQIQAIVSSPCHNSLILPPFSLSPVPTLGSRSRRA
      X 10        20        30



13. US-08-249-182-4 (1-5)

R20772 Peptide V based on immunoreactive region of Hepati

ID R20772 standard; Protein; 40 AA.
AC R20772;
DT 05-MAY-1992 (first entry)
DE Peptide V based on immunoreactive region of Hepatitis C virus.
KW Non-A, non-B hepatitis virus; non-structural protein; vaccine.
OS Synthetic.
PN EP-468527-A.
PD 29-JAN-1992.
PF 26-JUL-1991; 112620.
PR 26-JUL-1990; US-558799.
PR 07-FEB-1991; US-651735.
PR 11-MAR-1991; US-667275.
PR 24-JUN-1991; US-719819.
PA (UNBI-) UTD BIOMEDICAL INC.
DR WPI; 92-034279/05.
PT New synthetic peptide specific for HCV antibodies - for detection
PT of HCV or NANBHV e.g. by enzyme-linked immunosorbent assay and is
PT immunogen for preparation of vaccines
PS Disclosure; Page 13; 98pp; English.
CC Peptide V is from the non-structural protein region of HCV. It was
CC found to be reactive and useful for the detection of antibodies to
CC HCV and diagnosis of NANBH. See R20751-R20782.
SQ Sequence 40 AA;
SQ 6 A; 1 R; 2 N; 0 D; 0 B; 0 C; 5 Q; 2 E; 0 Z; 1 G; 1 H;
SQ 1 I; 4 L; 4 K; 1 M; 2 F; 1 P; 1 S; 3 T; 3 W; 0 Y; 2 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

             X   X
             QAEVS
             ||||
KQKALGLLQTASRQAEVIAPAVQTNWQKLETFWAKHMWNF
        10   X    20        30        40



14. US-08-249-182-4 (1-5)

R13185 Peptide (V) immunoreactive with anti-HTLV antibodi

ID R13185 standard; Protein; 40 AA.
AC R13185;
DT 09-OCT-1991 (first entry)
DE Peptide (V) immunoreactive with anti-HTLV antibodies.
KW human T-cell leukaemia virus; AIDS; ATL; detection;
KW envelope protein gp61; acquired immunodeficiency syndrome.
OS Synthetic.
PN EP-439077-A.
PD 31-JUL-1991.
PF 18-JAN-1991; 100616.
PR 24-JAN-1990; US-469291.
PA (UNBI-) UNITED BIOMEDICAL.
PI Wang CY;
DR WPI; 91-224505/31.
PT Peptide compsns. corresp. to envelope fragments of HTLV-1,2 - for
PT detecting antibodies to these viruses and diagnosing HIV and
PT adult T-cell leukaemia infections
PS Claim 1; Page 17; 27pp; English.
CC This peptide is one of 16 peptides useful for detecting antibodies to
CC HTLV or HIV viruses. The peptides correspond to partial sequences of
CC the HTLV virus designated gp21 and gp64, both part of gp61, which
CC defines the envelope protein of the HTLV-I or HTLV-II virus. The
CC peptides can be amidated at the C-terminal. This particular peptide
CC is used in a composition with at least two of the other peptides of
CC the invention. See also R13184, R13186-R13192 and R13861-6.
SQ Sequence 40 AA;
SQ 3 A; 0 R; 1 N; 1 D; 0 B; 2 C; 2 Q; 0 E; 0 Z; 0 G; 3 H;
SQ 2 I; 6 L; 0 K; 0 M; 2 F; 7 P; 5 S; 3 T; 1 W; 1 Y; 1 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

                               X   X
                               QAEVS
                               || ||
SSTPLLYPSLALPAPHLTLPFNWTHCFDPQIQAIVSSPCH
        10        20        30 X   X  40



15. US-08-249-182-4 (1-5)

R13591 HTLV-I env precursor epitope peptide.

ID R13591 standard; Protein; 40 AA.
AC R13591;
DT 03-OCT-1991 (first entry)
DE HTLV-I env precursor epitope peptide.
KW HTLV-I; epitope; diagnosis; env protein; gp46; antibody; vaccine.
OS Synthetic.
PN US5003043-A.
PD 26-MAR-1991.
PF 25-MAY-1988; 198416.
PR 25-MAY-1988; US-198416.
PA (TRIT-) TRITON BIOSCIENCES.
PI Akita RW, Florine DL, Ralston JS;
DR WPI; 91-221557/30.
PT Synthetic peptide(s) and antibodies corresp. to an epitope of
PT HTLV-I - used in diagnosis, therapy, prepn. of vaccines and
PT prognostic indicators of HTLV-I infection
PS Disclosure; Page 3; 10pp; English.
CC The peptide has specific binding affinity for 0.5alpha monoclonal
CC antibody. It represents an epitopic site on the major HTLV-I
CC envelope precursor, i.e. it corresponds to residues 281-320. The
CC peptide and its antibody can be used in diagnosing the presence of
CC HTLV-I associated diseases, as vaccines against HTLV-I infection or
CC as prognostic indicators after HTLV-I infection.
CC See also R13077, R13590 and R13591.
SQ Sequence 40 AA;
SQ 3 A; 3 R; 0 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 1 G; 1 H;
SQ 3 I; 5 L; 0 K; 0 M; 1 F; 6 P; 7 S; 1 T; 1 W; 0 Y; 5 V;

Initial Score    =   4   Optimized Score            = 4   Significance = 3.58
Residue Identity = 80%   Matches                    = 4   Mismatches   = 1
Gaps             =   0   Conservative Substitutions = 0

 X   X
 QAEVS
 || ||
IQAIVSSPCHCSLILPPFSLSPVPTLGSRSRRAVPVAVWL
 X   X  10        20        30        40

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_4p.res made by on Thu 22 Sep 94 10:25:41-PDT.

Query sequence being compared: US-08-249-182-4 (1-5)
Number of sequences searched:  70848
Number of scores above cutoff: 4444

Results of the initial comparison of US-08-249-182-4 (1-5) with:
Data bank : PIR 41, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 0 to 100000) versus SCORE (0 to 4) and STDEV (0 to 3)]



PARAMETERS

Similarity matrix         Unitary     K-tuple                 2
Mismatch penalty          1           Joining penalty         20
Gap penalty               1.00        Window size             5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0
Initial scores to save    40          Alignments to save      15
Optimized scores to save  0           Display context         50
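
With a unitary similarity matrix, each identical residue pair contributes one point to the score. The Python sketch below illustrates that scoring with a brute-force scan of every ungapped placement of the query on a target. It is a simplification under stated assumptions: mismatches are scored as 0, and the k-tuple lookup, joining penalty and gap penalties listed above are not modelled. The function name `best_ungapped_window` is hypothetical.

```python
def best_ungapped_window(query, target):
    """Return (score, 1-based start) of the best ungapped placement of query on target.

    Scoring is +1 for each identical residue pair and 0 otherwise.
    If several placements tie, the first one is kept.
    """
    best_score, best_start = -1, 0
    for start in range(len(target) - len(query) + 1):
        window = target[start:start + len(query)]
        score = sum(q == t for q, t in zip(query, window))
        if score > best_score:
            best_score, best_start = score, start + 1
    return best_score, best_start


# The five-residue query against the 30-residue peptide of record R40053.
score, start = best_ungapped_window("QAEVS", "AAFQLAEVSTSGLGRAYAGEAAIADNASVC")
```

For this pair the best placement scores 4 and begins at residue 5 of the target, in line with the initial score of 4 reported for that hit.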



SEARCH STATISTICS 



Scores:    Mean    Median    Standard Deviation
           1       3         0.95

Times:     CPU     Total Elapsed

Number of residues:            20816057
Number of sequences searched:  70848
Number of scores above cutoff: 4444

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
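
The printout does not state the formula for the significance value. A reading consistent with the "standard deviations above mean" marker in the hit list is that it is the initial score expressed in standard deviations above the mean score. The Python sketch below works under that assumption and uses the rounded mean (1) and standard deviation (0.95) printed under SEARCH STATISTICS, so it can only approximate the printed values.

```python
def significance(score, mean, std_dev):
    """Number of standard deviations by which a score exceeds the mean (assumed formula)."""
    return (score - mean) / std_dev


# Rounded search statistics as printed: mean 1, standard deviation 0.95.
sig_score_5 = significance(5, mean=1.0, std_dev=0.95)
sig_score_4 = significance(4, mean=1.0, std_dev=0.95)
```

The results come out slightly above the printed 4.19 and 3.14; the small difference would be expected if the program used unrounded statistics, which is itself an assumption.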



19 100% similar sequences to the query sequence were found:



     Sequence                                            Init.  Opt.
     Name      Description                      Length   Score  Score   Sig.  Frame

  1. S20590    sialidase (EC 3.2.1.18) - Act       913       5      5   4.19      0
  2. A49227    sialidase - Actinomyces visco       901       5      5   4.19      0
  3. S39558    HSP90 homolog - Madagascar pe       817       5      5   4.19      0
  4. S33533    heat shock protein 90 homolog       809       5      5   4.19      0
  5. S31862    GRP94 protein homolog - barle       809       5      5   4.19      0
  6. A33827    regulatory protein ral2 - fis       611       5      5   4.19      0
  7. WMCVFM    inclusion body matrix protein       512       5      5   4.19      0
  8. A30828    steroid 17alpha-monooxygenase       507       5      5   4.19      0
  9. S16719    steroid 17alpha-monooxygenase       507       5      5   4.19      0
 10. S38397    vasoactive intestinal peptide       460       5      5   4.19      0
 11. JH0594    vasoactive intestinal peptide       459       5      5   4.19      0
 12. JN0604    vasoactive intestinal peptide       457       5      5   4.19      0
 13. S16562    nolF protein - Rhizobium meli       367       5      5   4.19      0
 14. A48470    elongation factor 1 alpha, EF       346       5      5   4.19      0
 15. A27659    cytochrome P450 17 - rat (fra       237       5      5   4.19      0
 16. A33980    steroid 17alpha-monooxygenase       235       5      5   4.19      0
 17. S18659    hypothetical protein - Mycopl       148       5      5   4.19      0
 18. S06727    hypothetical protein 1 (minic       122       5      5   4.19      0
 19. A42329    autotaxin - human (fragments)       114       5      5   4.19      0

The list of other best scores is: 



     Sequence                                            Init.  Opt.
     Name      Description                      Length   Score  Score   Sig.  Frame

               **** 3 standard deviations above mean ****

 20. A35776    Recfi protein - Escherichia co        5       4      4   3.14      0
 21. PC1149    equinatoxin 1A - sea anemone         13       4      4   3.14      0
 22. A40634    orf19 3' of eryK - Saccharopo        15       4      4   3.14      0
 23. S35970    ribosomal protein L10 - Citro        20       4      4   3.14      0
 24. S35978    ribosomal protein L10 - Prote        21       4      4   3.14      0
 25. B60701    31K antigen - Campylobacter j        22       4      4   3.14      0
 26. S35976    ribosomal protein L10 - Klebs        23       4      4   3.14      0
 27. S35975    ribosomal protein L10 - Enter        23       4      4   3.14      0
 28. S04171    aadA protein - Klebsiella pne        23       4      4   3.14      0
 29. A60701    31K antigen PEB4 - Campylobac        38       4      4   3.14      0
 30. S10765    glutamate synthase - Azospiri        40       4      4   3.14      0
 31. A10265    alpha-lactalbumin I - eastern        42       4      4   3.14      0
 32. B39880    streptomycin/spectinomycin re        42       4      4   3.14      0
 33. S22426    thymosin beta-12 - rainbow tr        44       4      4   3.14      0
 34. S25708    hypothetical protein 4a - nun        44       4      4   3.14      0
 35. SU162     photosystem II protein psbK p        46       4      4   3.14      0
 36. A05024    hypothetical protein 55 - liv        55       4      4   3.14      0
 37. S01585    photosystem II protein psbK p        55       4      4   3.14      0
 38. S30963    gene 18 protein - Mycobacteri        57       4      4   3.14      0
 39. S31447    photosystem II protein psbK -        61       4      4   3.14      0
 40. S28768    photosystem II protein psbK -        61       4      4   3.14      0


1. US-08-249-182-4 (1-5)

S20590 sialidase (EC 3.2.1.18) - Actinomyces viscosus

ENTRY S20590 #type complete
TITLE sialidase (EC 3.2.1.18) - Actinomyces viscosus
ORGANISM #formal_name Actinomyces viscosus
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
      22-Nov-1993
ACCESSIONS S20590
REFERENCE S20590
   #authors Henningsen, M.; Roggentin, P.; Schauer, R.
   #journal Biol. Chem. Hoppe-Seyler (1991) 372:1065-1072
   #title Cloning, sequencing and expression of the sialidase gene from
      Actinomyces viscosus DSM 43798.
   #cross-references MUID:92162190
   #accession S20590
      ##status preliminary
      ##residues 1-913 ##label HEN
      ##cross-references EMBL:X62276
SUMMARY #length 913 #molecular-weight 96216 #checksum 4303
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
TETGKKVGYSDPSYVVDHQTGTIFNFHVKSYDQGWGGSRGGTDPENRGIIQAEVSTSTDNGHTWTHRTITAD
370 380 390 400 410 420 X 430 440

ITKDKPWTARFAASGQGIQIQHGPHAGRLVQQY
450 460 470



2. US-08-249-182-4 (1-5)

A49227 sialidase - Actinomyces viscosus

ENTRY A49227 #type complete
TITLE sialidase - Actinomyces viscosus
ORGANISM #formal_name Actinomyces viscosus
DATE 19-Dec-1993; #sequence_revision 19-Dec-1993; #text_change
      19-Dec-1993
ACCESSIONS A49227
REFERENCE A49227
   #authors Yeung, M.K.
   #journal Infect. Immun. (1993) 61:109-116
   #title Complete nucleotide sequence of the Actinomyces viscosus T14V
      sialidase gene: presence of a conserved repeating sequence
      among strains of Actinomyces spp.
   #cross-references MUID:93114861
   #contents T14V
   #accession A49227
      ##status preliminary
      ##molecule_type nucleic acid
      ##residues 1-901 ##label YEU
      ##cross-references NCBIN:121598; NCBIP:121599
      ##note sequence extracted from NCBI backbone
SUMMARY #length 901 #molecular-weight 92860 #checksum 7681
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
TETGKKVGYSDPSYVVDHQTGTIFNFHVKSYDQGWGGSRGGTDPENRGIIQAEVSTSTDNGHTNTHRTITAD
370 380 390 400 410 420 X 430 440

ITKDKPWTARFAASGQGIQIQHGPHAGRLVQQY
450 460 470



3. US-08-249-182-4 (1-5)

S39558 HSP90 homolog - Madagascar periwinkle

ENTRY S39558 #type complete
TITLE HSP90 homolog - Madagascar periwinkle
ORGANISM #formal_name Catharanthus roseus #common_name Madagascar
      periwinkle
DATE 19-May-1994; #sequence_revision 19-May-1994; #text_change
      19-May-1994
ACCESSIONS S39558
REFERENCE S39558
   #authors Schroeder, G.; Beck, M.; Eichel, J.; Vetter, H.P.; Schroeder,
      J.
   #journal Plant Mol. Biol. (1993) 23:583-594
   #title HSP90 homologue from Madagascar periwinkle (Catharanthus
      roseus): cDNA sequence, regulation of protein expression
      and location in the endoplasmic reticulum.
   #accession S39558
      ##status preliminary
      ##residues 1-817 ##label SCH
SUMMARY #length 817 #molecular-weight 93491 #checksum 9348
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
DSDAPVDPPKVEDKIGAVPNGLSTDSDVAKREAESMSMRNLRSDAEKFEFQAEVSRLHDIIINSLYSNKDIF
40 50 60 70 80 X X 90 100

LRELISNASDALDKIRFLALTDKEILGEGDTAK
110 120 130



4. US-08-249-182-4 (1-5)

S33533 heat shock protein 90 homolog precursor - barley

ENTRY S33533 #type complete
TITLE heat shock protein 90 homolog precursor - barley
ORGANISM #formal_name Hordeum vulgare #common_name barley
DATE 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change
      03-May-1994
ACCESSIONS S33533
REFERENCE S33533
   #authors Walther-Larsen, H.; Brandt, J.; Collinge, D.B.;
      Thordal-Christensen, H.
   #journal Plant Mol. Biol. (1993) 21:1097-1108
   #title A pathogen-induced gene of barley encodes a HSP90 homologue
      showing striking similarity to vertebrate forms resident in
      the endoplasmatic reticulum.
   #accession S33533
      ##molecule_type mRNA
      ##residues 1-809 ##label WAL
      ##cross-references EMBL:X67960
CLASSIFICATION #superfamily heat shock protein 90
KEYWORDS glycoprotein
FEATURE
   1-20 #domain signal sequence #status predicted #label SIG\
   21-809 #protein heat shock protein 90 homolog #status predicted
      #label MAT\
   111,410,450,617 #binding_site carbohydrate (Asn) (covalent) #status
      predicted
SUMMARY #length 809 #molecular-weight 92916 #checksum 2897
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
SSDEVGDFPKVEEKLGAVPHGLSTDSEVVQRESESISRKTLRNSAEKFEFQAEVSRLHDIIINSLYSNKDIF
40 50 60 70 80 X X 90 100

LRELISNASDALDKIRFLALTDKEVMGEGDTAK
110 120 130



5. US-08-249-182-4 (1-5)

S31862 GRP94 protein homolog - barley

ENTRY S31862 #type complete
TITLE GRP94 protein homolog - barley
ORGANISM #formal_name Hordeum vulgare #common_name barley
DATE 20-May-1994; #sequence_revision 20-May-1994; #text_change
      20-May-1994
ACCESSIONS S31862
REFERENCE S31862
   #authors Brandt, J.
   #submission submitted to the EMBL Data Library, August 1992
   #accession S31862
      ##status preliminary
      ##residues 1-809 ##label BRA
      ##cross-references EMBL:X67960
SUMMARY #length 809 #molecular-weight 92916 #checksum 2897
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
SSDEVGDFPKVEEKLGAVPHGLSTDSEVVQRESESISRKTLRNSAEKFEFQAEVSRLHDIIINSLYSNKDIF
40 50 60 70 80 X X 90 100

LRELISNASDALDKIRFLALTDKEVMGEGDTAK
110 120 130



6. US-08-249-182-4 (1-5)

A33827 regulatory protein ral2 - fission yeast (Schizosac

ENTRY A33827 #type complete
TITLE regulatory protein ral2 - fission yeast (Schizosaccharomyces
ORGANISM #formal_name Schizosaccharomyces pombe
DATE 23-Mar-1990 #sequence_revision 23-Mar-1990 #text_change
      30-Sep-1993
ACCESSIONS A33827
REFERENCE A33827
   #authors Fukui, Y.; Miyake, S.; Satoh, M.; Yamamoto, M.
   #journal Mol. Cell. Biol. (1989) 9:5617-5622
   #title Characterization of the Schizosaccharomyces pombe ral2 gene
      implicated in activation of the ras1 gene product.
   #cross-references MUID:90066514
   #accession A33827
      ##status preliminary
      ##molecule_type DNA
      ##residues 1-611 ##label FUK
      ##cross-references GB:M30827
SUMMARY #length 611 #molecular-weight 69847 #checksum 9734
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
GKLLLNGISDMEILTIERMHIPCLSRMLYKRWPAFQKIMDRAVEKNQEAFQAEVSQLGPQLTDLPFSSIHST
430 440 450 460 470 480 490

GSRALYMPYSFETCSAFLHYIYCGTLNGSYCTA
500 510 520 530



7. US-08-249-182-4 (1-5)

WMCVFM inclusion body matrix protein - figwort mosaic vir

ENTRY WMCVFM #type complete
TITLE inclusion body matrix protein - figwort mosaic virus
ORGANISM #formal_name figwort mosaic virus
DATE 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change
      08-Apr-1994
ACCESSIONS S01284
REFERENCE S01279
   #authors Richins, R.D.; Scholthof, H.B.; Shepherd, R.J.
   #journal Nucleic Acids Res. (1987) 15:8451-8466
   #title Sequence of figwort mosaic virus DNA (caulimovirus group).
   #cross-references MUID:88040466
   #accession S01284
      ##molecule_type DNA
      ##residues 1-512 ##label RIC
      ##cross-references EMBL:X06166
      ##note the translation of the nucleotide sequence is not given
         in this paper
CLASSIFICATION #superfamily caulimovirus inclusion body matrix protein
KEYWORDS matrix protein
SUMMARY #length 512 #molecular-weight 58207 #checksum 1605
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
FKPDYLRAASNGQSWFAVYKGPNKEFFTEWEIVADICKKRQKSKRFRSKEQAEVSISLYNKDIQDPVNFLRP
150 160 170 180 190 X X 210

VKLVKEERAAQPLKFKAIAAEQTIOFDEFROIK
220 230 240



8. US-08-249-182-4 (1-5)

A30828 steroid 17alpha-monooxygenase (EC 1.14.99.9) cytoc

ENTRY A30828 #type complete
TITLE steroid 17alpha-monooxygenase (EC 1.14.99.9) cytochrome P450
      17 - rat
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 19-May-1989 #sequence_revision 19-May-1989 #text_change
      28-Apr-1993
ACCESSIONS A30828; A31359
REFERENCE A94511
   #authors Dufau, M.L.
   #submission submitted to GenBank, December 1988
   #accession A30828
      ##molecule_type mRNA
      ##residues 1-507 ##label DUF
REFERENCE A90154
   #authors Namiki, M.; Kitamura, M.; Buczko, E.; Dufau, M.L.
   #journal Biochem. Biophys. Res. Commun. (1988) 157:705-712
   #title Rat testis P-450-17-alpha cDNA: the deduced amino acid
      sequence, expression and secondary structural
      configuration.
   #cross-references MUID:89076306
   #accession A31359
      ##molecule_type mRNA
      ##residues 1-507 ##label NAM
CLASSIFICATION #superfamily cytochrome P450
KEYWORDS endoplasmic reticulum; heme; membrane protein; monooxygenase;
      oxidoreductase
FEATURE
   2-21 #domain transmembrane #label TM1\
   169-186 #domain transmembrane #label TM2\
   441 #binding_site heme iron (Cys) (axial ligand) #status
      predicted
SUMMARY #length 507 #molecular-weight 57250 #checksum 9025
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
FVFTALLLSRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRGAWMDAQAEVST
460 470 480 490 500 X X



9. US-08-249-182-4 (1-5)

S16719 steroid 17alpha-monooxygenase (EC 1.14.99.9) cytoc

ENTRY S16719 #type complete
TITLE steroid 17alpha-monooxygenase (EC 1.14.99.9) cytochrome P450
      17 - rat
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
      21-Nov-1993
ACCESSIONS S16719
REFERENCE S16719
   #authors Fevold, H.R.; Lorence, M.C.; McCarthy, J.L.; Trant, J.M.;
   #journal Mol. Endocrinol. (1989) 3:968-975
   #title Rat P450(17-alpha) from testis: characterization of a
      full-length cDNA encoding a unique steroid hydroxylase
      capable of catalyzing both Delta(4)- and Delta
      (5)-steroid-17,20-lyase reactions.
   #cross-references MUID:89295447
   #accession S16719
      ##status preliminary
      ##residues 1-507 ##label FEV
      ##cross-references EMBL:M31681
SUMMARY #length 507 #molecular-weight 57250 #checksum 9025
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
FVFTALLLQRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRQAWMDAQAEVST
460 470 480 490 500 X X



10. US-08-249-182-4 (1-5)

S38397 vasoactive intestinal peptide receptor - human

ENTRY S38397 #type complete
TITLE vasoactive intestinal peptide receptor - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 22-Jan-1994; #sequence_revision 22-Jan-1994; #text_change
      22-Jan-1994
ACCESSIONS S38397
REFERENCE S38397
   #authors Couvineau, A.
   #submission submitted to the EMBL Data Library, September 1993
   #accession S38397
      ##status preliminary
      ##residues 1-460 ##label COU
      ##cross-references EMBL:X75299
SUMMARY #length 460 #molecular-weight 51929 #checksum 1116
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
RKWRRHHLQGVLGWNPKYRHPSGGSNGATCSTQVSMLTRVSPGARRSSSFQAEVSLV
410 420 430 440 450 X 460



11. US-08-249-182-4 (1-5)

JH0594 vasoactive intestinal peptide receptor precursor

ENTRY JH0594 #type complete
TITLE vasoactive intestinal peptide receptor precursor - rat
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change
      18-Jun-1993
ACCESSIONS JH0594
REFERENCE JH0594
   #authors Ishihara, T.; Shigemoto, R.; Mori, K.; Takahashi, K.; Nagata,
   #journal Neuron (1992) 8:811-819
   #title Functional expression and tissue distribution of a novel
      receptor for vasoactive intestinal polypeptide.
   #cross-references MUID:92232309
   #contents Lung
   #accession JH0594
      ##molecule_type mRNA
      ##residues 1-459 ##label ISH
      ##cross-references GB:M86835
KEYWORDS glycoprotein; membrane protein
FEATURE
   1-30 #domain signal sequence #status predicted #label SIG\
   31-459 #protein vasoactive intestinal polypeptide receptor
      #status predicted #label MAT\
   146-168 #domain transmembrane #label TM1\
   176-195 #domain transmembrane #label TM2\
   218-241 #domain transmembrane #label TM3\
   256-277 #domain transmembrane #label TM4\
   295-318 #domain transmembrane #label TM5\
   344-363 #domain transmembrane #label TM6\
   376-395 #domain transmembrane #label TM7\
   58,69,100,292 #binding_site carbohydrate (Asn) (covalent) #status
      predicted
SUMMARY #length 459 #molecular-weight 52057 #checksum 2598
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
RKWRRWHLQGVLGWSSKSQHPWGGSNGATCSTQVSMLTRVSPSARRSSSFQAEVSLV
410 420 430 440 450 X X



12. US-08-249-182-4 (1-5)

JN0604 vasoactive intestinal peptide receptor - human

ENTRY JN0604 #type complete
TITLE vasoactive intestinal peptide receptor - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change
      31-Dec-1993
ACCESSIONS JN0604
REFERENCE JN0604
   #authors Sreedharan, S.P.; Patel, D.R.; Huang, J.X.; Goetzl, E.J.
   #journal Biochem. Biophys. Res. Commun. (1993) 193:546-553
   #title Cloning and functional expression of a human neuroendocrine
      vasoactive intestinal peptide receptor.
   #accession JN0604
      ##molecule_type mRNA
      ##residues 1-457 ##label SRE
      ##cross-references GB:L13288
      ##note the nucleotide sequence is not given in this paper
KEYWORDS glycoprotein; hormone receptor; membrane protein
FEATURE
   135-157 #domain transmembrane #label TM1\
   175-194 #domain transmembrane #label TM2\
   216-240 #domain transmembrane #label TM3\
   255-276 #domain transmembrane #label TM4\
   293-316 #domain transmembrane #label TM5\
   342-361 #domain transmembrane #label TM6\
   374-393 #domain transmembrane #label TM7\
   ooroii ivu»c7v #binding_site carbohydrate (Asn) (covalent) #status
      predicted
SUMMARY #length 457 #molecular-weight 51547 #checksum 9283
SEQUENCE

Initial Score    =    5   Optimized Score            = 5   Significance = 4.19
Residue Identity = 100%   Matches                    = 5   Mismatches   = 0
Gaps             =    0   Conservative Substitutions = 0

                                                  X   X
                                                  QAEVS
                                                  |||||
RKWRRHHLQGVLGHNPKYRHPSGGSNGATCSTQVSHLTRVSPGARRSSSFQAEVSLV
410 420 430 440 450 X



13. US-08-249-182-4 (1-5)

S16562 nolF protein - Rhizobium meliloti

ENTRY S16562 #type complete
TITLE nolF protein - Rhizobium meliloti
ORGANISM #formal_name Rhizobium meliloti
DATE 21-Nov-1993 #sequence_revision 21-Nov-1993 #text_change 21-Nov-1993
ACCESSIONS S16562
REFERENCE S16561
#authors Baev, N.; Endre, G.; Petrovics, G.; Banfalvi, Z.; Kondorosi, A.
#journal Mol. Gen. Genet. (1991) 228:113-124
#title Six nodulation genes of nod box locus 4 in Rhizobium meliloti
are involved in nodulation signal production: nodM codes
for D-glucosamine synthetase.
#cross-references MUID:91360053
#accession S16562
##status preliminary
##residues 1-367 ##label BAE
##cross-references EMBL:X58632
SUMMARY #length 367 #molecular-weight 39541 #checksum 9824

SEQUENCE 



Initial Score = 5 Optimized Score = 5 Significance = 4.19
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
RGGCLSAQTELAEAVLERNTRLGERGAASEATRLAALADVLDLRAHVRSKQAEVSDAERSLSHAEVRAEFGG
120 130 140 150 160 X 170 180 



VIRARSVEEGQTVPLNTQLMTIVELNRLEVDAG 
190 200 210 



14. US-08-249-182-4 (1-5) 



A48470 elongation factor 1 alpha, EF-1 alpha - Eimeria bo

ENTRY A48470 #type fragment
TITLE elongation factor 1 alpha, EF-1 alpha - Eimeria bovis (fragment)
ORGANISM #formal_name Eimeria bovis
DATE 01-Dec-1993 #sequence_revision 01-Dec-1993 #text_change 01-Dec-1993
ACCESSIONS A48470
REFERENCE A48470
#authors Abrahamsen, M.S.; Clark, T.G.; Mascolo, P.; Speer, C.A.;
#journal Mol. Biochem. Parasitol. (1993) 57:1-14
#title Developmental gene expression in Eimeria bovis.
#cross-references MUID:93149194
#contents merozoites
#accession A48470
##status preliminary
##molecule_type nucleic acid
##residues 1-346 ##label ABR
##cross-references NCBIN:123619; NCBIP:123622
##note sequence extracted from NCBI backbone
SUMMARY #length 346 #checksum 2541

SEQUENCE 



Initial Score = 5 Optimized Score = 5 Significance = 4.19
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

GFEGAFSKEGSTREHALLAFTLGVKfiMIVGINKMDATTPDKYSETRFNEIQAEVSRYLKTVGYNPEKVPFVP 
20 30 40 50 60 70 X 80 



ISGFHGDNNVERSSNHPWYKGKILVEALDNVEP 
90 100 110 120 



15. US-08-249-182-4 (1-5) 

A27659 cytochrome P450 17 - rat (fragment) 



ENTRY A27659 #type fragment
TITLE cytochrome P450 17 - rat (fragment)
ALTERNATE NAMES cytochrome P450-17-alpha
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 31-Mar-1989 #sequence_revision 31-Mar-1989 #text_change lS-Jun-1993
ACCESSIONS A27659
REFERENCE A27659
#authors Nishihara, M.; Winters, C.A.; Buzko, E.; Waterman, M.R.; Dufau, M.L.
#journal Biochem. Biophys. Res. Commun. (1988) 154:151-158
#title Hormonal regulation of rat Leydig cell cytochrome
P-450-17-alpha mRNA levels and characterization of a
partial length rat P-450-17-alpha cDNA.
#cross-references MUID:88280759
#accession A27659
##molecule_type mRNA
##residues 1-237 ##label NIS
##note the authors translated the codon GAT for residues 18,
131, and 208 as Glu, Asn, and Asn respectively
CLASSIFICATION #superfamily cytochrome P450
KEYWORDS heme; monooxygenase; oxidoreductase
FEATURE
171 #binding_site heme iron (Cys) (axial ligand) #status predicted
SUMMARY #length 237 #checksum 8451
SEQUENCE



Initial Score = 5 Optimized Score = 5 Significance = 4.19
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||



rvr iHLLLunruLUYoi/imBLrnLLUi/rftvvr-Liijrriwru i vnBftBnuHUHtvai 
190 200 210 220 230 X X 

> 0 < 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_4s.res made by on Thu 22 Sep 94 10:18:01-PDT.

Query sequence being compared: US-08-249-182-4 (1-5)
Number of sequences searched: 36000
Number of scores above cutoff: 4070

Results of the initial comparison of US-08-249-182-4 (1-5) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 5 to 100000) plotted against SCORE (0 to 5) and STDEV (0 to 4); the plotted points are not recoverable.]



PARAMETERS

Similarity matrix          Unitary    K-tuple               2
Mismatch penalty           1          Joining penalty       20
Gap penalty                1.00       Window size           5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40         Alignments to save    15
Optimized scores to save   0          Display context       50
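The parameters above (unitary similarity matrix, mismatch penalty 1) can be illustrated with a small scoring sketch. It is an assumption-laden illustration, not FastDB's actual algorithm: FastDB uses a K-tuple lookup and joining step that is not reproduced here, and the function names `unitary_score` and `best_ungapped_hit` are hypothetical. The sketch only shows why a five-residue query that matches exactly gets a score of 5. The subject string is the tail of the human VIP receptor context printed in the listing.

```python
def unitary_score(query, window, mismatch_penalty=1):
    """Score two equal-length strings: +1 per identity, -penalty per mismatch."""
    if len(query) != len(window):
        raise ValueError("query and window must have the same length")
    return sum(1 if q == s else -mismatch_penalty for q, s in zip(query, window))


def best_ungapped_hit(query, subject, mismatch_penalty=1):
    """Return (score, zero-based offset) of the best ungapped window.

    Brute-force scan over every window; ties go to the leftmost window.
    """
    if len(subject) < len(query):
        raise ValueError("subject is shorter than query")
    best_score, best_offset = None, None
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        score = unitary_score(query, window, mismatch_penalty)
        if best_score is None or score > best_score:
            best_score, best_offset = score, offset
    return best_score, best_offset


QUERY = "QAEVS"                          # query US-08-249-182-4 (1-5)
SUBJECT_TAIL = "TRVSPGARRSSSFQAEVSLV"    # tail of the printed VIP receptor context

score, offset = best_ungapped_hit(QUERY, SUBJECT_TAIL)
print(score, offset)
```

With five matches, no mismatches and no gaps, the score equals the query length, which agrees with the "Initial Score = 5" lines in the alignments.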





SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           1       3         0.84

Times:     CPU            Total Elapsed
           00:00:49.92    00:00:54.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 4070

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.

The scores below are sorted by initial score.
Significance is calculated based on initial score.
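The listing states only that significance is calculated from the initial score. A plausible reading, offered here as an assumption rather than as FastDB's documented formula, is that significance is the number of standard deviations by which a score exceeds the mean. The sketch below checks that reading against the printed figures; `significance` and `implied_mean` are hypothetical helper names.

```python
def significance(score, mean, stdev):
    """Number of standard deviations by which score exceeds the mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


def implied_mean(score, sig, stdev):
    """Back out the unrounded mean from a printed score/significance pair."""
    return score - sig * stdev


# Printed statistics for this search: mean 1 (apparently rounded), SD 0.84.
print(round(significance(5, 1, 0.84), 2))    # listing prints 4.75 for score 5
print(round(significance(4, 1, 0.84), 2))    # listing prints 3.56 for score 4
print(round(implied_mean(5, 4.75, 0.84), 2))
print(round(implied_mean(4, 3.56, 0.84), 2))
```

Under this assumption both printed significance values imply a mean of about 1.01, which is consistent with the mean being shown rounded to 1.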



8 100% similar sequences to the query sequence were found:

                                                       Init.  Opt.
    Sequence Name  Description                  Length Score  Score  Sig.  Frame

 1. ENPL_CATRO     ENDOPLASMIN HOMOLOG PRECURSOR   817    5      5    4.75   0
 2. RAL2_SCHPO     RAL2 PROTEIN.                   611    5      5    4.75   0
 3. IBMP_FMVD      INCLUSION BODY MATRIX PROTEIN   512    5      5    4.75   0
 4. CPT7_RAT       CYTOCHROME P450 XVIIA1 (P450-   507    5      5    4.75   0
 5. VIPR_RAT       VASOACTIVE INTESTINAL POLYPEP   459    5      5    4.75   0
 6. VIPR_HUMAN     VASOACTIVE INTESTINAL POLYPEP   457    5      5    4.75   0
 7. NOLF_RHIME     NODULATION PROTEIN NOLF.        367    5      5    4.75   0
 8. YM2_STRCO      MINI-CIRCLE HYPOTHETICAL 13.3   122    5      5    4.75   0



The list of other best scores is:

                                                       Init.  Opt.
    Sequence Name  Description                  Length Score  Score  Sig.  Frame

               **** 3 standard deviations above mean ****

 9. RP30_YEAST     RIBOSOMAL PROTEIN RP30 (FRAGM    25    4      4    3.56   0
10. TYBB_ONCMY     THYMOSIN BETA-12.                42    4      4    3.56   0
11. LCA_MACGI      ALPHA-LACTALBUMIN I (LACTOSE     42    4      4    3.56   0
12. THIO_EUBAC     THIOREDOXIN (FRAGMENT).          45    4      4    3.56   0
13. PSBK_CHLRE     PHOTOSYSTEM II 4 KD REACTION     46    4      4    3.56   0
15. VG18_BPML5     GENE 18 PROTEIN (GP18).          57                3.56
16. PSBK_TOBAC     PHOTOSYSTEM II 4 KD REACTION     61                3.56
17. PSBK_SINAL     PHOTOSYSTEM II 4 KD REACTION     61                3.56
18. PSBK_ORYSA     PHOTOSYSTEM II 4 KD REACTION     61                3.56
19. PSBK_HORVU     PHOTOSYSTEM II 4 KD REACTION     61                3.56
20. RINBJPPHA      TRANSCRIPTIONAL ACTIVATOR RIN    62                3.56
21. HST_YEREN      HEAT-STABLE ENTEROTOXIN PRECU    71                3.56
22. CPB2_ECOLI     COPB PROTEIN (REPA2 PROTEIN).    84                3.56
23. YPB2_ECOLI     HYPOTHETICAL 10.0 KD PROTEIN.    87                3.56
24. Y622JPT4       HYPOTHETICAL 10.9 KD PROTEIN     92                3.56
25. E111_ADEM1     EARLY E1A 11 KD PROTEIN.         96                3.56
26. HG14_HUMAN     NONHISTONE CHROMOSOMAL PROTEI    99                3.56
27. HG14_BOVIN     NONHISTONE CHROMOSOMAL PROTEI   100                3.56
28. YKE4_YEAST     HYPOTHETICAL 11.4 KD PROTEIN    101                3.56
29. PT1_BACSU      PHOSPHOENOLPYRUVATE-PROTEIN P   102                3.56
30. CCMK_SYNP7     CARBON DIOXIDE CONCENTRATING    102                3.56
31. THI1_YEAST     THIOREDOXIN I (TR-I).           104                3.56
32. PUTX_PSEPU     PUTIDAREDOXIN (PDX).            106                3.56
33. INS1_XENLA     INSULIN 1 PRECURSOR.            106                3.56
34. YLC6_YEREN     HYPOTHETICAL 12.2 KD PROTEIN    110                3.56
35. INS_CAVPO      INSULIN PRECURSOR.              110                3.56
36. GVPK_HALHA     GVPK PROTEIN.                   113                3.56
37. Y13K_TYLCM     HYPOTHETICAL 13.3 KD PROTEIN    115                3.56
38. HV3N_HUMAN     IG HEAVY CHAIN V-III REGION (   119                3.56
39. LCA_MACRG      ALPHA-LACTALBUMIN (LACTOSE SY   121                3.56
40. TRAM_ECOLI     TRAM PROTEIN.                   127                3.56



1. US-08-249-182-4 (1-5) 

ENPL_CATRO ENDOPLASMIN HOMOLOG PRECURSOR (HEAT SHOCK 90 KD PR

ID   ENPL_CATRO STANDARD; PRT; 817 AA.
AC   P35016;
DT   01-FEB-1994 (REL. 28, CREATED)
DT   01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   ENDOPLASMIN HOMOLOG PRECURSOR (HEAT SHOCK 90 KD PROTEIN).
GN   HSP90.
OS   CATHARANTHUS ROSEUS (ROSY PERIWINKLE) (MADAGASCAR PERIWINKLE).
OC   EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;
OC   GENTIANALES; APOCYNACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=CP3A;
RM   94033337
RA   SCHROEDER G., BECK M., EICHEL J., VETTER H.P., SCHROEDER J.;
RL   PLANT MOL. BIOL. 23:583-594(1993).
CC   -!- SUBCELLULAR LOCATION: ENDOPLASMIC RETICULUM LUMEN.
CC   -!- SIMILARITY: BELONGS TO THE HEAT SHOCK PROTEIN HSP90 FAMILY.
DR   EMBL; L14594; CTRHSP90A.
KW   CHAPERONE; ENDOPLASMIC RETICULUM; HEAT SHOCK; GLYCOPROTEIN; SIGNAL.
FT   SIGNAL        1     29       POTENTIAL.
FT   CHAIN        21    795       ENDOPLASMIN HOMOLOG.
FT   CARBOHYD    111    111       POTENTIAL.
FT   CARBOHYD    306    306       POTENTIAL.
FT   CARBOHYD    416    416       POTENTIAL.
FT   CARBOHYD    456    456       POTENTIAL.
FT   CARBOHYD    624    624       POTENTIAL.
FT   SITE        814    817       PREVENT SECRETION FROM ER.
SQ   SEQUENCE   817 AA;  93491 MW;  3430345 CN;



Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

DSDAPVDPPKVEDKIGAVPNGLSTDSDVAKREAESMSHRNLRSDAEKFEFQAEVSRLHDIIINSLYSNKDIF 
40 50 60 70 80 X X 90 100 

LRELISNASDALDKIRFLALTDKEILGEGDTAK 
110 120 130 



2. US-08-249-182-4 (1-5)
RAL2_SCHPO RAL2 PROTEIN.

ID   RAL2_SCHPO STANDARD; PRT; 611 AA.
AC   P15258;
DT   01-APR-1990 (REL. 14, CREATED)
DT   01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   RAL2 PROTEIN.
GN   RAL2.
OS   SCHIZOSACCHAROMYCES POMBE (FISSION YEAST).
OC   EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   90066514
RA   FUKUI Y., MIYAKE S., SATOH M., YAMAMOTO M.;
RL   MOL. CELL. BIOL. 9:5617-5622(1989).
CC   -!- FUNCTION: IMPLICATED IN ACTIVATION OF THE RAS1 PROTEIN. IT IS
CC       PROBABLY A GDP-GTP EXCHANGE PROTEIN FOR RAS1.
DR   EMBL; M30827; SPRAL2.
DR   PIR; A33827; A33827.
KW   GUANINE-NUCLEOTIDE RELEASING FACTOR.
SQ   SEQUENCE   611 AA;  69847 MW;  1910377 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

GKLLLNGISDHEILTIERMHIPCLSRMLYKRHPAFQKINDRAVEKNQEAFQAEVSQLGPQLTDLPFSSIHST
430 440 450 460 470 480 490 



GSRALYNPYSFETCSAFLHYIYCGTLNGSYCTA 
500 510 520 530 
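The Swiss-Prot entries in this listing are line-oriented: each line starts with a two-letter code (ID, AC, DT, DE, GN, OS, OC, RN, RP, RC, RM, RA, RL, CC, DR, KW, FT, SQ) followed by the data. A minimal sketch of reading such lines is given below. The function names `parse_swissprot_lines` and `parse_id_line` are hypothetical, the sample text is a cleaned-up excerpt of the RAL2 entry (with the entry name read as RAL2_SCHPO, an assumption about the scanned text), and continuation handling, FT columns and the sequence block are deliberately left out.

```python
from collections import defaultdict


def parse_swissprot_lines(text):
    """Group line contents by their two-letter line code, in file order."""
    fields = defaultdict(list)
    for raw in text.splitlines():
        line = raw.rstrip()
        if len(line) < 2:
            continue  # skip blank lines left over from the scan
        code, data = line[:2], line[2:].strip()
        if code.isalpha() and code.isupper():
            fields[code].append(data)
    return dict(fields)


def parse_id_line(data):
    """Split an ID line into (entry name, data class, molecule type, length)."""
    head, molecule_type, length_part = [p.strip() for p in data.rstrip(".").split(";")]
    entry_name, data_class = head.split()
    length = int(length_part.split()[0])  # e.g. "611 AA" -> 611
    return entry_name, data_class, molecule_type, length


SAMPLE = """\
ID   RAL2_SCHPO     STANDARD;      PRT;   611 AA.
AC   P15258;
DE   RAL2 PROTEIN.
GN   RAL2.
KW   GUANINE-NUCLEOTIDE RELEASING FACTOR.
"""

entry = parse_swissprot_lines(SAMPLE)
print(parse_id_line(entry["ID"][0]))
print(entry["AC"], entry["DE"])
```

Keeping every line code as a list preserves repeated lines such as DT, RA or DR, which occur several times per entry in the listing.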



3. US-08-249-182-4 (1-5) 

IBMP_FMVD INCLUSION BODY MATRIX PROTEIN (VIROPLASMIN).

ID   IBMP_FMVD STANDARD; PRT; 512 AA.
AC   P09524;
DT   01-MAR-1989 (REL. 10, CREATED)
DT   01-MAR-1989 (REL. 10, LAST SEQUENCE UPDATE)
DT   01-AUG-1991 (REL. 19, LAST ANNOTATION UPDATE)
DE   INCLUSION BODY MATRIX PROTEIN (VIROPLASMIN).
GN   VI.
OS   FIGWORT MOSAIC VIRUS (STRAIN DXS) (FMV).
OC   VIRIDAE; DS-DNA NONENVELOPED VIRUSES; CAULIMOVIRIDAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   88040466
RA   RICHINS R.D., SCHOLTHOF H.B., SHEPHERD R.J.;
RL   NUCLEIC ACIDS RES. 15:8451-8466(1987).
CC   -!- FUNCTION: ENHANCES THE TRANSLATION OF DOWNSTREAM ORF'S ON

CC   -!- SUBCELLULAR LOCATION: CYTOPLASMIC INCLUSION BODIES.
CC   -!- THE INCLUSION BODIES ARE THE SITE OF VIRAL DNA SYNTHESIS, VIRION
CC       ASSEMBLY AND ACCUMULATION IN THE INFECTED CELL.
CC   -!- SIMILARITY: HIGH, WITH OTHER CAULIMOVIRUS VIROPLASMIN.
DR   EMBL; X06166; CAFMVXX.
DR   PIR; S01284; MHCVFM.
KW   TRANS-ACTING FACTOR; TRANSLATION REGULATION.
SQ   SEQUENCE   512 AA;  58207 MW;  1378914 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||

FKPDYLRAASNGSSWFAVYKGPNKEFFTEWEIVADICKKRQKSKRFRSKEQAEVSISLYNKDIQDPVNFLRP
150 160 170 180 190 X X 200 210 



VKLVKEERAAQPLKFKAI AAEQTIQFDEFRQIW 
220 230 240 



4. US-08-249-182-4 (1-5) 

CPT7_RAT CYTOCHROME P450 XVIIA1 (P450-C17) (EC 1.14.99.9) (

ID   CPT7_RAT STANDARD; PRT; 507 AA.
AC   P11715;
DT   01-OCT-1989 (REL. 12, CREATED)
DT   01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)
DT   01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE   CYTOCHROME P450 XVIIA1 (P450-C17) (EC 1.14.99.9) (STEROID 17-ALPHA-
DE   HYDROXYLASE/17,20 LYASE).
GN   CYP17.
OS   RATTUS NORVEGICUS (RAT).
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC   EUTHERIA; RODENTIA.
RN   [1]
RP   SEQUENCE FROM N.A.
RM   89295447
RA   FEVOLD H.R., LORENCE M.C., MCCARTHY J.L., TRANT J.M., KAGIMOTO M.,
RA   WATERMAN M.R., MASON J.I.;
RL   MOL. ENDOCRINOL. 3:968-975(1989).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   TISSUE=TESTIS;
RM   89076306
RA   NAMIKI M., KITAMURA M., BUCZKO E., DUFAU M.L.;
RL   BIOCHEM. BIOPHYS. RES. COMMUN. 157:705-712(1988).
RN   [3]
RP   SEQUENCE OF 271-507 FROM N.A.
RM   88280759
RA   NISHIHARA M., WINTERS C.A., BUZKO E., WATERMAN M.R., DUFAU M.L.;
RL   BIOCHEM. BIOPHYS. RES. COMMUN. 154:151-158(1988).
RN   [4]
RP   SEQUENCE OF 273-507 FROM N.A.
RM   90046678
RA   MELLON S.H., VAISSE C.;
RL   PROC. NATL. ACAD. SCI. U.S.A. 86:7775-7779(1989).
CC   -!- FUNCTION: CYTOCHROMES P450 ARE A GROUP OF HEME-THIOLATE
CC       MONOOXYGENASES. THEY OXIDIZE A VARIETY OF STRUCTURALLY UNRELATED
CC       COMPOUNDS, INCLUDING STEROIDS, FATTY ACIDS, AND XENOBIOTICS.
CC   -!- CATALYTIC ACTIVITY: A STEROID + AH(2) + O(2) = A 17-ALPHA-
CC       HYDROXYSTEROID + A + H(2)O.
DR   EMBL; M31681; RNP45017.
DR   EMBL; M21208; RRCYPC17.
DR   EMBL; M27282; RNP450C1.
DR   PIR; A27659; A27659.
DR   PIR; A30828; A30828.
DR   PIR; S16719; S16719.
DR   PROSITE; PS00086; CYTOCHROME_P450.
KW   ELECTRON TRANSPORT; OXIDOREDUCTASE; MONOOXYGENASE; MEMBRANE;
KW   HEME; STEROIDOGENESIS.
FT   BINDING     441    441       HEME.
SQ   SEQUENCE   507 AA;  57250 MW;  1347075 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
FVFTALLLQRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRQAWMDAQAEVST
460 470 480 490 500 X X 



5. US-08-249-182-4 (1-5) 

VIPR_RAT VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 PRECU

ID   VIPR_RAT STANDARD; PRT; 459 AA.
AC   P30083;
DT   01-APR-1993 (REL. 25, CREATED)
DT   01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 PRECURSOR (VIP-R-1).
OS   RATTUS NORVEGICUS (RAT).
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC   EUTHERIA; RODENTIA.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   TISSUE=LUNG;
RM   92232309
RA   ISHIHARA T., SHIGEMOTO R., MORI K., TAKAHASHI K., NAGATA S.;
RL   NEURON 8:811-819(1992).
CC   -!- FUNCTION: THIS IS A RECEPTOR FOR VIP. THE ACTIVITY OF THIS
CC       RECEPTOR IS MEDIATED BY G PROTEINS WHICH ACTIVATE ADENYLYL
CC       CYCLASE.
CC   -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC   -!- TISSUE SPECIFICITY: IN LIVER, LUNG, INTESTINES, THYMUS AND BRAIN
CC       (MOSTLY IN THE CEREBRAL CORTEX AND HIPPOCAMPUS).
CC   -!- SIMILARITY: BELONGS TO FAMILY 2 OF G-PROTEIN COUPLED RECEPTORS.
DR   EMBL; M86835; RNVASREC.
DR   PIR; JH0594; JH0594.
DR   PROSITE; PS00649; G_PROTEIN_RECEP_F2_1.
DR   PROSITE; PS00650; G_PROTEIN_RECEP_F2_2.
KW   G-PROTEIN COUPLED RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN; SIGNAL.



FT   SIGNAL        1     30       POTENTIAL.
FT   CHAIN        31    459       VASOACTIVE INTESTINAL POLYPEPTIDE
FT                                RECEPTOR 1.
FT   DOMAIN       31    143       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    144    168       1 (POTENTIAL).
FT   DOMAIN      169    175       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM    176    195       2 (POTENTIAL).
FT   DOMAIN      196    217       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    218    241       3 (POTENTIAL).
FT   DOMAIN      242    255       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM    256    277       4 (POTENTIAL).
FT   DOMAIN      278    294       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    295    318       5 (POTENTIAL).



vui in « n 

FT   TRANSMEM    342    361       6 (POTENTIAL).
FT   DOMAIN      362    373       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    374    393       7 (POTENTIAL).
FT   DOMAIN      394    457       CYTOPLASMIC (POTENTIAL).
FT   CARBOHYD     58     58       POTENTIAL.
FT   CARBOHYD     69     69       POTENTIAL.
FT   CARBOHYD    100    100       POTENTIAL.
FT   CARBOHYD    290    290       POTENTIAL.
SQ   SEQUENCE   457 AA;  51547 MW;  1161417 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
RKWRRWHLQGVLGWNPKYRHPSGGSNGATCSTQVSMLTRVSPGARRSSSFQAEVSLV
410 420 430 440 450 X 



7. US-08-249-182-4 (1-5) 

NOLF_RHIME NODULATION PROTEIN NOLF.

ID   NOLF_RHIME STANDARD; PRT; 367 AA.
AC   P25196;
DT   01-MAY-1992 (REL. 22, CREATED)
DT   01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT   01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE   NODULATION PROTEIN NOLF.
GN   NOLF.
OS   RHIZOBIUM MELILOTI.
OG   PLASMID SYM PRME41B.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC   RHIZOBIACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=AK 631;
RM   91360053
RA   BAEV N., ENDRE G., PETROVICS G., BANFALVI Z., KONDOROSI A.;
RL   MOL. GEN. GENET. 228:113-124(1991).
CC   -!- FUNCTION: INVOLVED IN THE PRODUCTION OF MEDICAGO-SPECIFIC
CC       NODULATION SIGNAL MOLECULE.
DR   EMBL; X58632; RMPRME41B.
DR   PIR; S16562; S16562.
KW   PLASMID; NODULATION.
SQ   SEQUENCE   367 AA;  39541 MW;  661978 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
RGGCLSAQTELAEAVLERNTRLGERGAASEATRLAALADVLDLRAHVRSKQAEVSDAERSLSHAEVRAEFGG
120 130 140 150 160 X 170 180

VIRARSVEEGQTVPLNTQLMTIVELNRLEVDAG
190 200 210 



8. US-08-249-182-4 (1-5) 

YM2_STRCO MINI-CIRCLE HYPOTHETICAL 13.3 KD PROTEIN.



r i 




OJ.7 




inurLnsmi vruicraiinL/. 


FT   TRANSMEM    344    363       6 (POTENTIAL).
FT   DOMAIN      364    375       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    376    395       7 (POTENTIAL).
FT   DOMAIN      396    459       CYTOPLASMIC (POTENTIAL).
FT   CARBOHYD     58     58       POTENTIAL.
FT   CARBOHYD     69     69       POTENTIAL.
FT   CARBOHYD    100    100       POTENTIAL.
FT   CARBOHYD    292    292       POTENTIAL.
SQ   SEQUENCE   459 AA;  52057 MW;  1186786 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.75
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
QAEVS
|||||
RKWRRWHLQGVLGWSSKSQHPWGGSNGATCSTQVSMLTRVSPSARRSSSFQAEVSLV
410 420 430 440 450 X X 



6. US-08-249-182-4 (1-5)

VIPR_HUMAN VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 PRECU

ID   VIPR_HUMAN STANDARD; PRT; 457 AA.
AC   P32241;
DT   01-OCT-1993 (REL. 27, CREATED)
DT   01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   VASOACTIVE INTESTINAL POLYPEPTIDE RECEPTOR 1 PRECURSOR (VIP-R-1).
OS   HOMO SAPIENS (HUMAN).
OC   EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC   EUTHERIA; PRIMATES.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   TISSUE=INTESTINE;
RM   93290641
RA   SREEDHARAN S.P., PATEL D.R., HUANG J.-X., GOETZL E.J.;
RL   BIOCHEM. BIOPHYS. RES. COMMUN. 193:546-553(1993).
CC   -!- FUNCTION: THIS IS A RECEPTOR FOR VIP. THE ACTIVITY OF THIS
CC       RECEPTOR IS MEDIATED BY G PROTEINS WHICH ACTIVATE ADENYLYL
CC       CYCLASE.
CC   -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC   -!- TISSUE SPECIFICITY: IN LUNG, HT29 COLONIC EPITHELIAL CELLS,
CC       RAJI B-LYMPHOBLASTS. LESSER EXTENT IN BRAIN, HEART, KIDNEY,
CC       LIVER AND PLACENTA.
CC   -!- SIMILARITY: BELONGS TO FAMILY 2 OF G-PROTEIN COUPLED RECEPTORS.
DR   EMBL; L13288; HSVIPR1.
DR   PIR; JN0604; JN0604.
DR   PROSITE; PS00649; G_PROTEIN_RECEP_F2_1.
DR   PROSITE; PS00650; G_PROTEIN_RECEP_F2_2.
KW   G-PROTEIN COUPLED RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN; SIGNAL.



FT   SIGNAL        1     30       POTENTIAL.
FT   CHAIN        31    457       VASOACTIVE INTESTINAL POLYPEPTIDE
FT                                RECEPTOR 1.
FT   DOMAIN       31    142       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    143    167       1 (POTENTIAL).
FT   DOMAIN      168    174       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM    175    194       2 (POTENTIAL).
FT   DOMAIN      195    216       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    217    240       3 (POTENTIAL).
FT   DOMAIN      241    254       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM    255    276       4 (POTENTIAL).
FT   DOMAIN      277    292       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    293    316       5 (POTENTIAL).



[Histogram fragment: axis ticks 100, 50, 10, 5 with SCORE and STDEV axes; the plotted points are not recoverable.]



PARAMETERS

Similarity matrix          Unitary    K-tuple               2
Mismatch penalty           1          Joining penalty       20
Gap penalty                1.00       Window size           5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40         Alignments to save    15
Optimized scores to save   0          Display context       50



SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           1       3         0.88

Times:     CPU            Total Elapsed
           00:00:48.91    00:00:52.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 3832

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.

The scores below are sorted by initial score.
Significance is calculated based on initial score.



6 100% similar sequences to the query sequence were found:

                                                       Init.  Opt.
    Sequence Name  Description                  Length Score  Score  Sig.  Frame

 1. PULA_KLEAE     PULLULANASE (EC 3.2.1.41) (AL  1096    5      5    4.52   0
 2. PRIS_DESVH     PRISMANE PROTEIN.               553    5      5    4.52   0
 3. PRIS_DESDE     PRISMANE PROTEIN.               544    5      5    4.52   0
 4. CPXL_PSESP     CYTOCHROME P450-TERP (EC 1.14   428    5      5    4.52   0
 5. YB5B_YEAST     HYPOTHETICAL 47.4 KD PROTEIN    418    5      5    4.52   0
 6. DBH_THETH      DNA-BINDING PROTEIN II.          95    5      5    4.52   0



The list of other best scores is:

                                                       Init.  Opt.
    Sequence Name  Description                  Length Score  Score  Sig.  Frame

               **** 3 standard deviations above mean ****

 7. C551_PSEST     CYTOCHROME C551.                 82    4      4    3.39   0
 8. CYC6_PLEBO     CYTOCHROME C6 (SOLUBLE CYTOCH    85    4      4    3.39   0
 9. CYC6_ANAVA     CYTOCHROME C6 (SOLUBLE CYTOCH    86    4      4    3.39   0
10. DBH_BACST      DNA-BINDING PROTEIN II (HB) (    90    4      4    3.39   0
11. DBH_BACSU      DNA-BINDING PROTEIN II (HB) (    92    4      4    3.39   0
12. PINO_ECOLI     PINO PROTEIN.                   105    4      4    3.39   0
13. GLN1_HETTL     NITROGEN FIXATION NIFHD REGIO   105    4      4    3.39   0
14. YP12_RTBVP     HYPOTHETICAL P12 PROTEIN (ORF   110    4      4    3.39   0
15. YP12_RTBV      HYPOTHETICAL P12 PROTEIN (ORF   110    4      4    3.39   0
16. CYC6_ANASO     CYTOCHROME C6 PRECURSOR (SOLU   111    4      4    3.39   0
17. CYC6_ANASP     CYTOCHROME C6 PRECURSOR (SOLU   111    4      4    3.39   0
18. NO75_PEA       EARLY NODULIN-75 PROTEIN (N-7   112    4      4    3.39   0
19. YCW8_YEAST     HYPOTHETICAL 12.4 KD PROTEIN    114    4      4    3.39   0
20. WNT3JPTST      WNT-3 PROTEIN (FRAGMENT).       123    4      4    3.39   0
21. CY2_RHOCA      CYTOCHROME C2 PRECURSOR.        137    4      4    3.39   0
22. YZP2_ECOLI     VERY HYPOTHETICAL 16.1 KD PRO   146    4      4    3.39   0
23. PETD_PROHO     CYTOCHROME B6-F COMPLEX SUBUN   160    4      4    3.39   0
24. MAT1_YEAST     MATING-TYPE PROTEIN ALPHA-1.    175    4      4    3.39   0
25. SPC4_CANFA     MICROSOMAL SIGNAL PEPTIDASE 1   179    4      4    3.39   0
26. SPC3_CANFA     MICROSOMAL SIGNAL PEPTIDASE 2   191    4      4    3.39   0
27. COAG_LIMPO     COAGULOGEN PRECURSOR (CONTAIN   195    4      4    3.39   0
28. RL22_SPIOL     50S RIBOSOMAL PROTEIN L22 (RI   199    4      4    3.39   0
29. COAT_BYDVN     COAT PROTEIN.                   204    4      4    3.39   0
30. COAT_PLRVH     COAT PROTEIN.                   208    4      4    3.39   0
31. COAT_PLRVR     COAT PROTEIN.                   208    4      4    3.39   0
32. COAT_PLRV1     COAT PROTEIN.                   208    4      4    3.39   0
33. COAT_PLRV      COAT PROTEIN.                   208    4      4    3.39   0
34. HUPD_RHOCA     HUPD PROTEIN.                   210    4      4    3.39   0
35. CAT1_STAAU     CHLORAMPHENICOL ACETYLTRANSFE   216    4      4    3.39   0
36. CAT_ECOLI      CHLORAMPHENICOL ACETYLTRANSFE   219    4      4    3.39   0
37. UL01_HSV11     GLYCOPROTEIN L PRECURSOR.       224    4      4    3.39   0
38. FRDB_WOLSU     FUMARATE REDUCTASE IRON-SULFU   239    4      4    3.39   0
39. DNAC_ECOLI     DNAC PROTEIN.                   245    4      4    3.39   0
40. NU6N_WHEAT     NADH-UBIQUINONE OXIDOREDUCTAS   247    4      4    3.39   0



1. US-08-249-182-3 (1-5)

PULA_KLEAE PULLULANASE (EC 3.2.1.41) (ALPHA-DEXTRIN ENDO-1,6-

ID   PULA_KLEAE STANDARD; PRT; 1096 AA.
AC   P07811;
DT   01-AUG-1988 (REL. 08, CREATED)
DT   01-AUG-1988 (REL. 08, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   PULLULANASE (EC 3.2.1.41) (ALPHA-DEXTRIN ENDO-1,6-ALPHA-GLUCOSIDASE)
DE   (PULLULAN 6-GLUCANOHYDROLASE).
GN   PULA.
OS   KLEBSIELLA AEROGENES.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC   ENTEROBACTERIACEAE.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=W70;
RM   87194626
RA   KATSURAGI N., TAKIZAWA N., MUROOKA Y.;
RL   J. BACTERIOL. 167:2301-2306(1987).
CC   -!- CATALYTIC ACTIVITY: STARCH-DEBRANCHING ENZYME, HYDROLYZES
CC       (1-6)-ALPHA-GLUCOSIDIC LINKAGES IN PULLULAN AND STARCH TO
CC       FORM MALTOTRIOSE.
CC   -!- SUBUNIT: HOMOTRIMER.
CC   -!- SUBCELLULAR LOCATION: ATTACHED TO THE MEMBRANE BY A LIPID
CC       ANCHOR (PROBABLE).
CC   -!- SIMILARITY: BELONGS TO FAMILY 13 OF GLYCOSYL HYDROLASES, ALSO
CC       KNOWN AS THE ALPHA-AMYLASE FAMILY.
DR   EMBL; M16187; KAPULA.
DR   PIR; A26879; A26879.
DR   PROSITE; PS00013; PROKAR_LIPOPROTEIN.
KW   HYDROLASE; GLYCOSIDASE; MEMBRANE; LIPOPROTEIN; SIGNAL.
FT   SIGNAL        1     19
FT   CHAIN        20   1096       PULLULANASE.
FT   LIPID        20     20       N-ACYL DIGLYCERIDE.
FT   ACT_SITE    694    694       BY SIMILARITY.
FT   ACT_SITE    723    723       BY SIMILARITY.
FT   ACT_SITE    851    851       BY SIMILARITY.
SQ   SEQUENCE   1096 AA;  119335 MW;  5852107 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||

GENKPIVRLYYSHSSKVAADSNGEFSDKYVKLTPTTVNQQVSMRFPHLASYPAFKLPDDVNVDELLQGDDGG 
210 220 230 240 250 260 X 270 280 

IAESDGILSLSHPGADRRRAGRYLCRRAEALSY 
290 300 310 



2. US-08-249-182-3 (1-5) 

PRIS_DESVH PRISMANE PROTEIN.

ID   PRIS_DESVH STANDARD; PRT; 553 AA.
AC   P31101;
DT   01-JUL-1993 (REL. 26, CREATED)
DT   01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   PRISMANE PROTEIN.
OS   DESULFOVIBRIO VULGARIS (STRAIN HILDENBOROUGH).
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA;
OC   SULFATE- OR SULFUR-REDUCING DISSIMILATORY.
RN   [1]
RP   SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RM   92394141
RA   STOKKERMANS J.P.W.G., PIERIK A.J., WOLBERT R.B.G., HAGEN W.R.,
RA   VAN DONGEN W.M.A.M., VEEGER C.;
RL   EUR. J. BIOCHEM. 208:435-442(1992).
RN   [2]
RP   CHARACTERIZATION, AND SEQUENCE OF 1-14.
RC   STRAIN=NCIB 8303;
RM   92298997
RA   PIERIK A.J., WOLBERT R.B.G., MUTSAERS P.H.A., HAGEN W.R., VEEGER C.;
RL   EUR. J. BIOCHEM. 206:697-704(1992).
RN   [3]
RP   CHARACTERIZATION.
RM   92298998
RA   PIERIK A.J., HAGEN W.R., DUNHAM W.R., SANDS R.H.;
RL   EUR. J. BIOCHEM. 206:705-719(1992).
CC   -!- FUNCTION: PROBABLE ENZYME OF YET UNCHARACTERIZED ACTIVITY.
CC   -!- COFACTOR: CONTAINS A [6FE-6S] PRISMANE CLUSTER THAT MIGHT OCCUR
CC       IN 4 DIFFERENT REDOX STATES: [6FE-6S]3+; [6FE-6S]4+; [6FE-6S]5+
CC       AND [6FE-6S]6+.
CC   -!- SUBUNIT: MONOMER.
CC   -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
DR   EMBL; Z11707; DVPRIS.
KW   ELECTRON TRANSPORT; IRON-SULFUR.
FT   METAL         3      3       IRON-SULFUR (6FE-6S) (POTENTIAL).
FT   METAL         6      6       IRON-SULFUR (6FE-6S) (POTENTIAL).
FT   METAL        15     15       IRON-SULFUR (6FE-6S) (POTENTIAL).
FT   METAL        21     21       IRON-SULFUR (6FE-6S) (POTENTIAL).
SQ   SEQUENCE   553 AA;  60163 MW;  1555801 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||

E I TGVNIGVGKNPGI L ISGHDLKDHAELLKQTEGTGVDVYTHGEHLPANYYPAFKKYPHFVGNYGGSyWQGN 
230 240 250 260 270 X 280 290 



PEFESFNGPILLTTNCLVPLKKENTYLDRLYTT 
300 310 320 



3. US-08-249-182-3 (1-5)

PRIS_DESDE PRISMANE PROTEIN.

ID   PRIS_DESDE STANDARD; PRT; 544 AA.
AC   Q01770;
DT   01-JUL-1993 (REL. 26, CREATED)
DT   01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT   01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE   PRISMANE PROTEIN.
OS   DESULFOVIBRIO DESULFURICANS.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA;
OC   SULFATE- OR SULFUR-REDUCING DISSIMILATORY.
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=ATCC 27774;
RM   92379097
RA   STOKKERMANS J.P.W.G., VAN DEN BERG W.A.M., VAN DONGEN W.M.A.M.,
RA   VEEGER C.;
RL   BIOCHIM. BIOPHYS. ACTA 1132:83-87(1992).
RN   [2]
RP   PARTIAL SEQUENCE OF 1-36, AND CHARACTERIZATION.
RC   STRAIN=ATCC 27774;
RM   92165800
RA   MOURA I., TAVARES P., MOURA J.J.G., RAVI N., HUYNH B.H., LIU M.-Y.,
RA   LE GALL J.;
RL   J. BIOL. CHEM. 267:4489-4496(1992).
CC   -!- FUNCTION: PROBABLE ENZYME OF YET UNCHARACTERIZED ACTIVITY.
CC   -!- COFACTOR: CONTAINS A [6FE-6S] PRISMANE CLUSTER THAT MIGHT OCCUR
CC       IN 4 DIFFERENT REDOX STATES: [6FE-6S]3+; [6FE-6S]4+; [6FE-6S]5+
CC       AND [6FE-6S]6+.
CC   -!- SUBUNIT: MONOMER.
CC   -!- SUBCELLULAR LOCATION: CYTOPLASMIC.
DR   EMBL; Z11975; DSPRISMAN.
DR   PIR; A42396; A42396.

DR PIR; A42396; A42396. 



Vn I IMF Jbigu/i 

KW   ELECTRON TRANSPORT; IRON-SULFUR.



INI 1 _nt 1 


u 


A 
U 








Fit 1 hL 


L 

0 


L 

0 


TDfHI-CIII CUD 


lort-ob) 


IrUibNI IAU i 


nt i flL 


Q 

7 


o 

7 


TDDhl-CIII DID 


(ort-oSJ 


(POTENTIAL) . 


FT   METAL        18     18       IRON-SULFUR (6FE-6S) (POTENTIAL).
FT   METAL        24     24       IRON-SULFUR (6FE-6S) (POTENTIAL).
SQ   SEQUENCE   544 AA;  58528 MW;  1518483 CN;





Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||
EITKVNIGVGSNPGILISGHDLRDLEMLLKQTEGTGVDVYTHSEMLPAHYYPAFKKYAHFKGNYGNAWMKQK
230 240 250 260 270 X 280 290 

EEFESFNGPVLLTTNCLVPPKDSYKDRVYTTGI 
300 310 320 



4. US-08-249-182-3 (1-5)

CPXL_PSESP CYTOCHROME P450-TERP (EC 1.14.-.-).

ID   CPXL_PSESP STANDARD; PRT; 428 AA.
AC   P33006;
DT   01-OCT-1993 (REL. 27, CREATED)
DT   01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT   01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE   CYTOCHROME P450-TERP (EC 1.14.-.-).
GN   CYP108 OR TERPC.
OS   PSEUDOMONAS SP.
OC   PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC   PSEUDOMONADACEAE.
RN   [1]
RP   SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RM   92332528
RA   PETERSON J.A., LU J.-Y., GEISSELSODER J., GRAHAM-LORENCE S.,
RA   CARMONA C., WITNEY F., LORENCE M.C.;
RL   J. BIOL. CHEM. 267:14193-14203(1992).
CC   -!- FUNCTION: INVOLVED IN A ALPHA-TERPINEOL OXIDATION SYSTEM.
CC   -!- SIMILARITY: MEMBER OF THE CYTOCHROME P-450 FAMILY.
DR   EMBL; M91440; PSTERP.
DR   EMBL; S39894; S39894.
DR   PIR; S27653; S27653.
DR   PROSITE; PS00086; CYTOCHROME_P450.
KW   OXIDOREDUCTASE; MONOOXYGENASE; ELECTRON TRANSPORT; MEMBRANE; HEME.
FT   BINDING     377    377       HEME (BY SIMILARITY).
SQ   SEQUENCE   428 AA;  47922 MW;  907644 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||
MDARATIPEHIARTVILPQGYADDEVIYPAFKWLRDEQPLAMAHIEGYDPMWIATKHADVMSIGKQPGLFSN
10 20 30 X 40 50 60 70 



AEGSEILYDQ 
80 



YB5B_YEAST HYPOTHETICAL 47.4 KD PROTEIN IN KIP1-SEC17 INTERGE

ID YB5B_YEAST STANDARD; PRT; 418 AA.
AC P34220;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 47.4 KD PROTEIN IN KIP1-SEC17 INTERGENIC REGION.
GN YBL0511.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=S288C;
RA SCHERENS B., EL BAKKOURY M., VIERENDEELS F., DUBOIS E., MESSENGUY F.;
RL YEAST 9:1355-1371(1993).
DR EMBL; Z23261; SCIIL260.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 418 AA; 47390 MW; 911132 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||

SLRTEENLAVVKQIPTERLLLETDAPWCEIKRTHASFQYLAKYQEVRDFEYPAFKSVKKNKLADKLNAEELY 
310 320 330 340 350 X 360 370 

MVKGRNEPCNNESVAIVVSEVKDVDLATLIDTT 
380 390 400 



6. US-08-249-182-3 (1-5)

DBH_THETH DNA-BINDING PROTEIN II.

ID DBH_THETH STANDARD; PRT; 95 AA.
AC P19436;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-MAY-1991 (REL. 18, LAST ANNOTATION UPDATE)
DE DNA-BINDING PROTEIN II.
OS THERMUS AQUATICUS (SUBSP. THERMOPHILUS).
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC UNCERTAIN.
RN [1]
RP SEQUENCE.
RM 91032203
RA ZIERER R., CHOLI D.;
RL FEBS LETT. 273:59-62(1990).
CC -!- FUNCTION: THIS PROTEIN BELONGS TO THE HISTONE LIKE FAMILY OF
CC PROKARYOTIC DNA-BINDING PROTEINS WHICH ARE CAPABLE OF WRAPPING
CC DNA AND TO STABILIZE DNA FROM DENATURATION UNDER EXTREME
CC ENVIRONMENTAL CONDITIONS.
CC -!- SUBUNIT: HOMODIMER.
DR PIR; S12888; S12888.
DR PROSITE; PS00045; HISTONE_LIKE.
KW DNA-BINDING; DNA CONDENSATION.
SQ SEQUENCE 95 AA; 10163 MW; 48066 CN;

Initial Score = 5 Optimized Score = 5 Significance = 4.52
Residue Identity = 100% Matches = 5 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
|||||

DALLAKVEEALANGSKVSLTGFGTFEVRKRKARTGVKPGTKEKIKIPATQYPAFKPGKALKDKVK 
40 50 60 70 80 X 90 



7. US-08-249-182-3 (1-5)

C551_PSEST CYTOCHROME C551.

ID C551_PSEST STANDARD; PRT; 82 AA.
AC P00101;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE CYTOCHROME C551.
GN NIRM.
OS PSEUDOMONAS STUTZERI (PSEUDOMONAS PERFECTOMARINA).
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC PSEUDOMONADACEAE.
RN [1]
RP SEQUENCE.
RC STRAIN=221;
RM 73224976
RA AMBLER R.P., WYNN M.;
RL BIOCHEM. J. 131:485-498(1973).
RN [2]
RP STRUCTURE BY NMR.
RC STRAIN=ATCC 17588;
RM 93002990
RA CAI M., BRADFORD E.G., TIMKOVICH R.;
RL BIOCHEMISTRY 31:8603-8612(1992).
CC -!- FUNCTION: ELECTRON DONOR FOR CYTOCHROME CD1 IN NITRITE AND NITRATE
CC RESPIRATION.
CC -!- SUBCELLULAR LOCATION: PERIPLASTIC.
CC -!- SIMILARITY: 11 DIFFERENCES WITH STRAIN ZOBELL.
DR PIR; A00093; CCPS5S.
DR PROSITE; PS00190; CYTOCHROME_C.
KW ELECTRON TRANSPORT; OXIDATIVE PHOSPHORYLATION; HEME; PERIPLASTIC.
FT BINDING 12 12 HEME (COVALENT).
FT BINDING 15 15 HEME (COVALENT).
FT METAL 16 16 IRON (HEME AXIAL LIGAND).
FT METAL 61 61 IRON (HEME AXIAL LIGAND).
SQ SEQUENCE 82 AA; 8612 MW; 33580 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

9DGEALFKSKPCAACHSIDAKLVGPAFKEVAAKYAGQDGAADLLAGHIKNGSQGVWGPIPMPPNPVTEEEAK 
10 20 X 30 40 50 60 70 

ILAEWI 



8. US-08-249-182-3 (1-5)

CYC6_PLEBO CYTOCHROME C6 (SOLUBLE CYTOCHROME F) (CYTOCHROME C

ID CYC6_PLEBO STANDARD; PRT; 85 AA.
AC P00117;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-MAR-1989 (REL. 10, LAST ANNOTATION UPDATE)
DE CYTOCHROME C6 (SOLUBLE CYTOCHROME F) (CYTOCHROME C553).
OS PLECTONEMA BORYANUM.
OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA;
OC CYANOBACTERIA (BLUE-GREEN ALGAE); NOSTOCALES.
RN [1]
RP SEQUENCE.
RC STRAIN=CCAP 1462/2;
RM 78023897
RA AITKEN A.;
RL EUR. J. BIOCHEM. 78:273-279(1977).
CC -!- FUNCTION: CYTOCHROME C6 IS A MONOHEME MONOMER. IT FUNCTIONS AS AN
CC ELECTRON CARRIER BETWEEN MEMBRANE-BOUND CYTOCHROME F AND P700 IN
CC THE PHOTOPHOSPHORYLATION CHAIN IN CHLOROPLASTS AND ALGAE. IT
CC SUBSTITUTES FOR PLASTOCYANIN IN COPPER-DEFICIENT BLUE-GREEN ALGAE
CC AND IN THE CHLOROPLASTS OF SOME EUKARYOTE ALGAE.
DR PIR; A00109; CCPB6.
DR PROSITE; PS00190; CYTOCHROME_C.
KW ELECTRON TRANSPORT; PHOTOSYNTHESIS; HEME.
FT BINDING 14 14 HEME (COVALENT).
FT BINDING 17 17 HEME (COVALENT).
FT METAL 18 18 IRON (HEME AXIAL LIGAND).
FT METAL 58 58 IRON (HEME AXIAL LIGAND).
SQ SEQUENCE 85 AA; 8576 MW; 34505 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

KVFNANCAACHASGGGQINGAKTLKKNALTANGKDTVEAIVAQVTNGKGAHPAFKGRLSDDQIQSVALYVLD 
10 20 30 40 50 60 X 70 

KAEKGW 
80 



9. US-08-249-182-3 (1-5)

CYC6_ANAVA CYTOCHROME C6 (SOLUBLE CYTOCHROME F) (CYTOCHROME C

ID CYC6_ANAVA STANDARD; PRT; 86 AA.
AC P00113;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-MAR-1989 (REL. 10, LAST ANNOTATION UPDATE)
DE CYTOCHROME C6 (SOLUBLE CYTOCHROME F) (CYTOCHROME C553).
OS ANABAENA VARIABILIS.
OC PROKARYOTA; GRACILICUTES; OXYPHOTOBACTERIA;
OC CYANOBACTERIA (BLUE-GREEN ALGAE); NOSTOCALES.
RN [1]
RP SEQUENCE.
RM 82265568
RA BEECHER J., MARGOLIASH E.;
RL UNPUBLISHED RESULTS, CITED BY:
RL ULRICH E.L., KROGMANN D.W., MARKLEY J.L.;
RL J. BIOL. CHEM. 257:9356-9364(1982).
RN [2]
RP SEQUENCE OF 1-22; 30-39 AND 56-86.
RM 77056395
RA AITKEN A.;
RL NATURE 263:793-796(1976).
CC -!- FUNCTION: CYTOCHROME C6 IS A MONOHEME MONOMER. IT FUNCTIONS AS AN
CC ELECTRON CARRIER BETWEEN MEMBRANE-BOUND CYTOCHROME F AND P700 IN
CC THE PHOTOPHOSPHORYLATION CHAIN IN CHLOROPLASTS AND ALGAE. IT
CC SUBSTITUTES FOR PLASTOCYANIN IN COPPER-DEFICIENT BLUE-GREEN ALGAE
CC AND IN THE CHLOROPLASTS OF SOME EUKARYOTE ALGAE.
DR PIR; A00105; CCAI6.
DR PROSITE; PS00190; CYTOCHROME_C.
KW ELECTRON TRANSPORT; PHOTOSYNTHESIS; HEME.
FT BINDING 14 14 HEME (COVALENT).
FT BINDING 17 17 HEME (COVALENT).
FT METAL 18 18 IRON (HEME AXIAL LIGAND).
FT METAL 58 58 IRON (HEME AXIAL LIGAND).
FT CONFLICT 82 82 E -> D (IN REF. 2).
FT CONFLICT 84 84 E -> D (IN REF. 2).
SQ SEQUENCE 86 AA; 8973 MW; 36422 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

KIFSANCASCHAGGKNLGVAQKTLKKADLEKYGAYSAMAIGA8VTNGKNANPAFKGRLKPEEIZBVAAYVLG 
10 20 30 40 50 60 X 70 

KAEAEUK 
80 



10. US-08-249-182-3 (1-5)

DBH_BACST DNA-BINDING PROTEIN II (HB) (HU).

ID DBH_BACST STANDARD; PRT; 90 AA.
AC P02346; P08822;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE DNA-BINDING PROTEIN II (HB) (HU).
GN HBSU.
OS BACILLUS STEAROTHERMOPHILUS, BACILLUS CALDOLYTICUS, AND BACILLUS
OS CALDOTENAX.
OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC SPECIES=B.STEAROTHERMOPHILUS, B.CALDOLYTICUS, AND B.CALDOTENAX;
RM 92354934
RA PADAS P.M., WILSON K.S., VORGIAS C.E.;
RL GENE 117:39-44(1992).
RN [2]
RP SEQUENCE.
RC SPECIES=B.STEAROTHERMOPHILUS;
RM 83160950
RA KIMURA M., WILSON K.S.;
RL J. BIOL. CHEM. 258:4007-4011(1983).
RN [3]
RP SEQUENCE OF 1-39.
RC SPECIES=B.CALDOLYTICUS;
RM 87184910
RA BECK A., DIJK J., REINHARDT R.;
RL BIOL. CHEM. HOPPE-SEYLER 368:121-130(1987).
RN [4]
RP X-RAY CRYSTALLOGRAPHY (3 ANGSTROMS).
RC SPECIES=B.STEAROTHERMOPHILUS;
RM 84270702
RA TANAKA I., APPELT K., DIJK J., WHITE S.W., WILSON K.S.;
RL NATURE 310:376-381(1984).
CC -!- FUNCTION: THIS PROTEIN BELONGS TO THE HISTONE LIKE FAMILY OF
CC PROKARYOTIC DNA-BINDING PROTEINS WHICH ARE CAPABLE OF WRAPPING
CC DNA AND TO STABILIZE DNA FROM DENATURATION UNDER EXTREME
CC ENVIRONMENTAL CONDITIONS.
CC -!- SUBUNIT: HOMODIMER.
DR EMBL; M73500; BSHUB1.
DR EMBL; M73501; BCHUB2.
DR EMBL; M73502; BCHUB3.
DR PIR; A02690; DNBS2F.
DR PIR; A26040; A26040.
DR PIR; JC1207; JC1207.
DR PIR; JC1206; JC1206.
DR PIR; JC1205; JC1205.
DR PROSITE; PS00045; HISTONE_LIKE.
KW DNA-BINDING; DNA CONDENSATION.
FT CONFLICT 9 9 A -> T (IN REF. 3).
FT CONFLICT 13 13 T -> I (IN REF. 3).
FT CONFLICT 30 30 D -> E (IN REF. 3).
SQ SEQUENCE 90 AA; 9716 MW; 38390 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

DAVFDSITEALRKGDKVQLIGFGNFEVRERAARKGRNP8TGEEHEIPASKVPAFKPGKALKDAVK 
30 40 50 60 70 X 80 90 



11. US-08-249-182-3 (1-5)

DBH_BACSU DNA-BINDING PROTEIN II (HB) (HU).

ID DBH_BACSU STANDARD; PRT; 92 AA.
AC P08821;
DT 01-NOV-1988 (REL. 09, CREATED)
DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE DNA-BINDING PROTEIN II (HB) (HU).
GN HBSU.
OS BACILLUS SUBTILIS, AND BACILLUS GLOBIGII.
OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC SPECIES=BACILLUS SUBTILIS;
RM 92339457
RA GROCH N., SCHINDELIN H., SCHOLTZ A.S., HAHN U., HEINEMANN U.;
RL EUR. J. BIOCHEM. 207:677-685(1992).
RN [2]
RP SEQUENCE FROM N.A.
RC SPECIES=B.SUBTILIS; STRAIN=JH642;
RM 91216992
RA MICKA B., GROCH N., HEINEMANN U., MARAHIEL M.A.;
RL J. BACTERIOL. 173:3191-3198(1991).
RN [3]
RP SEQUENCE FROM N.A.
RC SPECIES=B.SUBTILIS, AND B.GLOBIGII;
RM 92354934
RA PADAS P.M., WILSON K.S., VORGIAS C.E.;
RL GENE 117:39-44(1992).
RN [4]
RP SEQUENCE.
RC SPECIES=B.SUBTILIS;
RM 87246638
RA IMBER R., KIMURA M., GROCH N., HEINEMANN U.;
RL EUR. J. BIOCHEM. 165:547-552(1987).
CC -!- FUNCTION: THIS PROTEIN BELONGS TO THE HISTONE LIKE FAMILY OF



Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

MTDLPVVCRNGAGUUVCQAANGALDSKSRSHLEAETPAFKQ5TQHYFFKK9PLPVVESVEEEDDPGVAVENA 
10 20 30 X 40 50 60 70 



PSSSEDEENTVEESEEKA 
80 90 



13. US-08-249-182-3 (1-5)

GLN1_METTL NITROGEN FIXATION NIFHD REGION GLNB-LIKE PROTEIN 1

ID GLN1_METTL STANDARD; PRT; 105 AA.
AC P25771;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE NITROGEN FIXATION NIFHD REGION GLNB-LIKE PROTEIN 1 (ORF-105).
OS METHANOCOCCUS THERMOLITHOTROPHICUS.
OC PROKARYOTA; MENDOSICUTES; ARCHAEBACTERIA; METHANOCOCCALES;
OC METHANOCOCCACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 89343640
RA SOUILLARD N., SIBOLD L.;
RL MOL. MICROBIOL. 3:541-551(1989).
CC -!- FUNCTION: COULD BE INVOLVED IN THE REGULATION OF NITROGEN
CC FIXATION.
CC -!- SIMILARITY: STRONG, TO EUBACTERIAL P(II) PROTEINS.
DR EMBL; X13830; MTNIFHDK.
DR PIR; S06985; S06985.
DR PROSITE; PS00638; PII_GLNB_CTER.
KW TRANSCRIPTION REGULATION; NITROGEN FIXATION.
SQ SEQUENCE 105 AA; 11318 MW; 58675 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
||||

MKM I KA I VRPDK VDD I VDSLEN AGYPAFTKI NSVGRGKQGGLKVGE I FYDELPKTI LLI AVNDDEVDEVVGL 
10 20 X 30 40 50 60 70 



IKSSAST 



14. US-08-249-182-3 (1-5) 

YP12_RTBVP HYPOTHETICAL P12 PROTEIN (ORF 2).

ID YP12_RTBVP STANDARD; PRT; 110 AA.

AC P27499;

DT 01-AUG-1992 (REL. 23, CREATED)

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE HYPOTHETICAL P12 PROTEIN (ORF 2).

OS RICE TUNGRO BACILLIFORM VIRUS (ISOLATE PHILIPPINES) (RTBV).

OC VIRIDAE; NOT YET CLASSIFIED. 

RN [1] 

RP SEQUENCE FROM N.A. 

RM 91252204 



CC PROKARYOTIC DNA-BINDING PROTEINS WHICH ARE CAPABLE OF WRAPPING
CC DNA AND TO STABILIZE DNA FROM DENATURATION UNDER EXTREME
CC ENVIRONMENTAL CONDITIONS.
CC -!- SUBUNIT: HOMODIMER.
DR EMBL; X66448; BSHBSUHOM.
DR EMBL; X52418; BSHBSU.
DR EMBL; M73504; BSHUB5.
DR EMBL; M73503; BGHUB4.
DR PIR; JC1209; JC1209.
DR PIR; S0001S; S0001S.
DR PIR; S24373; S24373.
DR PIR; JC1208; JC1208.
DR PROSITE; PS00045; HISTONE_LIKE.
KW DNA-BINDING; DNA CONDENSATION.
FT CONFLICT 40 40 D -> K (IN REF. 4).
SQ SEQUENCE 92 AA; 9884 MW; 39386 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X X
YPAFK
 ||||

DSVFDT ILD ALKNGDK I QL I GFGNFEVRERSARKGRNP8TGEE I EI PASKVPAFKPGKALKDAVAGK 
30 40 50 60 70 X 80 90 



12. US-08-249-182-3 (1-5)

PINO_ECOLI PINO PROTEIN.

ID PINO_ECOLI STANDARD; PRT; 105 AA.
AC P03825;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE PINO PROTEIN.
GN PINO.
OS ESCHERICHIA COLI.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 82137054
RA OLINS P.O., NOMURA M.;
RL CELL 26:205-211(1981).
RN [2]
RP CHARACTERIZATION.
RM 92021770
RA GUZMAN E.C., JIMENEZ-SANCHEZ A.;
RL RES. MICROBIOL. 142:137-140(1991).
RN [3]
RP IDENTIFICATION OF PROTEIN.
RM 92041646
RA GUZMAN E.C., JIMENEZ-SANCHEZ A.;
RL J. BACTERIOL. 173:7409-7409(1991).
CC -!- FUNCTION: CALCIUM-BINDING PROTEIN THAT MAY BE REQUIRED FOR THE
CC INITIATION OF CHROMOSOME REPLICATION.
DR EMBL; V00344; ECRPSL.
DR PIR; A04450; QQECRP.
DR ECOGENE; EG11263; PINO.
KW CALCIUM-BINDING; DNA REPLICATION.
SQ SEQUENCE 105 AA; 11692 MW; 52846 CN;

Initial Score = 4 Optimized Score = 4 Significance = 3.39
Residue Identity = 80% Matches = 4 Mismatches = 1



RM 89034156
RA CHANG Y.-D., DICKSON R.C.;
RL J. BIOL. CHEM. 263:16696-16703(1988).
CC -!- FUNCTION: LAC12 MEDIATES THE TRANSPORT OF LACTOSE AND IT WOULD
CC APPEAR THAT THE PERMEASE WORKS IN PART BY A PROTON SYMPORT
CC MECHANISM.
CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC -!- SIMILARITY: BELONGS TO THE SUGAR TRANSPORTER FAMILY.
DR EMBL; X06997; KLLAC12.
DR PIR; A31776; A31776.
DR PROSITE; PS00216; SUGAR_TRANSPORT_1.
DR PROSITE; PS00217; SUGAR_TRANSPORT_2.
KW DUPLICATION; TRANSMEMBRANE; TRANSPORT; SUGAR TRANSPORT; SYMPORT.
SQ SEQUENCE 587 AA; 65383 MW; 1854896 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
 ||| | ||

SSSSSLGKKP I NT I EHKDTLGNDRDHKEALNSDNDNTSGLK I NGVP IEDAREE VLLPGYLSK8 YYKLYGLCF 
10 20 30 40 50 X 60 X 70 

ITYLCATMQGYDGALMGSIYTEDAYLKYYHLDINSSSG 
80 90 100 110 



9. US-08-249-182-5 (1-10) 

RRP1_DROME RECOMBINATION REPAIR PROTEIN 1 (DNA-(APURINIC OR A

ID RRP1_DROME STANDARD; PRT; 679 AA.
AC P27864;
DT 01-AUG-1992 (REL. 23, CREATED)
DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE RECOMBINATION REPAIR PROTEIN 1 (DNA-(APURINIC OR APYRIMIDINIC SITE)
DE LYASE) (EC 4.2.99.18).
GN RRP1.
OS DROSOPHILA MELANOGASTER (FRUIT FLY).
OC EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=OREGON-R;
RM 91319767
RA SANDER M., LOWENHAUPT K., RICH A.;
RL PROC. NATL. ACAD. SCI. U.S.A. 88:6780-6784(1991).
RN [2]
RP SEQUENCE FROM N.A.
RC STRAIN=OREGON-R;
RM 91360356
RA SANDER M., LOWENHAUPT K., LANE W.S., RICH A.;
RL NUCLEIC ACIDS RES. 19:4523-4529(1991).
CC -!- FUNCTION: COULD PROMOTE HOMOLOGOUS RECOMBINATION AT SITES OF DNA
CC DAMAGE. RRP1 HAS APURINIC ENDONUCLEASE AND DOUBLE-STRANDED DNA 3'
CC EXONUCLEASE ACTIVITIES AND CARRIES OUT SINGLE-STRANDED DNA
CC RENATURATION IN A MG(2+)-DEPENDENT MANNER.
CC -!- CATALYTIC ACTIVITY: ENDONUCLEOLYTIC CLEAVAGE NEAR APURINIC OR
CC APYRIMIDINIC SITES TO PRODUCTS WITH 5'-PHOSPHATE.

CC -!- SUBCELLULAR LOCATION: NUCLEAR. 

CC -!- SIMILARITY: BELONGS TO THE AP/EXOA FAMILY OF DNA REPAIR ENZYMES. 

DR EMBL; M62472; DMRRP1. 

DR FLYBASE; 04584; RRP1. 

DR PIR; S28366; S28366. 



SQ SEQUENCE 323 AA; 34973 MW; 594373 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPVFTPATGTPVVGGLSYREGLYITEEIYKTGLLSGLDIMEVNPTLGKTPEEVTRTVNTAVALTLSCFGTK 
240 250 260 270 280 X 290 X 300 



REGNHKPETDYLKPPK 
310 320 



7. US-08-249-182-5 (1-10)

VC13_SFVKA PROTEIN C13.

ID VC13_SFVKA STANDARD; PRT; 500 AA.
AC P32206;
DT 01-OCT-1993 (REL. 27, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE PROTEIN C13.
GN C13L.
OS SHOPE FIBROMA VIRUS (STRAIN KASZA) (SFV).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC LEPORIPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RA MASSUNG R.F., JAYARAMA V., MOYER R.W.;
RL SUBMITTED (XXX-1993) TO EMBL/GENBANK/DDBJ DATA BANKS.
CC -!- FUNCTION: TO VACCINIA VIRUS ORF F3.
DR EMBL; L22013; PXSUPHLSB.
SQ SEQUENCE 500 AA; 57475 MW; 1384311 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
  || | |||

TD1LLSLSLFDLRIILKSGELDVSSEDDVLLFIIKHSRHKKSNRRKSFTLVTEVLRYNYLSIYGKYKLTKNL 
160 170 180 190 200 X 210 X 220 

ARFGKNNN VELNENELPR I SYQHRFTNRRYTMVTPSSF 
230 240 250 260 



8. US-08-249-182-5 (1-10) 

LACP_KLULA LACTOSE PERMEASE.

ID LACP_KLULA STANDARD; PRT; 587 AA.

AC P07921;

DT 01-AUG-1988 (REL. 08, CREATED)

DT 01-AUG-1988 (REL. 08, LAST SEQUENCE UPDATE)

DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)

DE LACTOSE PERMEASE.

GN LAC12.

OS KLUYVEROMYCES LACTIS (YEAST).

OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.

RN [1] 

RP SEQUENCE FROM N.A. 



DR PIR; A26370; A26370.
DR MIM; 207800; TENTH EDITION.
DR PROSITE; PS00147; ARGINASE_1.
DR PROSITE; PS00148; ARGINASE_2.
KW UREA CYCLE; ARGININE METABOLISM; HYDROLASE; MAGNESIUM.
SQ SEQUENCE 322 AA; 34734 MW; 574214 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPSFTPATGTPVVGGLTYREGLYITEEIYKTGLLSGLDINEVNPSLGKTPEEVTRTVNTAVAITLACFGLA
240 250 260 270 280 X 290 X 300

REGNHKPIDYLNPPK 
310 320 



6. US-08-249-182-5 (1-10)

ARGI_RAT ARGINASE (EC 3.5.3.1).

ID ARGI_RAT STANDARD; PRT; 323 AA.
AC P07824;
DT 01-AUG-1988 (REL. 08, CREATED)
DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE ARGINASE (EC 3.5.3.1).
OS RATTUS NORVEGICUS (RAT).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=LIVER;
RM 88115364
RA OHTAKE A., TAKIGUCHI M., SHIGETO Y., AMAYA Y., KAWAMOTO S., MORI M.;
RL J. BIOL. CHEM. 263:2245-2249(1988).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=LIVER;
RM 87194847
RA KAWAMOTO S., AMAYA Y., MURAKAMI K., TOKUNAGA F., IWANAGA S.,
RA KOBAYASHI K., SAHEKI T., KIMURA S., MORI M.;
RL J. BIOL. CHEM. 262:6280-6283(1987).
CC -!- CATALYTIC ACTIVITY: L-ARGININE + H(2)O = L-ORNITHINE + UREA.
CC -!- PATHWAY: FIRST STEP IN ARGININE DEGRADATION IN THE UREA CYCLE.
CC -!- SUBUNIT: HOMOTRIMER.
CC -!- COFACTOR: MN(2+).
CC -!- INDUCTION: BY ARGININE OR HOMOARGININE.
DR EMBL; M17924; RNARG1.
DR EMBL; M17925; RNARG2.
DR EMBL; M17926; RNARG3.
DR EMBL; M17927; RNARG4.
DR EMBL; M17928; RNARG5.
DR EMBL; M17929; RNARG6.
DR EMBL; M17930; RNARG7.
DR EMBL; M17931; RNARG8.
DR EMBL; J02720; RNARGL.
DR PIR; A26702; A26702.
DR PIR; A28358; A28358.
DR PROSITE; PS00147; ARGINASE_1.
DR PROSITE; PS00148; ARGINASE_2.
KW UREA CYCLE; ARGININE METABOLISM; HYDROLASE; MAGNESIUM.



OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 93268094
RA COLLINS C.M., GUTMAN D.M., LAMAN H.;
RL MOL. MICROBIOL. 8:187-198(1993).
CC -!- FUNCTION: PROBABLY FACILITATING NICKEL INCORPORATION.
DR EMBL; L07039; KPURE.
DR PIR; S32937; S32937.
KW NICKEL.
SQ SEQUENCE 270 AA; 29953 MW; 374889 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||| |   ||

HLPPLKKGH9RTLDLRFQQAGGKTVLASA9HVGPLTv8RPFYPEEETCHLYLLHPPGGIVGGDELTISAHLA
10 20 30 40 X 50 X 60 70

PGCHTLITHPGASKFYRSSGAQALVR88LT 
80 90 100 



5. US-08-249-182-5 (1-10) 

ARGI_HUMAN ARGINASE (EC 3.5.3.1).

ID ARGI_HUMAN STANDARD; PRT; 322 AA.
AC P05089;
DT 13-AUG-1987 (REL. 05, CREATED)
DT 13-AUG-1987 (REL. 05, LAST SEQUENCE UPDATE)
DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE ARGINASE (EC 3.5.3.1).
GN ARG1.
OS HOMO SAPIENS (HUMAN).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; PRIMATES.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=LIVER;
RM 87092419
RA HARAGUCHI Y., TAKIGUCHI M., AMAYA Y., KAWAMOTO S., MATSUDA I.,
RA MORI M.;
RL PROC. NATL. ACAD. SCI. U.S.A. 84:412-415(1987).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=BLOOD;
RM 89016562
RA TAKIGUCHI M., HARAGUCHI Y., MORI M.;
RL NUCLEIC ACIDS RES. 16:8789-8802(1988).
CC -!- CATALYTIC ACTIVITY: L-ARGININE + H(2)O = L-ORNITHINE + UREA.
CC -!- PATHWAY: FIRST STEP IN ARGININE DEGRADATION IN THE UREA CYCLE.
CC -!- SUBUNIT: HOMOTRIMER.
CC -!- COFACTOR: MN(2+).
CC -!- INDUCTION: BY ARGININE OR HOMOARGININE.
CC -!- DISEASE: DEFICIENCY IN ARG1 IS THE CAUSE OF ARGININEMIA.
DR EMBL; M14502; HSARGL.
DR EMBL; X12662; HSARG1.
DR EMBL; X12663; HSARG2.
DR EMBL; X12664; HSARG3.
DR EMBL; X12665; HSARG4.
DR EMBL; X12667; HSARG6.
DR EMBL; X12668; HSARG7.



Initial Score = 6 Optimized Score = 6 Significance = 4.65 
Residue Identity = 75% Matches = 6 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
|||| ||

AVTVDRLGNMASASTPVTLAHFWPDIQPGQRVLVLTYGSGATWGAALYRKPEEVNRPC 
180 190 200 210 220 X X 
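
The "Residue Identity" figures in these ungapped alignments are consistent with matches divided by aligned positions. The sketch below assumes that formula and integer truncation; it is not taken from FastDB documentation, the function name `residue_identity` is hypothetical, and gapped alignments are deliberately left out.

```python
def residue_identity(matches: int, mismatches: int) -> int:
    """Percent identity over ungapped aligned positions, truncated to an integer.

    Assumption: identity = matches / (matches + mismatches). This reproduces
    the ungapped cases printed in this report but is not a documented formula.
    """
    aligned = matches + mismatches
    if aligned == 0:
        raise ValueError("alignment has no aligned positions")
    return (100 * matches) // aligned


# (matches, mismatches) pairs as printed for several alignments in this report
for matches, mismatches in [(6, 2), (6, 4), (7, 3), (5, 0), (4, 1)]:
    print(matches, mismatches, residue_identity(matches, mismatches))
```

For the alignment above, 6 matches and 2 mismatches give 75, the value printed in the report.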



3. US-08-249-182-5 (1-10) 

UNG_HCMVA URACIL-DNA GLYCOSYLASE (EC 3.2.2.-).

ID UNG_HCMVA STANDARD; PRT; 250 AA.
AC P16769;
DT 01-AUG-1990 (REL. 15, CREATED)
DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)
DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)
DE URACIL-DNA GLYCOSYLASE (EC 3.2.2.-).
GN UL114.
OS HUMAN CYTOMEGALOVIRUS (STRAIN AD169).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE; BETAHERPESVIRINAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 90269039
RA CHEE M.S., BANKIER A.T., BECK S., BOHNI R., BROWN C.M., CERNY R.,
RA HORSNELL T., HUTCHISON C.A. III, KOUZARIDES T., MARTIGNETTI J.A.,
RA PREDDIE E., SATCHWELL S.C., TOMLINSON P., WESTON K.M., BARRELL B.G.;
RL CURR. TOP. MICROBIOL. IMMUNOL. 154:125-169(1990).
CC -!- FUNCTION: EXCISES URACIL RESIDUES FROM THE DNA WHICH CAN ARISE
CC AS A RESULT OF MISINCORPORATION OF DUMP RESIDUES BY DNA
CC POLYMERASE OR DUE TO DEAMINATION OF CYTOSINE.
CC -!- SIMILARITY: STRONGLY CONSERVED IN ALL SPECIES.
DR EMBL; X17403; HEHCMVCG.
DR PIR; S09881; DGBEL5.
DR PROSITE; PS00130; U_DNA_GLYCOSYLASE.
KW DNA REPAIR; HYDROLASE; GLYCOSIDASE.
SQ SEQUENCE 250 AA; 28353 MW; 307699 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
|| | |  ||

LLTPDEQARVFCLS ADW I RFLSLPDHDTVLLRDTVA A VEGARQLEMVYP APEHVHRHSYLCPPEQVRVV I VG 
20 30 40 50 60 70 X 80 



GDPYCDGSASGLAFGTLAGRPPPPSLNNVFRELARTVD 
90 100 110 120 



4. US-08-249-182-5 (1-10) 

URED_KLEPN UREASE OPERON URED PROTEIN.

ID URED_KLEPN STANDARD; PRT; 270 AA.

AC Q02944;

DT 01-JUL-1993 (REL. 26, CREATED) 

DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE UREASE OPERON URED PROTEIN. 

GN URED. 

OS KLEBSIELLA PNEUMONIAE. 



39. AROL_ECOLI  SHIKIMATE KINASE II (EC 2.7.1      174     5     5   3.49     0
40. GLUC_OCTDE  GLUCAGON PRECURSOR.                180     5     6   3.49     0



1. US-08-249-182-5 (1-10) 

EXON_HSVSA ALKALINE EXONUCLEASE (3.1.11.-). 

ID EXON_HSVSA STANDARD; PRT; 483 AA.
AC Q01013;
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE ALKALINE EXONUCLEASE (3.1.11.-).
GN 37.
OS HERPESVIRUS SAIMIRI (STRAIN 11).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE; GAMMAHERPESVIRINAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 92333688
RA ALBRECHT J.-C., NICHOLAS J., BILLER D., CAMERON K.R., BIESINGER B.,
RA NEWMAN C., WITTMANN S., CRAXTON M.A., COLEMAN H., FLECKENSTEIN B.,
RA HONESS R.W.;
RL J. VIROL. 66:5047-5058(1992).
DR EMBL; X64346; HSGEND.
DR PIR; H36809; QQBEN4.
KW HYDROLASE; NUCLEASE; EXONUCLEASE.
SQ SEQUENCE 483 AA; 55554 MW; 1295833 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.82
Residue Identity = 70% Matches = 7 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

X X
PEEVTRPNYL
 |||||  ||

GGF I GNEK I GT YFSKNVCNNI AAKGVPKLADV YKACEKMNLRQQSE I CLL I EE VTRGQYLNSLUDALRDGT I 
70 80 90 100 110 120 130 

SSSKFYMATKK9NSTKKIFEPWPIKNDYYVAGPLAFGL 
140 150 160 170 



2. US-08-249-182-5 (1-10)

YPH1_PSEAE HYPOTHETICAL PROTEIN IN PHNA 5'REGION (ORF1) (FRAG

ID YPH1_PSEAE STANDARD; PRT; 229 AA.
AC P20582;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL PROTEIN IN PHNA 5'REGION (ORF1) (FRAGMENT).
OS PSEUDOMONAS AERUGINOSA.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; AEROBIC RODS AND COCCI;
OC PSEUDOMONADACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=PAO1;
RM 90130326
RA ESSAR D.W., EBERLY L., HADERO A., CRAWFORD I.P.;
RL J. BACTERIOL. 172:884-900(1990).
DR EMBL; M33810; PAPHNABA.
DR PIR; D35116; D35116.
KW HYPOTHETICAL PROTEIN.
FT NON_TER 1 1
SQ SEQUENCE 229 AA; 24636 MW; 257164 CN;



Scores:

Mean Median Standard Deviation
2 3 0.86

Times: CPU Total Elapsed
00:00:50.89 00:00:55.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 3826

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.
Cut-off raised to 5.

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
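
The significance values in this report are consistent with a plain z-score: the initial score minus the mean, divided by the standard deviation. The sketch below assumes that reading and uses the rounded mean (2) and standard deviation (0.86) printed in the score statistics; the function name `significance` is hypothetical and nothing here is taken from FastDB documentation.

```python
def significance(initial_score: float, mean: float, std_dev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / std_dev


# Rounded statistics as printed in this report (assumed inputs).
MEAN = 2.0
STD_DEV = 0.86

for score in (7, 6, 5):
    print(score, round(significance(score, MEAN, STD_DEV), 2))
```

Because the printed mean is rounded, the result for an initial score of 7 differs from the reported 5.82 in the last digit; scores of 6 and 5 reproduce the reported 4.65 and 3.49.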



The list of best scores is:

                                                         Init.  Opt.
    Sequence Name Description                    Length Score Score   Sig. Frame

    **** 5 standard deviations above mean ****
 1. EXON_HSVSA  ALKALINE EXONUCLEASE (3.1.11.      483     7     7   5.82     0
    **** 4 standard deviations above mean ****
 2. YPH1_PSEAE  HYPOTHETICAL PROTEIN IN PHNA       229     6     6   4.65     0
 3. UNG_HCMVA   URACIL-DNA GLYCOSYLASE (EC 3.      250     6     6   4.65     0
 4. URED_KLEPN  UREASE OPERON URED PROTEIN.        270     6     6   4.65     0
 5. ARGI_HUMAN  ARGINASE (EC 3.5.3.1).             322     6     6   4.65     0
 6. ARGI_RAT    ARGINASE (EC 3.5.3.1).             323     6     6   4.65     0
 7. VC13_SFVKA  PROTEIN C13.                       500     6     6   4.65     0
 8. LACP_KLULA  LACTOSE PERMEASE.                  587     6     6   4.65     0
 9. RRP1_DROME  RECOMBINATION REPAIR PROTEIN       679     6     7   4.65     0
10. G13A_DICDI  CELL SURFACE GLYCOPROTEIN GP1      730     6     6   4.65     0
11. G13B_DICDI  CELL SURFACE GLYCOPROTEIN GP1      734     6     6   4.65     0
12. K6PF_RABIT  6-PHOSPHOFRUCTOKINASE, MUSCLE      779     6     6   4.65     0
13. CRYX_BACTK  139 KD CRYSTAL PROTEIN (DELTA     1228     6           4.65     0
    **** 3 standard deviations above mean ****
14. TXW2_NAJHH  WEAK TOXIN CM-2.                    61     5     5   3.49     0
15. NUCC_SECCE  NADH-PLASTOQUINONE OXIDOREDUC       90     5     5   3.49     0
16. RS7_SULSO   30S RIBOSOMAL PROTEIN S7 (FRA       94     5     6   3.49     0
17. VE6_BPV4    PROBABLE E6 PROTEIN.                99     5     5   3.49     0
18. RLA2_TRYCR  60S ACIDIC RIBOSOMAL PROTEIN       107     5     5   3.49     0
19. CYC_USTSP   CYTOCHROME C.                      107     5     5   3.49     0
20. CYC_SAMNI   CYTOCHROME C.                      111     5     5   3.49     0
21. CYC_PHAAU   CYTOCHROME C.                      111     5     5   3.49     0
22. CYC_ORYSA   CYTOCHROME C.                      111     5     5   3.49     0
23. CYC_NIGDA   CYTOCHROME C.                      111     5     5   3.49     0
24. CYC_MAIZE   CYTOCHROME C.                      111     5     5   3.49     0
25. CYC_HELAN   CYTOCHROME C.                      111     5     5   3.49     0
26. CYC_CUCMA   CYTOCHROME C.                      111     5     5   3.49     0
27. CYC_BRAOL   CYTOCHROME C.                      111     5     5   3.49     0
28. CYC_ARUMA   CYTOCHROME C.                      111     5     5   3.49     0
29. CYC_WHEAT   CYTOCHROME C.                      112     5     5   3.49     0
30. ACH1_CHICK  ACETYLCHOLINE RECEPTOR PROTEI      118     5     5   3.49     0
31. RL24_HALMA  50S RIBOSOMAL PROTEIN L24 (HM      119     5     5   3.49     0
32. WNT8_THUTH  WNT-8 PROTEIN (FRAGMENT).          128     5     5   3.49     0
33. CCDP_MAIZE  CORTICAL CELL DELINEATING PRO      129     5     5   3.49     0
34. SG50_MOUSE  SULFATED 50 KD GLYCOPROTEIN P      151     5     5   3.49     0
35. GLUC_PIG    GLUCAGON PRECURSOR (FRAGMENT)      158     5     6   3.49     0
36. YBL3_FOAMV  BEL-3 PROTEIN.                     167     5     5   3.49     0
37. VS11_ROTGA  NONSTRUCTURAL PROTEIN.             170     5     5   3.49     0

[Histogram, lower portion: NUMBER OF SEQUENCES (log scale, ticks at 100, 50, 10, 5, 0) plotted against the SCORE and STDEV axes]

PARAMETERS

Similarity matrix          Unitary
K-tuple                    2
Mismatch penalty           1
Joining penalty            20
Gap penalty                1.00
Window size                5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0
Initial scores to save     40
Alignments to save         15
Optimized scores to save   0
Display context            50



SEARCH STATISTICS 



[Histogram, upper portion: NUMBER OF SEQUENCES (log scale, ticks at 10000, 5000, 1000, 500)]



raDDiL 

DATE 28-May-1986 #sequence_revision 28-May-1986 #text_change
03-Feb-1994
ACCESSIONS A00604
REFERENCE A00604
#authors Poorman, R.A.; Randolph, A.; Kemp, R.G.; Heinrikson, R.L.
#journal Nature (1984) 309:467-469
#title Evolution of phosphofructokinase-gene duplication and
creation of new effector sites.
#cross-references MUID:84219739
#accession A00604
##molecule_type protein
##residues 1-749 ##label POO
COMMENT The active enzyme, a tetramer of identical chains, catalyzes the
key control step of glycolysis, the phosphorylation of
fructose-6-phosphate by ATP to form fructose-1,6-bisphosphate in
the presence of magnesium ion. It is an allosteric enzyme
activated by ADP, AMP, or fructose bisphosphate and inhibited by
ATP or citrate.
CLASSIFICATION #superfamily human 6-phosphofructokinase;
6-phosphofructokinase 1 homology
KEYWORDS acetylation; allosteric regulation; duplication; glycolysis;
phosphoprotein; phosphotransferase

FEATURE
17-326 #domain 6-phosphofructokinase 1 homology #label 6PF1\
402-658 #domain 6-phosphofructokinase 1 homology #label 6PF2\
1 #modified_site acetylated amino end (Thr) #status
experimental\
744 #binding_site phosphate (Ser) (covalent) #status
experimental

SUMMARY #length 749 #molecular-weight 81839 #checksum 3199

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
 || ||  ||

GRLRAAHNLVKRGITNLCVIGGDGSLTGADTFRSEHSDLLSDL9KAGKITAEEATRSSYLNIVGLVGSIDND 
100 110 120 130 140 X 150 X 160 

FCGTDMTIGTDSALHRITEIVDAITTTAQSH8RTFVLE 
170 180 190 200 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_5s.res made by on Thu 22 Sep 94 10:21:04-PDT.



Query sequence being compared:US-08-249-182-5 (1-10) 
Number of sequences searched: 36000 
Number of scores above cutoff: 3826 

Results of the initial comparison of US-08-249-182-5 (1-10) with: 
Data bank : Swiss-Prot 28, all entries

[Histogram: vertical axis NUMBER OF SEQUENCES, ticks 100000, 50000]
SEQUENCE 



Initial Score = 6 Optimized Score = 7 Significance = 4.11
Residue Identity = 72% Matches = 8 Mismatches = 2
Gaps = 1 Conservative Substitutions = 0

X X
PEEVTR-PNYL
|||||| | |

PADKEFNLKICSWNVAGLRAHLKKDGLSLIDLEEPDIFCL9ETKCAND9LPEEVTRLPGYHPYWLCMPGGYA 
420 430 440 450 460 470 480 490 

GVAIYSKIHPIHVEYGIGNEEFDDVGRMITAEYEKFYLI 
500 510 520 530 
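
The match, mismatch, gap, and identity figures printed above each alignment can be recomputed from the two aligned strings. The following is a minimal sketch, not FastDB's own code: the function name `alignment_stats` is hypothetical, gaps are counted per gapped column, and the percentage is assumed to be truncated rather than rounded (8 matches over 11 aligned columns is 72.7%, printed as 72%).

```python
def alignment_stats(query: str, target: str) -> dict:
    """Count matches, mismatches, and gap columns in a pairwise alignment.

    Both strings must already be aligned (same length, '-' marks a gap).
    """
    if len(query) != len(target):
        raise ValueError("aligned strings must have equal length")
    matches = mismatches = gaps = 0
    for q, t in zip(query, target):
        if q == "-" or t == "-":
            gaps += 1          # a column where one side is a gap
        elif q == t:
            matches += 1       # unitary matrix: identical residues only
        else:
            mismatches += 1
    aligned = matches + mismatches + gaps
    # Assumption: the report truncates the percentage instead of rounding.
    identity = (100 * matches) // aligned
    return {
        "matches": matches,
        "mismatches": mismatches,
        "gaps": gaps,
        "identity_percent": identity,
    }


# Query US-08-249-182-5 against the eleven target residues it is aligned to.
stats = alignment_stats("PEEVTR-PNYL", "PEEVTRLPGYH")
print(stats)
```

With the ungapped hits elsewhere in this report the same function gives 6 matches and 4 mismatches over 10 columns, that is, 60%.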



14. US-08-249-182-5 (1-10) 

A48832 gp138=cell surface glycoprotein - slime mold (Dict

ENTRY A48832 #type complete

TITLE gp138=cell surface glycoprotein - slime mold (Dictyostelium
discoideum)

ORGANISM #formal_name Dictyostelium discoideum

DATE 01-Dec-1993; #sequence_revision 01-Dec-1993; #text_change
01-Dec-1993
ACCESSIONS A48832
REFERENCE A48832
#authors Fang, H.; Higa, M.; Suzuki, K.; Aiba, K.; Urushihara, H.;
Yanagisawa, K.
#journal Dev. Biol. (1993) 156:201-208
#title Molecular cloning and characterization of two genes encoding
gp138, a cell surface glycoprotein involved in the sexual
cell fusion of Dictyostelium discoideum.
#cross-references MUID:93193972
#contents AX-3 cells
#accession A48832
##status preliminary
##molecule_type DNA; protein
##residues 1-730 ##label FAN
##cross-references NCBIN:127258; NCBIP:127259
##note sequence extracted from NCBI backbone

SUMMARY #length 730 #molecular-weight 80960 #checksum 7716

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.11 
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||   | |||

NTFSYECIDFSNIADFSDYYNDYEQYLPNIDNAPLLTGVDIY8SVVVGDIPESYCRINYLSLYYN8LNGTVP 
330 340 350 360 370 X 380 X 390 

SCISCLGGVKGGDIVLPNPFLNFNKTTEPYCPTFKIDE 
400 410 420 430 



15. US-08-249-182-5 (1-10) 

KIRBF 6-phosphofructokinase (EC 2.7.1.11) - rabbit 

ENTRY KIRBF #type complete 

TITLE 6-phosphofructokinase (EC 2.7.1.11) - rabbit 

ALTERNATE_NAMES phosphofructokinase 1; phosphohexokinase

ORGANISM #formal_name Oryctolagus cuniculus #common_name domestic



REGNHKPETDYLKPPK 
310 320 



12. US-08-249-182-5 (1-10) 

A31776 lactose permease - yeast (Kluyveromyces marxianus

ENTRY A31776 #type complete

TITLE lactose permease - yeast (Kluyveromyces marxianus var.
lactis)

ORGANISM #formal_name Kluyveromyces marxianus var. lactis, Candida
sphaerica

DATE 21-May-1990 #sequence_revision 21-May-1990 #text_change
18-Jun-1993
ACCESSIONS A31776
REFERENCE A92683
#authors Chang, Y.D.; Dickson, R.C.
#journal J. Biol. Chem. (1988) 263:16696-16703
#title Primary structure of the lactose permease gene from the yeast
Kluyveromyces lactis. Presence of an unusual transcript
structure.
#cross-references MUID:89034156
#accession A31776
##molecule_type DNA
##residues 1-587 ##label CHA
##cross-references GB:X06997
SUMMARY #length 587 #molecular-weight 65383 #checksum 2470

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
 |||  | ||

SSSSSLflKKPINTIEHKDTLGNDRDHKEALNSDNDNTSGLKINGVPIEDAREEVLLPGYLSKSYYKLYGLCF 
10 20 30 40 50 X 60 X 70 



ITYLCATM8GYDGALHGSIYTEDAYLKYYHLDINSSSG 
80 90 100 110 



13. US-08-249-182-5 (1-10)

S28366 recombination repair protein 1 - fruit fly (Drosop

ENTRY S28366 #type complete
TITLE recombination repair protein 1 - fruit fly (Drosophila
melanogaster)
ORGANISM #formal_name Drosophila melanogaster
DATE 17-Apr-1993 #sequence_revision 17-Apr-1993 #text_change
18-Jun-1993
ACCESSIONS S28366
REFERENCE S28366
#authors Sander, M.; Lowenhaupt, K.; Lane, W.S.; Rich, A.
#journal Nucleic Acids Res. (1991) 19:4523-4529
#title Cloning and characterization of Rrp1, the gene encoding
Drosophila strand transferase: carboxy-terminal homology to
DNA repair endo/exonucleases.
#accession S28366
##molecule_type mRNA
##residues 1-679 ##label SAN
##cross-references EMBL:M62472
GENETICS
#gene Rrp1



TITLE arginase (EC 3.5.3.1), hepatic - rat

ORGANISM #formal_name Rattus norvegicus #common_name Norway rat

DATE 28-Aug-1989 #sequence_revision 28-Aug-1989 #text_change
28-Apr-1993
ACCESSIONS A28358
REFERENCE A28358
#authors Ohtake, A.; Takiguchi, M.; Shigeto, Y.; Amaya, Y.; Kawamoto,
S.; Mori, M.
#journal J. Biol. Chem. (1988) 263:2245-2249
#title Structural organization of the gene for rat liver-type
arginase.
#cross-references MUID:88115364
#accession A28358
##molecule_type DNA
##residues 1-323 ##label OHT
##cross-references GB:M17924
KEYWORDS hydrolase; urea cycle

SUMMARY #length 323 #molecular-weight 34973 #checksum 6211

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPVFTPATGTPVVGGLSYREGLYITEEIYKTGLLSGLDIMEVNPTLGKTPEEVTRTVNTAVALTLSCFGTK 
240 250 260 270 280 X 290 X 300 



REGNHKPETDYLKPPK 
310 320 



11. US-08-249-182-5 (1-10) 

A26702 arginase (EC 3.5.3.1), hepatic - rat

ENTRY A26702 #type complete

TITLE arginase (EC 3.5.3.1), hepatic - rat

ORGANISM #formal_name Rattus norvegicus #common_name Norway rat

DATE 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change
28-Apr-1993
ACCESSIONS A26702
REFERENCE A26702
#authors Kawamoto, S.; Amaya, Y.; Murakami, K.; Tokunaga, F.; Iwanaga,
S.; Kobayashi, K.; Saheki, T.; Kimura, S.; Mori, M.
#journal J. Biol. Chem. (1987) 262:6280-6283
#title Complete nucleotide sequence of cDNA and deduced amino acid
sequence of rat liver arginase.
#cross-references MUID:87194547
#accession A26702
##molecule_type mRNA
##residues 1-323 ##label KAW
KEYWORDS hydrolase; urea cycle

SUMMARY #length 323 #molecular-weight 34999 #checksum 6406

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPVFTPATGTPVVGGLSYREGLYITEEIYKTGLLSGLDIMEVNPTLGKTPEEVTRTVNTAVPLTLSCFGTK 
240 250 260 270 280 X 290 X 300 



#journal Proc. Natl. Acad. Sci. U.S.A. (1987) 84:412-415
#title Molecular cloning and nucleotide sequence of cDNA for human
liver arginase.
#cross-references MUID:87092419
#accession A26370
##molecule_type mRNA
##residues 1-322 ##label HAR
KEYWORDS hydrolase; urea cycle

SUMMARY #length 322 #molecular-weight 34734 #checksum 2748

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11 
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPSFTPATGTPVVGGLTYREGLYITEEIYKTGLLSGLDIMEVNPSLGKTPEEVTRTVMTAVAITLACFGLA 
240 250 260 270 280 X 290 X 300 



REGNHKPIDYLNPPK 
310 320 



9. US-08-249-182-5 (1-10) 

B26370 arginase (EC 3.5.3.1), hepatic - rat

ENTRY B26370 #type complete

TITLE arginase (EC 3.5.3.1), hepatic - rat

ORGANISM #formal_name Rattus norvegicus #common_name Norway rat

DATE 25-Oct-1987 #sequence_revision 25-Oct-1987 #text_change
28-Apr-1993
ACCESSIONS B26370
REFERENCE A94160
#authors Haraguchi, Y.; Takiguchi, M.; Amaya, Y.; Kawamoto, S.;
Matsuda, I.; Mori, M.
#journal Proc. Natl. Acad. Sci. U.S.A. (1987) 84:412-415
#title Molecular cloning and nucleotide sequence of cDNA for human
liver arginase.
#cross-references MUID:87092419
#accession B26370
##molecule_type protein
##residues 1-323 ##label HAR
KEYWORDS hydrolase; urea cycle

SUMMARY #length 323 #molecular-weight 34927 #checksum 6462

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPVFTPATGTPVVGGLSYRGGLYITEEIYKTGLLSGLDINEVNPTLGKTPEEVTRTVNTAVPLTLSCFGTK 
240 250 260 270 280 X 290 X 300 



REGNHKPETDYLKPPK 
310 320 



10. US-08-249-182-5 (1-10) 

A28358 arginase (EC 3.5.3.1), hepatic - rat

ENTRY A28358 #type complete



Initial Score = 6 Optimized Score = 6 Significance = 4.11 
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||| |   ||

MLPPLKKGWBATLDLRFHQAGGKTVLASA9HVGPLTV8RPFYPEEETCHLYLLHPPGGIVGGDELTISAHLA
10 20 30 40 X 50 X 60 70

PGCHTLITMPGASKFYRSSGA8ALVRG8LT 
80 90 100 



7. US-08-249-182-5 (1-10) 

S02132 arginase (EC 3.5.3.1), hepatic - human

ENTRY S02132 #type complete

TITLE arginase (EC 3.5.3.1), hepatic - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 01-Dec-1989 #sequence_revision 01-Dec-1989 #text_change
28-Apr-1993
ACCESSIONS S02132
REFERENCE S02132
#authors Takiguchi, M.; Haraguchi, Y.; Mori, M.
#journal Nucleic Acids Res. (1988) 16:8789-8802
#title Human liver-type arginase gene: structure of the gene and
analysis of the promoter region.
#cross-references MUID:89016562
#accession S02132
##molecule_type DNA
##residues 1-322 ##label TAK
##cross-references EMBL:X12662
GENETICS
#introns 19/3; 44/1; 102/2; 155/3; 187/2; 222/2; 268/1
KEYWORDS hydrolase; urea cycle

SUMMARY #length 322 #molecular-weight 34735 #checksum 2400

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||||||

LDPSFTPATGTPVVGGLTYREGLYITEEIYKTGLLSGLDIMEVNPSLGKTPEEVTRTVNTAVAITLACFGLA
240 250 260 270 280 X 290 X 300 



REGNHKPIDYLNPPK 
310 320 



US-08-249-182-5 (1-10) 

A26370 arginase (EC 3.5.3.1), hepatic - human

ENTRY A26370 #type complete

TITLE arginase (EC 3.5.3.1), hepatic - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 25-Oct-1987 #sequence_revision 25-Oct-1987 #text_change
28-Apr-1993
ACCESSIONS A26370
REFERENCE A94160
#authors Haraguchi, Y.; Takiguchi, M.; Amaya, Y.; Kawamoto, S.;
Matsuda, I.; Mori, M.



ISi/r T IrUuDnOuLnr u I LrtbrtrrrrOLfMN Vt ttLLHf\ \ VU 

90 100 110 120 



5. US-08-249-182-5 (1-10)

S32937 ureD protein - Klebsiella pneumoniae

ENTRY S32937 #type complete

TITLE ureD protein - Klebsiella pneumoniae

ORGANISM #formal_name Klebsiella pneumoniae

DATE 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change
31-Dec-1993
ACCESSIONS S32937
REFERENCE S32937
#authors Collins, C.M.; Gutman, D.M.; Laman, H.
#journal Mol. Microbiol. (1993) 8:187-198
#title Identification of a nitrogen-regulated promoter controlling
expression of Klebsiella pneumoniae urease genes.
#accession S32937
##molecule_type DNA
##residues 1-270 ##label COL
##cross-references EMBL:L07039
GENETICS
#gene ureD
#start_codon GTG

SUMMARY #length 270 #molecular-weight 29953 #checksum 5656

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
||| |   ||

MLPPLKKGWSRTLDLRF98AGGKTVLASAQHVGPLTVQRPFYPEEETCHLYLLHPPGGIVGGDELTISAHLA 
10 20 30 40 X 50 X 60 70 



PGCHTLITMPGASKFYRSSGAQALVR8QLT 
80 90 100 



6. US-08-249-182-5 (1-10) 

A42887 urease - Klebsiella pneumoniae

ENTRY A42887 #type complete

TITLE urease - Klebsiella pneumoniae

ORGANISM #formal_name Klebsiella pneumoniae

DATE 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change
31-Dec-1993
ACCESSIONS A42887
REFERENCE A42887
#authors Lee, M.H.; Mulrooney, S.B.; Renner, M.J.; Markowicz, Y.;
Hausinger, R.P.
#journal J. Bacteriol. (1992) 174:4324-4330
#title Klebsiella aerogenes urease gene cluster: sequence of ureD
and demonstration that four accessory genes (ureD, ureE,
ureF, and ureG) are involved in nickel metallocenter
biosynthesis.
#accession A42887
##status preliminary
##molecule_type DNA
##residues 1-270 ##label LEE
##cross-references GB:M55391
SUMMARY #length 270 #molecular-weight 29839 #checksum 5847

SEQUENCE



evolutionary implications.
#cross-references MUID:90130326
#accession D35116
##status preliminary
##molecule_type DNA
##residues 1-229 ##label ESS
##cross-references GB:M33810
SUMMARY #length 229 #checksum 9355

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 75% Matches = 6 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
|||| ||

AVTVDRLGNMASASTPVTLAMFUPDIQPG8RVLVLTYGSGATWGAALYRKPEEVNRPC
180 190 200 210 220 X X 



4. US-08-249-182-5 (1-10) 

DGBEL5 uracil-DNA glycosylase (EC 3.2.2.-) - human cytome

ENTRY DGBEL5 #type complete
TITLE uracil-DNA glycosylase (EC 3.2.2.-) - human cytomegalovirus
(strain AD169)
ALTERNATE_NAMES UL114 protein
ORGANISM #formal_name human cytomegalovirus, human herpesvirus 5
#note host Homo sapiens (man)
DATE 31-Dec-1990 #sequence_revision 31-Dec-1990 #text_change
24-Feb-1994
ACCESSIONS S09881
REFERENCE S09749
#authors Chee, M.S.; Bankier, A.T.; Beck, S.; Bohni, R.; Brown, C.M.;
Cerny, R.; Horsnell, T.; Hutchison III, C.A.; Kouzarides,
T.; Martignetti, J.A.; Preddie, E.; Satchwell, S.C.;
Tomlinson, P.; Weston, K.M.; Barrell, B.G.
#journal Curr. Top. Microbiol. Immunol. (1990) 154:125-169
#title Analysis of the protein-coding content of the sequence of
human cytomegalovirus strain AD169.
#cross-references MUID:90269039
#accession S09881
##molecule_type DNA
##residues 1-250 ##label CHE
##cross-references EMBL:X17403
##note possible protein-coding frames are given in this paper
##note neither protein nor nucleic acid sequence is given in
this paper
##note the DNA sequence was submitted to EMBL, December 1989,
in computer-readable form
CLASSIFICATION #superfamily uracil-DNA glycosylase
KEYWORDS glycosidase; hydrolase
SUMMARY #length 250 #molecular-weight 28353 #checksum 6234
SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.11
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
|| | |  ||

LLTPDEQARVFCLSADWIRFLSLPDHDTVLLRDTVAAVEGARQLEMVYPAPEHVHRWSYLCPPEQVRVVIVG 
20 30 40 50 60 70 X 80 



QSBEN4 alkaline exonuclease (EC 3.1.11.-) - saimiriine he

ENTRY QSBEN4 #type complete
TITLE alkaline exonuclease (EC 3.1.11.-) - saimiriine herpesvirus 1
(strain 11)
ORGANISM #formal_name saimiriine herpesvirus 1
#note host Saimiri sciureus (common squirrel monkey)
DATE 31-Dec-1992 #sequence_revision 31-Dec-1992 #text_change
04-Mar-1994
ACCESSIONS H36809
REFERENCE A36806
#authors Albrecht, J.
#submission submitted to the EMBL Data Library, January 1992
#description Primary structure of the herpesvirus saimiri genome.
#accession H36809
##molecule_type DNA
##residues 1-483 ##label ALB
##cross-references GB:X64346
REFERENCE A37309
#authors Albrecht, J.C.; Nicholas, J.; Biller, D.; Cameron, K.R.;
Biesinger, B.; Newman, C.; Wittmann, S.; Craxton, M.A.;
Coleman, H.; Fleckenstein, B.; Honess, R.W.
#journal J. Virol. (1992) 66:5047-5058
#title Primary structure of the herpesvirus saimiri genome.
#cross-references MUID:92333688
#contents annotation; possible protein-coding frames
#note neither protein nor nucleotide sequence is given in this
paper
GENETICS
#gene 37
CLASSIFICATION #superfamily human cytomegalovirus alkaline exonuclease
KEYWORDS exonuclease; hydrolase
SUMMARY #length 483 #molecular-weight 55554 #checksum 6775
SEQUENCE



Initial Score = 7 Optimized Score = 7 Significance = 5.13
Residue Identity = 70% Matches = 7 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

X X
PEEVTRPNYL
 |||||  ||

GGFIGNEKIGTYFSKNVCNNIAAKGVPKLADVYKACEKWNLR89SEICLLIEEVTRG8YLNSLHDALRDGTI 
70 80 90 100 110 120 130 



SSSKFYUATKK8NSTKKIFEPWPIKNDYYVAGPLAFGL 
140 150 160 170 



3. US-08-249-182-5 (1-10) 

D35116 hypothetical protein 2 (phnA 5' region) - Pseudomo

ENTRY D35116 #type fragment

TITLE hypothetical protein 2 (phnA 5' region) - Pseudomonas
aeruginosa (fragment)
ORGANISM #formal_name Pseudomonas aeruginosa

DATE 03-Aug-1990 #sequence_revision 03-Aug-1990 #text_change
30-Sep-1993
ACCESSIONS D35116
REFERENCE A35116
#authors Essar, D.H.; Eberly, L.; Hadero, A.; Crawford, I.P.
#journal J. Bacteriol. (1990) 172:884-900
#title Identification and characterization of genes for a second
anthranilate synthase in Pseudomonas aeruginosa:
interchangeability of the two anthranilate synthases and
1. US-08-249-182-5 (1-10)

A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments

TITLE autotaxin - human (fragments)

ORGANISM #formal_name Homo sapiens #common_name man

DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change
08-May-1993
ACCESSIONS A42329
REFERENCE A42329
#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.
#journal J. Biol. Chem. (1992) 267:2524-2529
#title Identification, purification and partial sequence analysis
of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329
##status preliminary
##molecule_type protein
##residues 1-114 ##label STR
##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
NCBIP:78509; NCBIP:78508; NCBIP:78503
##note sequence extracted from NCBI backbone

SUMMARY #length 114 #checksum 7335

SEQUENCE 

Initial Score = 9 Optimized Score = 9 Significance = 7.19
Residue Identity = 90% Matches = 9 Mismatches = 1

Gaps = 0 Conservative Substitutions = 0

X X
PEEVTRPNYL
||||| ||||

DFFRVNSMQTVFVGYGPTFKGGQPLWITATKSPPFENINLYYDVPWNETIPEEVTXPNYLQAEVSYPAFKPX 
40 50 60 70 80 X 90 100 



LDVYKUHVAAN 
110 



[Score histogram: horizontal axes SCORE (0 to 7) and STDEV (-1 to 6)]



PARAMETERS 



Similarity matrix           Unitary     K-tuple                 2
Mismatch penalty            1           Joining penalty        20
Gap penalty                 1.00        Window size             5
Gap size penalty            0.05
Cutoff score                0
Randomization group         0
Initial scores to save      40          Alignments to save     15
Optimized scores to save    0           Display context        50



SEARCH STATISTICS 



Scores:     Mean     Median     Standard Deviation
            2        3          0.97

Times:      CPU              Total Elapsed
            00:01:25.97      00:01:36.00



Number of residues: 20816057 
Number of sequences searched: 70848 
Number of scores above cutoff: 4001 

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
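
The significance column and the "standard deviations above mean" banners in the list below are consistent with a plain z-score of the initial score. The sketch below assumes that definition; the function name `significance` is hypothetical, and because the report prints the mean and standard deviation rounded (2 and 0.97), the results come out close to, but not exactly equal to, the printed 7.19, 5.13, and 4.11.

```python
def significance(score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


# Rounded search statistics as printed in this report (assumed inputs).
MEAN, STDEV = 2.0, 0.97

for initial_score in (9, 7, 6):
    z = significance(initial_score, MEAN, STDEV)
    print(f"initial score {initial_score}: about {z:.2f} SD above the mean")
```

Feeding in the unrounded mean and standard deviation, which the report does not print, would be needed to reproduce the listed values exactly.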



The list of best scores is: 



                                                         Init.  Opt.
Sequence Name  Description                       Length  Score  Score  Sig.  Frame

**** 7 standard deviations above mean ****
 1. A42329     autotaxin - human (fragments)        114     9      9   7.19    0
**** 5 standard deviations above mean ****
 2. QSBEN4     alkaline exonuclease (EC 3.1.        483     7      7   5.13    0
**** 4 standard deviations above mean ****
 3. D35116     hypothetical protein 2 (phnA         229     6      6   4.11    0
 4. DGBEL5     uracil-DNA glycosylase (EC 3.        250     6      6   4.11    0
 5. S32937     ureD protein - Klebsiella pne        270     6      6   4.11    0
 6. A42887     urease - Klebsiella pneumonia        270     6      6   4.11    0
 7. S02132     arginase (EC 3.5.3.1), hepati        322     6      6   4.11    0
 8. A26370     arginase (EC 3.5.3.1), hepati        322     6      6   4.11    0
 9. B26370     arginase (EC 3.5.3.1), hepati        323     6      6   4.11    0
10. A28358     arginase (EC 3.5.3.1), hepati        323     6      6   4.11    0
11. A26702     arginase (EC 3.5.3.1), hepati        323     6      6   4.11    0
12. A31776     lactose permease - yeast (Klu        587     6      6   4.11    0
13. S28366     recombination repair protein         679     6     [?]  4.11    0
14. A48832     gp138=cell surface glycoprote        730     6      6   4.11    0
15. KIRBF      6-phosphofructokinase (EC 2.7        749     6      6   4.11    0
16. A26550     6-phosphofructokinase (EC 2.7        780     6     [?]  4.11    0
17. JH0592     glutamate receptor chain KA-2        979     6     [?]  4.11    0
18. JH0589     glutamate receptor gamma-2 ch        979     6     [?]  4.11    0



FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_5p.res made by on Thu 22 Sep 94 10:33:47-PDT.

Query sequence being compared: US-08-249-182-5 (1-10)
Number of sequences searched: 70848
Number of scores above cutoff: 4001

Results of the initial comparison of US-08-249-182-5 (1-10) with:
Data bank : PIR 41, all entries
[Histogram: vertical axis NUMBER OF SEQUENCES, ticks 50000, 5000, 1000, 500, 100, 50, 10, 5, 0]



CC Neutralising Domain (PND) of HIV-1. These peptides are used in
CC association with carriers which enhance immunogenicity. The carrier
CC may be the purified protein derivative (PPD) of tuberculin from
CC Mycobacterium tuberculosis. These peptide/carrier complexes can be
CC used as a vaccine for the treatment of HIV-infected and unaffected
CC individuals and/or for the transmission prevention of HIV-1. These
CC peptides may be linear or cyclic.
SQ Sequence 35 AA;
SQ 2 A; 5 R; 3 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;
SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 4 T; 0 W; 2 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
    |||||

CTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHC
X X 10 20 30 
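
The two SQ composition lines of a record can be checked against its printed sequence by counting residues. This is a small sketch under assumptions: the helper name `composition` is hypothetical, and the column order is taken from the SQ lines in this report (A R N D B C Q E Z G H, then I L K M F P S T W Y V).

```python
from collections import Counter

# Column order as it appears on the two SQ composition lines.
SQ_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence: str) -> dict:
    """Return residue counts in SQ-line order, with zeros for absent residues."""
    counts = Counter(sequence.upper())
    return {residue: counts.get(residue, 0) for residue in SQ_ORDER}


# Sequence of record R27857 as printed in the alignment above.
comp = composition("CTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHC")
print("; ".join(f"{n} {aa}" for aa, n in comp.items()))
print("total:", sum(comp.values()))
```

Because OCR damage in these printouts usually hits single letters, a mismatch between the recomputed counts and the SQ lines is a quick way to locate a misread residue.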



15. US-08-249-182-5 (1-10)

R14336 HIV-1 amplifier peptide #20.

ID R14336 standard; Protein; 35 AA.
AC R14336;
DT 03-JAN-1992 (first entry)
DE HIV-1 amplifier peptide #20.
KW human immunodeficiency virus; vaccine; human retrovirus; AIDS;
KW acquired immunodeficiency syndrome; envelope glycoprotein.
OS Synthetic.
PN WO9114449-A.
PD 03-OCT-1991.
PF 16-MAR-1991; E00509.
PR 19-MAR-1990; US-494749.
PA (INSP ) INST PASTEUR.
PI Girard M;
DR WPI; 91-310366/42.
PT Enhancing immunogenicity of envelope glycoprotein - for use as
PT vaccine or immunotherapeutic drug especially against HIV, HTLV-I
PT and HTLV-II
PS Claim 12; Page 50; 71pp; English.
CC This peptide is one example of an HIV-1 amplifier peptide for use in
CC a composition for enhancing the immunogenicity of an envelope
CC glycoprotein of a virus. The sequence corresponds to a
CC neutralisation epitope and enhances the induction of persistent
CC neutralising antibodies in the host. The amplifier peptide is used
CC in addition to an envelope glycoprotein for priming the induction of
CC neutralising antibodies. The compositions are particularly
CC useful for vaccinating against HIV, SIV, HTLV-I and HTLV-II.
SQ Sequence 35 AA;
SQ 2 A; 5 R; 3 N; 1 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;
SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 3 T; 0 W; 2 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
    |||||

CTRPNYNKRKRIHIGPGRAFYTTKNIIGDIRQAHC
X X 10 20 30

IntelliGenetics



PF 29-MAY-1992; 109072. 

PR 31-MAY-1991; JP-129224. 

PA (KAGA ) CHEMO SERO THERAPEUTIC RES INST.
PI Eda Y, Osatomi K, Shiosaki K, Tokiyoshi S;
DR WPI; 92-400517/49.
PT Principal neutralising determinant peptide(s) of HIV gp120
PT protein - used for diagnosing, preventing and treating HIV
PT infection
PS Example 1; Page 10; 26pp; English.
CC DNA encoding HIV PND peptides was PCR amplified using genomic DNA
CC from HIV-infected peripheral blood mononucleic cells as template.
CC The amplified fragments were fused to beta-galactosidase coding
CC sequence. E.coli transformants were cultured to produce the fusion
CC protein. The expressed PND proteins were divided into groups based
CC on their reactivity with neutralising antibodies and their amino
CC acid sequence. The amino acid sequence was analysed using Robson's
CC analytical program for protein secondary structure. Five groups
CC were identified and 90% of all previously reported PND peptides
CC were included in 3 main groups (i.e. Groups I, II and III).
CC Group I PND peptides are those which have the structure XXXBBBX on
CC the amino-terminal side of the GPGR motif (B = beta-strand
CC structure and X = turn or coil structure). Vaccine preparations
CC comprising representative peptides from each of the 5 groups can be
CC used to develop vaccines able to recognise any HIV variant.
CC See also Q31607-Q31608, R28996-R29000 and R29110-R29128.
SQ Sequence 35 AA;
SQ 2 A; 5 R; 3 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;
SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 4 T; 0 W; 2 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
    |||||

CTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHC
X X 10 20 30 



14. US-08-249-182-5 (1-10)

R27857 gp120 PND fragment (m).

ID R27857 standard; peptide; 35 AA.
AC R27857;
DT 16-MAR-1993 (first entry)
DE gp120 PND fragment (m).
KW gp120; principal neutralising domain; PND; HIV-1; immunogenicity;
KW purified protein derivative; PPD; tuberculin; vaccine; AIDS;
KW Mycobacterium tuberculosis; transmission prevention.
OS Human immunodeficiency virus type 1.
PN WO9217590-A.
PD 15-OCT-1992.
PF 02-APR-1992; E00735.
PR 02-APR-1991; US-681624.
PR 14-FEB-1992; US-837781.
PA (INSS ) SCHWEIZ SERUM & IMPFINST.
PA (YESH ) UNIV YESHIVA EINSTEIN COLLEGE.
PI Bloom BR, Cryz S, Devash Y, Rubinstein A;
DR WPI; 92-366260/44.
PT Peptide conjugates of HIV gp120 principal neutralising domain -
PT and carrier proteins as vaccines for treating and preventing HIV
PT infection and/or transmission
PS Claim 2; Page 54; 64pp; English.
CC The sequences given in R27845-58 are fragments of the gp120 Principal



FT /note= "conserved PND motif"
FT Region 8..14
FT /note= "classification of PND peptides is
FT determined by the predicted secondary
FT structure of this region - see comments"
PN EP-516135-A.
PD 02-DEC-1992.
PF 29-MAY-1992; 109072.
PR 31-MAY-1991; JP-129224.
PA (KAGA ) CHEMO SERO THERAPEUTIC RES INST.
PI Eda Y, Osatomi K, Shiosaki K, Tokiyoshi S;
DR WPI; 92-400517/49.
PT Principal neutralising determinant peptide(s) of HIV gp120
PT protein - used for diagnosing, preventing and treating HIV
PT infection
PS Example 1; Page 10; 26pp; English.
CC DNA encoding HIV PND peptides was PCR amplified using genomic DNA
CC from HIV-infected peripheral blood mononucleic cells as template.
CC The amplified fragments were fused to beta-galactosidase coding
CC sequence. E.coli transformants were cultured to produce the fusion
CC protein. The expressed PND proteins were divided into groups based
CC on their reactivity with neutralising antibodies and their amino
CC acid sequence. The amino acid sequence was analysed using Robson's
CC analytical program for protein secondary structure. Five groups
CC were identified and 90% of all previously reported PND peptides
CC were included in 3 main groups (i.e. Groups I, II and III).
CC Group I PND peptides are those which have the structure XXXBBBX on
CC the amino-terminal side of the GPGR motif (B = beta-strand
CC structure and X = turn or coil structure). Vaccine preparations
CC comprising representative peptides from each of the 5 groups can be
CC used to develop vaccines able to recognise any HIV variant.
CC See Q31607-Q31608, R28995-R29000 and R29110-R29128.
SQ Sequence 35 AA;
SQ 2 A; 5 R; 3 N; 0 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;
SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 4 T; 0 W; 2 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
    |||||

CTRPNYNKRKRIHIGPGRAFYTTKNIIGTIRQAHC
X X 10 20 30 



13. US-08-249-182-5 (1-10) 

R28995 Group I HIV gp120 PND peptide IIIHN.

ID R28995 standard; Protein; 35 AA.
AC R28995;
DT 02-APR-1993 (first entry)
DE Group I HIV gp120 PND peptide IIIHN.
KW Principal Neutralising determinant; Human immunodeficiency virus;
KW vaccine; Robson's analytical method; polymerase chain reaction;
KW Garnier-Osguthorpe-Robson method; GOR method; secondary structure.
OS Human immunodeficiency virus.
FH Key Location/Qualifiers
FT Region 15..18
FT /note= "conserved PND motif"
FT Region 8..14
FT /note= "classification of PND peptides is
FT determined by the predicted secondary
FT structure of this region - see comments"

PN EP-516135-A. 



PEEVTRPNYL



TRPNYNKRKRIHIGPGRAFYTTKNIIGDIRQAH 
X X 10 20 30 



11. US-08-249-182-5 (1-10) 

R36586 Virus neutralising epitope of envelope glycoprotei

ID R36586 standard; peptide; 35 AA.
AC R36586;
DT 06-SEP-1993 (first entry)
DE Virus neutralising epitope of envelope glycoprotein of HIV.
KW Human immunodeficiency virus; gp120; gp160; EGP; VNE; immunity.
OS Synthetic.
PN WO9308836-A.
PD 13-MAY-1993.
PF 28-OCT-1992; E02459.
PR 28-OCT-1991; US-782154.
PR 28-OCT-1991; US-782241.
PR 28-OCT-1991; US-782252.
PA (INSP ) INST PASTEUR.
PI Girard M;
DR WPI; 93-167398/20.
PT Enhancing immunogenicity of viral envelope glycoprotein - by
PT co-administration of viral envelope glycoprotein itself, and an
PT oligopeptide derive.
PS Disclosure; Page 82; 107pp; English.
CC A novel method of enhancing the immunogenicity of an envelope
CC glycoprotein (EGP) of a virus (esp. HIV gp120 or gp160) in a host
CC comprises admin. to the host at least one EGP of the virus in an amt.
CC sufficient for priming vaccination and at least one peptide derived
CC from an amino acid sequence of the EGP (e.g. the sequence shown),
CC where the peptide comprises at least one virus-neutralisation
CC epitope (VNE). The complex is able to enhance the induction of
CC neutralising antibodies to the virus and to confer long lasting
CC immunity, longer than 6 months.
CC See also R36567-87.
SQ Sequence 35 AA;
SQ 2 A; 5 R; 3 N; 1 D; 0 B; 2 C; 1 Q; 0 E; 0 Z; 3 G; 2 H;
SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 3 T; 0 W; 2 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
    |||||

CTRPNYNKRKRIHIGPGRAFYTTKNIIGDIRQAHC
X X 10 20 30 



12. US-08-249-182-5 (1-10) 

R28996 Group I HIV gp120 PND peptide 8909C.

ID R28996 standard; Protein; 35 AA.
AC R28996;
DT 02-APR-1993 (first entry)
DE Group I HIV gp120 PND peptide 8909C.
KW Principal Neutralising determinant; Human immunodeficiency virus;
KW vaccine; Robson's analytical method; polymerase chain reaction;
KW Garnier-Osguthorpe-Robson method; GOR method; secondary structure.

OS Human immunodeficiency virus. 

FH Key Location/Qualifiers 

FT Region 15.. 18 



PT breast, and lung carcinomas, and melanoma 

PS Disclosure; Page 16; 17pp; English. 

CC This is the N-terminal amino acid sequence of a human tumour- 

CC associated cell surface glycoprotein antigen. The antigen is 

CC associated with non small cell lung carcinoma cells and also with 

CC other carcinoids including breast and colon carcinomas and melanomas. 

CC It can be used to produce a monoclonal antibody which can be used 

CC for both in vivo and in vitro diagnostic purposes such as the 

CC detection of malignant carcinomas. 

SQ Sequence 30 AA; 

SQ 1 A; 0 R; 1 N; 2 D; 0 B; 0 C; 3 Q; 0 E; 0 Z; 2 G; 0 H; 

SQ 0 I; 3 L; 0 K; 1 M; 1 F; 4 P; 1 S; 2 T; 0 W; 1 Y; 6 V; 

SQ 2 Others; 



Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



              X 10 
              PEEVTRPNYL 
                 || | || 

DVVVQAPTQVPGFLGDSVTLPXYLQVPNMX 
10 X 20 X 30 



10. US-08-249-182-5 (1-10) 

R11210 Retroviral B-epitope containing peptide #20 of hyb 

ID R11210 standard; Protein; 33 AA. 

AC R11210; 

DT 23-MAY-1991 (first entry) 

DE Retroviral B-epitope containing peptide #20 of hybrid molecule. 

KW retrovirus; env glycoprotein; B-epitope; immunodeficiency virus; 

KW HIV; SIV; AIDS. 

PN WO9102544-A. 

PD 07-MAR-1991. 

PF 17-AUG-1990; F00620. 

PR 18-AUG-1989; FR-011044. 

PA (INSP ) INST PASTEUR. 

PA (UYCU-) UNIV CURIE P & M PARIS V. 

PI Girard M, Gluckman JC, Bahraoui EM; 

DR WPI; 91-087117/12. 

PT Vaccine compsns. which neutralise human immune deficiency virus - 

PT comprise a B epitope of retro-virus envelope glyco-protein and T 

PT epitope of distinct protein 

PS Claim 6; Page 36; 47pp; French. 

CC The peptide is a specific example of a B-epitope contg. peptide 

CC which can form a hybrid immunogenic molecule with a retroviral T- 

CC epitope. The B-epitope is chosen to be the major neutralisation 

CC epitope of the envelope glycoprotein of a pathogenic retrovirus. 

CC The T-epitope can be derived from a different protein of the 

CC same retrovirus or from the same protein from a different retrovirus. 

CC The hybrid molecule can also contain a minor epitope, especially a 

CC B-epitope from a conserved region of the HIV, SIV, HTLV-I or 

CC HTLV-II env glycoprotein. The B-epitope-contg. peptide is joined to 

CC the T-epitope using eg tetanus toxin, KLH or HSA. 

CC See also R11191-R11209. 

SQ Sequence 33 AA; 

SQ 2 A; 5 R; 3 N; 1 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 3 G; 2 H; 

SQ 5 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 3 T; 0 W; 2 Y; 0 V; 

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 83% Matches = 5 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 



X 10 



X 10 X 



8. US-08-249-182-5 (1-10) 

R40204 Sequence of peptide construct SP10MN(A). 

ID R40204 standard; peptide; 24 AA. 

AC R40204; 

DT 05-FEB-1994 (first entry) 

DE Sequence of peptide construct SP10MN(A). 

KW Hybrid protein; synthetic protein; immunogenic peptide; tolerance; 

KW synthetic toleragen. 

OS Synthetic. 

PN WO9315750-A. 

PD 19-AUG-1993. 

PF 10-FEB-1993; U01207. 

PR 10-FEB-1992; US-833429. 

PA (HAYN/) HAYNES B F. 

PI Haynes BF; 

DR WPI; 93-272554/34. 

PT Inducing immune tolerance to immunogenic peptide(s) or proteins - 

PT by administering the peptide(s) or proteins coupled to a 2-20 

PT aminoacid hydrophobic peptide 

PS Example; Figure 10; 65pp; English. 

CC The peptide is a variant of T1-SP10 peptide derived from HIVMN 

CC envelope sequences. It comprises the following regions from the 

CC envelope sequences: SP10 and A. 

CC SP10MN(A) sequence is AAs 301-319 from HIVMN. 

CC (A) sequence is AAs 32-324 from HIVMN and AAs 322-327 from HIVIIB. 

CC A = additional HIV gp120 V3 loop sequences added to the original 

CC synthetic peptide (SP10) sequence to add an additional neutralising 

CC and CTL region to the HIV B cell determinant of the hybrid peptide. 

SQ Sequence 24 AA; 

SQ 1 A; 4 R; 2 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 2 G; 1 H; 

SQ 2 I; 0 L; 3 K; 0 M; 1 F; 2 P; 0 S; 3 T; 0 W; 2 Y; 0 V; 

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X 10 
PEEVTRPNYL 
    ||||| 

   CTRPNYNKRKRIHIGPGRAFYTTK 
X X 10 20 



9. US-08-249-182-5 (1-10) 

R15700 Human tumour-associated cell surface antigen N-ter 

ID R15700 standard; peptide; 30 AA. 

AC R15700; 

DT 16-MAR-1992 (first entry) 

DE Human tumour-associated cell surface antigen N-terminal. 

KW Diagnosis; carcinoma; vaccine; therapy; lymphoma; glycoprotein. 

OS Homo sapiens. 

PN EP-460607-A. 

PD 11-DEC-1991. 

PF 04-JUN-1991; 109120. 

PR 05-JUN-1990; US-533371. 

PA (BRIM ) BRISTOL-MYERS SQUIB. 

PI Hellstrom I, Hellstrom KE, Marquardt H, Johnston J; 

DR WPI; 91-363102/50. 

PT Novel monoclonal antibody - to human tumour-associated cell 

PT surface antigen useful in diagnosis and therapy of human colon, 



CC MAb-01 with the peptides corresp. to AAs 302-316 or 322-336 of the 
CC V3 loop, binding of the antibody to the peptide representing AAs 
CC 312-326 was apparent. The extent of this reactivity with other 
CC HIV-1 isolates was screened with peptides corresp. to the V3 loop 
CC region of HIV-1 isolates IIIB, RF, CDC4, NY/5, Z6, Z2 and ELI 
CC (R33335-R33342). These results indicate that monoclonal antibody 
CC NM-01 recognizes an epitope of the V3 loop of gp120 of multiple 
CC HIV-1 isolates having the amino acid sequence R33343. NM-01 is also 
CC putatively reactive with the RF-like peptide set out in R33344. 
CC The variable region of the heavy and light chain of monoclonal 
CC antibody NM-01 were cloned by PCR and sequenced. Nucleotides 1-21 
CC and 334-363 of 937472 corresp. to the PCR primers used to amplify 
CC NM-01 light chain sequences and nucleotides 1-27 and 385-402 of 
CC 857471 corresp. to the PCR primers used to amplify NM-01 heavy chain 
CC sequences. 
SQ Sequence 15 AA; 

SQ 0 A; 3 R; 2 N; 0 D; 0 B; 1 C; 0 Q; 0 E; 0 Z; 1 G; 1 H; 
SQ 2 I; 0 L; 2 K; 0 M; 0 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 0 V; 



Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 71% Matches = 5 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 



X 10 
PEEVTRPNYL 
    ||||| 

   CTRPNYNKRKRIHIG 
X X 10 



7. US-08-249-182-5 (1-10) 

P40385 Sequence of synthetic antigenic peptide 16 from gr 

ID P40385 standard; peptide; 15 AA. 

AC P40385; 

DT 09-JAN-1992 (first entry) 

DE Sequence of synthetic antigenic peptide 16 from group C-src/yes 

DE family of oncoproteins. 

KW Vaccine; neoplasia; tumour location; diagnosis; oncogenic virus; 

KW antigen; oncoprotein; viral oncogene. 

PN WO8403087-A. 

PD 16-AUG-1984. 

PF 14-FEB-1984; U00190. 

PR 14-FEB-1983; US-466329. 

PA (SENA/) SEN A. 

PI Sen A, Lerner RA, Houghten R, Bittle JL; 

DR WPI; 84-213376/34. 

PT Synthetic polypeptide(s) - useful for immunisation against 

PT neoplastic growth and in detection of neoplastic disease 

PS Example; Table 4, Page 53; 84pp; English. 

CC The synthetic peptides of the invention corresp. to an AA residue seq. 

CC of a first determinant domain of a first oncoprotein produced by 

CC cells transformed by an oncogenic virus. The determinant domain is 

CC vicinal to, but exclusive of, an active site of the oncoprotein. 

SQ Sequence 15 AA; 

SQ 0 A; 3 R; 0 N; 1 D; 0 B; 1 C; 0 Q; 3 E; 0 Z; 0 G; 0 H; 

SQ 0 I; 1 L; 0 K; 0 M; 1 F; 2 P; 0 S; 1 T; 1 W; 1 Y; 0 V; 

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 50% Matches = 5 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X 10 
PEEVTRPNYL 
||| || 



PN WO9310241-A. 

PD 27-MAY-1993. 

PF 13-NOV-1992; U09863. 

PR 20-NOV-1991; US-796256. 

PR 21-AUG-1992; US-933411. 

PA (CALJ ) CALGENE INC. 

PI Lardizabal KD, Lassner MW, Metz JG; 

DR WPI; 93-182556/22. 

PT Recombinant DNA construct for forming transgenic host cells which 

PT produce wax ester(s) - comprises nucleic acid sequence which 

PT encodes part of wax synthase protein and heterologous DNA 

PT sequence not naturally associated with wax synthase protein 

PT encoding sequence 

PS Disclosure; Page 46; 79pp; English. 

CC The sequences given in R37476-83 are jojoba wax synthase (fatty acyl: 

CC fatty alcohol acyltransferase) tryptic peptides. These peptides were 

CC used in the determination of the wax synthase amino acid sequence. 

CC The wax synthase and fatty acyl reductase coding sequences (see also 

CC 942838-39) were used in a construct in which the wax synthase gene is 

CC associated with a heterologous DNA sequence not naturally associated 

CC with it. This construct may be used to produce crop plants having a 

CC convenient source of wax esters. Wax esters can be used in a variety 

CC of industrial applications, including pharmaceuticals, cosmetics, 

CC detergents, plastics and lubricants. Production of wax esters in 

CC crop plants allows easier isolation than from traditional sources, 

CC eg. jojoba and sperm whale. 

SQ Sequence 10 AA; 

SQ 0 A; 0 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 3 E; 0 Z; 0 G; 0 H; 

SQ 0 I; 0 L; 1 K; 0 M; 0 F; 1 P; 0 S; 2 T; 0 W; 1 Y; 2 V; 

Initial Score = 5 Optimized Score = 5 Significance = 3.15 
Residue Identity = 83% Matches = 5 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

    X X 10 
    PEEVTRPNYL 

    ||||| 
ETYVPEEVTK 
X 10 



6. US-08-249-182-5 (1-10) 

R33332 Sequence of peptide which corresp. to residues 302 

ID R33332 standard; peptide; 15 AA. 

AC R33332; 

DT 06-JUL-1993 (first entry) 

DE Sequence of peptide which corresp. to residues 302-316 of the V3 

DE loop region of HIV-1MN gp120. 

KW Monoclonal antibody; NM-01; HIV-1; gp120; gp160. 

OS Synthetic. 

PN WO9304090-A. 

PD 04-MAR-1993. 

PF 24-AUG-1992; U07111. 

PR 22-AUG-1991; US-748562. 

PA (NISP ) NISSIN SHOKUHIN KAISHA LTD. 

PI Ohno T; 

DR WPI; 93-093943/11. 

PT Monoclonal antibodies against HIV-1 gp120 and gp160 proteins - 

PT for treating and preventing HIV-1 infection 

PS Example; Page 19; 57pp; English. 

CC Hybridoma cell line HB 10726 secretes MAb NM-01. In order to 

CC characterize the viral epitope recognized by NM-01, the Ab was 

CC screened for reactivity with overlapping peptides corresp. to the amino 

CC acid sequence of the V3 loop region of HIV-1 gp120 (R33332, R33333, 



R32359 Human KA-2 receptor. 



ID R32359 standard; Protein; 980 AA. 

AC R32359; 

DT 17-JUN-1993 (first entry) 

DE Human KA-2 receptor. 

KW Kainate high affinity receptor; EAA2; excitatory amino acid family; 

KW assay; binding affinity; CNS disorders; drugs. 

OS Homo sapiens. 

FH Key Location/Qualifiers 

FT Peptide 1..18 

FT /note= "signal peptide" 

FT Peptide 19..962 

FT /note= "mature peptide" 

FT Domain 528..547 

FT /note= "transmembrane domain TM-1" 

FT Domain 572..590 

FT /note= "transmembrane domain TM-2" 

FT Domain 601..619 

FT /note= "transmembrane domain TM-3" 

FT Domain 786..806 

FT /note= "transmembrane domain TM-4" 

PN EP-529995-A. 

PD 03-MAR-1993. 

PF 25-AUG-1992; 307724. 

PR 27-AUG-1991; US-750081. 

PA (ALLE-) ALLELIX BIOPHARMACEUTICALS INC. 

PI Kamboj R, Nutt SL, Shekter L, Wosnick MA. 

DR WPI; 93-069002/09. 

DR N-PSDB; Q36930. 

PT Isolated polynucleotide encoding human excitatory amino acid-2 

PT receptor - useful for determining binding affinity of cpds. for 

PT the receptor in assaying for drugs to treat CNS disorders 

PS Claim 18; Fig 1; 29pp; English. 

CC The sequence is that of the human KA-2 receptor, a kainate high 

CC affinity receptor of the EAA2 (Excitatory Amino Acid) family. It 

CC can bind glutamate and has ligand-binding properties characteristic 

CC of kainate-type receptors. 

SQ Sequence 980 AA; 

SQ 79 A; 76 R; 36 N; 32 D; 0 B; 20 C; 30 Q; 67 E; 0 Z; 68 G; 18 H; 

SQ 55 I; 110 L; 28 K; 30 M; 38 F; 58 P; 81 S; 51 T; 13 W; 30 Y; 60 V; 

Initial Score = 6 Optimized Score = 6 Significance = 3.94 
Residue Identity = 60% Matches = 6 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

                                                  X X 
                                                  PEEVTRPNYL 
                                                  |||  |  || 

RDSQYETTDTHCGILPKGVVSVLGPSSSPASASTVSHICGEKEIPHIKVGPEETPRLflYLRFASVSLYPSNE 
80 90 100 110 120 X 130 140 

DVSLAVSRILKSFNYPSASLICAKAECLLRLEELVRGF 
150 160 170 180 



5. US-08-249-182-5 (1-10) 

R37478 Jojoba wax synthase tryptic peptide, SQ1083. 

ID R37478 standard; peptide; 10 AA. 

AC R37478; 

DT 20-SEP-1993 (first entry) 

DE Jojoba wax synthase tryptic peptide, SQ1083. 

KW Tryptic peptide; crop plant; fatty acyl:fatty alcohol acyltransferase; 

KW fatty acyl reductase; jojoba; wax ester; wax synthase; cosmetics; 

KW detergents; plastics; lubricants; pharmaceuticals; sperm whale. 



CC The sequence is that encoded by the DNA insertion in plasmid PSY1373 

CC which may be used in the prodn. of attenuated non-primate herpes viruses. 

CC These can be used as live vaccines and provide a safer vaccine than 

CC currently available for e.g. pseudorabies virus of swine, infectious 

CC bovine rhinotracheitis (IBR) virus or Marek's disease of fowl. This 

CC is not the complete sequence as described in the specification but 

CC is the only one given. 

SQ Sequence 38 AA; 

SQ 5 A; 3 R; 8 N; 4 D; 0 B; 0 C; 0 Q; 5 E; 0 Z; 1 G; 2 H; 

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 6 P; 0 S; 1 T; 0 W; 0 Y; 2 V; 

Initial Score = 6 Optimized Score = 6 Significance = 3.94 
Residue Identity = 60% Matches = 6 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 



    X 10 
    PEEVTRPNYL 
     |||| || 

EAPEGEEVTDPNANPNANPNPNANPNVDLERHHRADDR 
X 10 X 20 30 



3. US-08-249-182-5 (1-10) 

R05306 Sequence of human arginase gene product. 



ID R05306 standard; protein; 322 AA. 

AC R05306; 

DT 08-OCT-1990 (first entry) 

DE Sequence of human arginase gene product. 

KW Arginase; ds. 

OS Homo sapiens. 

PN J02117383-A. 

PD 01-MAY-1990. 

PF 26-OCT-1988; 268018. 

PR 26-OCT-1988; JP-268018. 

PA (TOYJ) Tosoh Corp. 

DR WPI; 90-176227/23. 

DR N-PSDB; Q04714. 

PT Prepn. of human arginase - 

PT using transformed microorganism with vector having human 

PT arginase coding DNA sequence. 

PS Claim 3; Page 388-389; 2Spp; Japanese. 

CC A replicable vector expressing cDNA derived from the arginase gene 

CC and coupled with a lactose promotor/operator system or a tryptophan 

CC promotor/operator hybrid system allows production of the enzyme by 

CC transformants. 

CC See also Q04715. 

SQ Sequence 322 AA; 

SQ 16 A; 13 R; 9 N; 19 D; 0 B; 3 C; 5 Q; 18 E; 0 Z; 37 G; 8 H; 

SQ 23 I; 34 L; 24 K; 4 M; 9 F; 22 P; 21 S; 21 T; 2 W; 9 Y; 25 V; 



Initial Score = 6 Optimized Score = 6 Significance = 3.94 
Residue Identity = 60% Matches = 6 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 



                                                  X 10 
                                                  PEEVTRPNYL 
                                                  |||||| 

LDPSFTPATGTPVVGGLTYREGLYITGEIYKTGLLSGLDIHEVNPSLGKTPEEVTRTVNTAVAITLACFGLA 
240 250 260 270 280 X 290 X 300 



REGNHKPIDYLNPPK 
310 320 



4. US-08-249-182-5 (1-10) 



FH Key Location/Qualifiers 

FT Modified_site 6 

FT /note= "potentially glycosylated residue" 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M. 

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 29. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 10 AA; 

SQ 0 A; 0 R; 1 N; 0 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 0 G; 0 H; 

SQ 0 I; 1 L; 0 K; 0 M; 0 F; 2 P; 0 S; 1 T; 0 W; 1 Y; 1 V; 

SQ 1 Others; 



Initial Score = 9 Optimized Score = 9 Significance = 6.30 
Residue Identity = 90% Matches = 9 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 



X X 
PEEVTRPNYL 
||||| |||| 

PEEVTXPNYL 
X 10 



2. US-08-249-182-5 (1-10) 

R38708 Plasmid PSY1373 DNA insertion product. 

ID R38708 standard; Protein; 38 AA. 

AC R38708; 

DT 25-NOV-1993 (first entry) 

DE Plasmid PSY1373 DNA insertion product. 

KW Attenuated; vaccine; herpes virus; non-primate; live; safer; IBR; 

KW infectious bovine rhinotracheitis; MDV; Marek's disease virus; fowl; 

KW pseudo-rabies; swine; infectious bursal disease virus. 

OS Synthetic. 

PN US5223424-A. 

PD 29-JUN-1993. 

PF 27-JUL-1988; 225032. 

PR 06-SEP-1985; US-773430. 

PR 27-JAN-1986; US-823102. 

PR 17-JUL-1986; US-887140. 

PR 02-SEP-1986; US-902887. 

PR 20-NOV-1986; US-933107. 

PR 27-JUL-1987; US-078519. 

PR 27-JUL-1988; US-225032. 

PA (PRUT-) PRUTECH RES & DEV. 

PI Chiang CH, Cochran MD, Macdonald RD. 

DR WPI; 93-219585/27. 

DR N-PSDB; Q42766. 

PT Recombinant fusion proteins for vaccine - comprises antigenic 

PT sequences fused to viral sequences e.g. pseudo-rabies virus, used 

PT as vaccines 

PS Disclosure; Fig 51; 127pp; English. 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
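
The significance values in this listing appear to be the number of standard deviations by which an initial score exceeds the mean of all scores. Using the mean (1) and standard deviation (1.27) printed under SEARCH STATISTICS for this run, an initial score of 9 gives (9 - 1) / 1.27 = 6.30, which agrees with the top hit. The Python sketch below reproduces that arithmetic under this assumption; the function name `significance` is illustrative and does not come from the search program.

```python
def significance(initial_score, mean, std_dev):
    """Standard deviations by which a score lies above the mean score."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / std_dev


# Mean and standard deviation as printed under SEARCH STATISTICS.
MEAN, STD_DEV = 1.0, 1.27

for score in (9, 6, 5):
    print(score, f"{significance(score, MEAN, STD_DEV):.2f}")
```

With these inputs the loop prints 6.30, 3.94 and 3.15, the three significance values that occur in the list of best scores.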



A 100% identical sequence to the query sequence was not found. 



The list of best scores is: 

                                                 Init.  Opt.
Sequence Name Description                Length  Score Score  Sig. Frame

            **** 6 standard deviations above mean ****
 1. R37447  Autotaxin peptide ATX 29.        10      9     9  6.30     0
            **** 3 standard deviations above mean ****
 2. R38708  Plasmid PSY1373 DNA insertion    38      6     6  3.94     0
 3. R05306  Sequence of human arginase ge   322      6     6  3.94     0
 4. R32359  Human KA-2 receptor.            980      6     6  3.94     0
 5. R37478  Jojoba wax synthase tryptic p    10      5     5  3.15     0
 6. R33332  Sequence of peptide which cor    15      5     5  3.15     0
 7. P40385  Sequence of synthetic antigen    15      5     5  3.15     0
 8. R40204  Sequence of peptide construct    24      5     5  3.15     0
 9. R15700  Human tumour-associated cell     30      5     5  3.15     0
10. R11210  Retroviral B-epitope containi    33      5     5  3.15     0
11. R36586  Virus neutralising epitope of    35      5     5  3.15     0
12. R28996  Group I HIV gp120 PND peptide    35      5     5  3.15     0
13. R28995  Group I HIV gp120 PND peptide    35      5     5  3.15     0
14. R27857  gp120 PND fragment (n).          35      5     5  3.15     0
15. R14336  HIV-1 amplifier peptide #20.     35      5     5  3.15     0
16. R29128  Group V HIV gp120 PND peptide    36      5     5  3.15     0
17. R41926  CAP37 protein partial sequenc    40      5     5  3.15     0
18. R40201  Sequence of peptide construct    40      5     5  3.15     0
19. R40196  Sequence of peptide construct    40      5     5  3.15     0
20. R33217  HIV gp120 V3 loop immunogenic    40      5     5  3.15     0
21. R29228  Heteroconjugate antibody immu    40      5     5  3.15     0
22. R04461  Human immunodeficiency virus     40      5     5  3.15     0
23. R40202  Sequence of peptide construct    47      5     5  3.15     0
24. R41924  CAP37 protein coding sequence    50      5     5  3.15     0
25. P91880  Fusion protein coded for by o    50      5     6  3.15     0
26. R40203  Sequence of peptide construct    52      5     5  3.15     0
27. P92100  Fusion protein coded for by o    63      5     6  3.15     0
28. R31226  Fusion peptide contg. HIV-1 e    85      5     5  3.15     0
29. R42360  Papillomavirus E7 protein and    97      5     5  3.15     0
30. R26968  Human T lymphocyte receptor V   114      5     5  3.15     0
31. R28823  Alpha 6B integrin subunit cDN   141      5     5  3.15     0
32. R28824  Alpha 6A integrin subunit cDN   149      5     5  3.15     0
33. R28944  50 kD L-selectin ligand.        151      5     5  3.15     0
34. P50384  TNF analogue having modified    154      5     5  3.15     0
35. P50138  Rabbit tumor necrosis factor.   154      5     5  3.15     0
36. P50101  Sequence of a pure polypeptid   154      5     5  3.15     0
37. R05189  Tumoricidal polypeptide.        154      5     5  3.15     0
38. R05175  Tumoricidal polypeptide.        154      5     5  3.15     0
39. R03266  Rabbit tumour necrosis factor   154      5     5  3.15     0
40. P70557  Tumor necrosis factor.          154      5     5  3.15     0



1. US-08-249-182-5 (1-10) 

R37447 Autotaxin peptide ATX 29. 

ID R37447 standard; peptide; 10 AA. 

AC R37447; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 29. 

KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW immunostains; disease outcome prediction; therapy choice; 

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 



[Histogram, lower part: NUMBER OF SEQUENCES (logarithmic scale, ticks at 5, 10, 50, 100, 500 and 1000) against SCORE (0 to 8) and STDEV (0 to 6)] 



PARAMETERS 



Similarity matrix          Unitary     K-tuple                  2
Mismatch penalty                 1     Joining penalty         20
Gap penalty                   1.00     Window size              5
Gap size penalty              0.05
Cutoff score                     0
Randomization group              0
Initial scores to save          40     Alignments to save      15
Optimized scores to save         0     Display context         50





SEARCH STATISTICS 



Scores:       Mean              Median            Standard Deviation
              1                 3                 1.27

Times:        CPU               Total Elapsed
              00:00:27.97       00:00:30.00

Number of residues: 5287517 

Number of sequences searched: 42145 

Number of scores above cutoff: 4616 



Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 



[Histogram, upper part of vertical axis: NUMBER OF SEQUENCES, logarithmic scale with ticks at 5000, 10000, 50000 and 100000] 

> 0 < 
0| |0 IntelliGenetics 
> 0 < 

FastDB - Fast Pairwise Comparison of Sequences 

Release 5.4 



Results file [name illegible] made on Thu 22 Sep 94 10:42:14-PDT. 



Query sequence being compared: US-08-249-182-5 (1-10) 

Number of sequences searched: 42145 

Number of scores above cutoff: 4616 

Results of the initial comparison of US-08-249-182-5 (1-10) with: 
Data bank : A-GeneSeq 15, all entries 

15. US-08-249-182-4 (1-5) 

VG18_BPML5 GENE 18 PROTEIN (GP18). 



ID VG18_BPML5 STANDARD; PRT; 53 AA. 

AC Q05224; 

DT 01-FEB-1994 (REL. 28, CREATED) 

DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE) 

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 

DE GENE 18 PROTEIN (GP18). 

GN 18. 

OS MYCOBACTERIOPHAGE L5. 

OC VIRIDAE; NOT YET CLASSIFIED. 

RN [1] 

RP SEQUENCE FROM N.A. 

RM 93211282 

RA HATFULL G.F., SARKIS G.J.; 

RL MOL. MICROBIOL. 7:395-405(1993). 

DR EMBL; Z18946; [entry name illegible]. 

SQ SEQUENCE 53 AA; 6342 MW; 16653 CN; 



Initial Score = 4 Optimized Score = 4 Significance = 3.56 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

X X 
QAEVS 
|||| 

[subject sequence illegible in source] 

10 20 X 30 40 50 



r i nun_ i cn tc ie 

SQ SEQUENCE 42 AA; 4727 MW; 10119 CN; 



Initial Score = 4 Optimized Score = 4 Significance = 3.56 
Residue Identity = 100% Matches = 4 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 

                                      X X 

                                      QAEVS 

                                      |||| 

IDYRKC8ASQILKEHGHDKVIPLPELVCTNFHISGLSPQAEV 
10 20 30 40 X 



12. US-08-249-182-4 (1-5) 

THIO_EUBAC THIOREDOXIN (FRAGMENT). 

ID THIO_EUBAC STANDARD; PRT; 45 AA. 

AC P21610; 

DT 01-MAY-1991 (REL. 18, CREATED) 

DT 01-MAY-1991 (REL. 18, LAST SEQUENCE UPDATE) 

DT 01-MAY-1991 (REL. 18, LAST ANNOTATION UPDATE) 

DE THIOREDOXIN (FRAGMENT). 

OS EUBACTERIUM ACIDAMINOPHILUM. 

OC PROKARYOTA; FIRMICUTES; IRREGULAR ASPOROGENOUS RODS; CORYNEFORM GROUP. 

RN [1] 

RP SEQUENCE. 

RM 91139594 

RA MEYER M., DIETRICHS D., SCHMIDT B., ANDREESEN J.R.; 

RL J. BACTERIOL. 173:1509-1513(1991). 

CC -!- FUNCTION: THIOREDOXIN PARTICIPATES IN VARIOUS REDOX REACTIONS 

CC THROUGH THE REVERSIBLE OXIDATION OF ITS ACTIVE CENTER DITHIOL, 

CC TO A DISULFIDE, & CATALYZES DITHIOL-DISULFIDE EXCHANGE REACTIONS. 

DR PROSITE; PS00194; THIOREDOXIN. 

KW REDOX-ACTIVE CENTER; ELECTRON TRANSPORT. 

FT DISULFID 32 35 REDOX-ACTIVE (BY SIMILARITY). 

FT NON_TER 45 45 

SQ SEQUENCE 45 AA; 5002 MW; 11785 CN; 

Initial Score = 4 Optimized Score = 4 Significance = 3.56 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 

            X X 
            QAEVS 
            |||| 

HALLVEIDKDQFQAEVLEAEGYVLVDYFSDGCVPCKALMPDVXIL 
10 X X 20 30 40 



13. US-08-249-182-4 (1-5) 

PSBK_CHLRE PHOTOSYSTEM II 4 KD REACTION CENTRE PROTEIN PRECUR 

ID PSBK_CHLRE STANDARD; PRT; 46 AA. 

AC P18263; 

DT 01-NOV-1990 (REL. 16, CREATED) 

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE PHOTOSYSTEM II 4 KD REACTION CENTRE PROTEIN PRECURSOR. 

GN PSBK. 

OS CHLAMYDOMONAS REINHARDTII. 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; PHYCOPHYTA; CHLOROPHYTA (GREEN ALGAE); VOLVOCALES; 

OC CHLAMYDOMONADACEAE. 

RN [1] 

RP SEQUENCE FROM N.A. 



RM 90370493 

RA SILK G.W., DE LA CRUZ F., WU M.; 

RL NUCLEIC ACIDS RES. 18:4930-4930(1990). 

CC -!- FUNCTION: THIS PROTEIN IS A COMPONENT OF THE REACTION CENTER 

CC OF PHOTOSYSTEM II. 

DR EMBL; X53413; CHCRPSBK. 

DR PIR; S11162; S11162. 

KW CHLOROPLAST; PHOTOSYSTEM II; SIGNAL. 

FT SIGNAL 1 9 POTENTIAL. 

FT CHAIN 10 46 4 KD REACTION CENTER PROTEIN. 

SQ SEQUENCE 46 AA; 5060 MW; 12441 CN; 



Initial Score = 4 Optimized Score = 4 Significance = 3.56 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 



                                       X X 
                                       QAEVS 
                                       || || 

MTTLALVLAKLPEAYAPFAPIVDVLPVIPVFFILLAFVWQAAVSFR 
10 20 30 40 X 



14. US-08-249-182-4 (1-5) 

PSBK_MARPO PHOTOSYSTEM II 4 KD REACTION CENTRE PROTEIN PRECUR 



ID PSBK_MARPO STANDARD; PRT; 55 AA. 

AC P10348; 

DT 01-MAR-1989 (REL. 10, CREATED) 

DT 01-MAR-1989 (REL. 10, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE PHOTOSYSTEM II 4 KD REACTION CENTRE PROTEIN PRECURSOR. 

GN PSBK. 

OS MARCHANTIA POLYMORPHA (LIVERWORT). 

OG CHLOROPLAST. 

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; BRYOPHYTA; HEPATICOPSIDA. 

RN [1] 

RP SEQUENCE FROM N.A. 

RA OHYAMA K.; 

RL SUBMITTED (OCT-1986) TO EMBL/GENBANK/DDBJ DATA BANKS. 

RN [2] 

RP COMPLETE GENOME. 

RA OHYAMA K., FUKUZAWA H., KOHCHI T., SHIRAI H., SANO T., SANO S., 

RA UMESONO K., SHIKI Y., TAKEUCHI M., CHANG Z., AOTA S., INOKUCHI H., 

RA OZEKI H.; 

RL NATURE 322:572-574(1986). 

CC -!- FUNCTION: THIS PROTEIN IS A COMPONENT OF THE REACTION CENTER 

CC OF PHOTOSYSTEM II. 

DR EMBL; X04465; CHMPXX. 

DR PIR; A05024; A05024. 

DR PIR; S01585; S01535. 

KW CHLOROPLAST; PHOTOSYSTEM II; SIGNAL. 

FT SIGNAL 1 18 POTENTIAL. 

FT CHAIN 19 55 4 KD REACTION CENTER PROTEIN. 

SQ SEQUENCE 55 AA; 6442 MW; 18114 CN; 



Initial Score = 4 Optimized Score = 4 Significance = 3.56 
Residue Identity = 80% Matches = 4 Mismatches = 1 

Gaps = 0 Conservative Substitutions = 0 



                                                X X 
                                                QAEVS 
                                                || || 

MFNIYLENAFYLNGITFAKLPEAYSIFDPIVDVHPIIPLFFFLLAFVWQASVSFR 
10 20 30 40 50 X 



SQ 21 A; 12 R; 29 N; 19 D; 0 B; 8 C; 16 Q; 28 E; 0 Z; 18 G; 8 H; 

SQ 31 I; 34 L; 37 K; 6 M; 18 F; 22 P; 35 S; 29 T; 12 W; 17 Y; 36 V; 

Initial Score = 6 Optimized Score = 6 Significance = 4.21 
Residue Identity = 60% Matches = 6 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

                                                  X X 
                                                  VPPFENIELY 
                                                   || |||| 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKWDY 
180 190 200 210 220 230 240 250 

TYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



3. US-08-249-182-7 (1-10) 

R14487 Soluble interferon-alpha/beta receptor. 

ID R14487 standard; Protein; 436 AA. 

AC R14487; 

DT 16-JAN-1992 (first entry) 

DE Soluble interferon-alpha/beta receptor. 

KW IFN; autoimmune disease; graft rejection; histocompatibility. 

OS Homo sapiens. 

PN FR2657881-A. 

PD 09-AUG-1991. 

PF 05-FEB-1990; 001298. 

PR 05-FEB-1990; FR-001298. 

PA (EUBI-) LAB EURO BIOTECHNO. 

PI Eid P, Gresser I, Lutfalla G, Meyer F, Mogensen KE, 

PI Tovey MG, Uze G; 

DR WPI; 91-319778/44. 

DR N-PSDB; Q14239. 

PT New water-soluble polypeptide(s) with affinity for IFN-alpha and 

PT beta - used to treat e.g. lupus erythematosus, Behcet's disease, 

PT aplastic anaemia, diabetes mellitus, rheumatoid arthritis, etc. 

PS Claim 2; Page 45; 52pp; French. 

CC The transmembrane and cytoplasmic domains of the native IFN receptor 

CC have been deleted to obtain a soluble, circulating form of the 

CC receptor. Potentially immunogenic epitopes have thus been eliminated. 

CC Derivatives obtained by substitution or deletion of this sequence 

CC are also claimed as are hybrid molecules comprising the soluble 

CC receptor (or deriv.) and an immunoglobulin such as IgG1. 

CC See also Q14240. 

SQ Sequence 436 AA; 

SQ 21 A; 12 R; 29 N; 19 D; 0 B; 8 C; 16 Q; 28 E; 0 Z; 18 G; 8 H; 

SQ 31 I; 34 L; 37 K; 6 M; 18 F; 22 P; 35 S; 29 T; 12 W; 17 Y; 36 V; 

Initial Score = 6 Optimized Score = 6 Significance = 4.21 
Residue Identity = 60% Matches = 6 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

                                                  X X 
                                                  VPPFENIELY 
                                                   || |||| 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKWDY 
180 190 200 210 220 230 240 250 

TYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



4. US-08-249-182-7 (1-10) 

R29583 Human activin receptor. 



PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M. 

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 48. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer 

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 10 AA; 

SQ 0 A; 0 R; 2 N; 0 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H; 

SQ 1 I; 1 L; 0 K; 0 M; 1 F; 2 P; 1 S; 0 T; 0 W; 1 Y; 0 V; 



Initial Score = 8 Optimized Score = 8 Significance = 5.90 
Residue Identity = 80% Matches = 8 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 



X X 
VPPFENIELY 

 |||||| || 

SPPFENINLY 
X 10 



2. US-08-249-182-7 (1-10) 

R28495 Sequence of a soulble form of the interferon (IFN) 

ID R28495 standard; Protein; 436 AA. 

AC R28495; 

DT 31-MAR-1993 (first entry) 

DE Sequence of a soulble form of the interferon (IFN) receptor 

DE with a high affinity for IFN-alpha and -beta. 

KW Interferon receptor; alpha-interferon; beta-interferon. 

OS Synthetic. 

PN WO9218626-A. 

PD 29-OCT-1992. 

PF 17-APR-1991; F00318. 

PR 17-APR-1991; WO-F00318. 

PA (EUBI-) LAB EURO BIOTECHNOLOGIE. 

PI Eid P, Gresser I, Lutfalla G, Meyer F, Mogensen KE, 

PI Tovey M, Uze G; 

DR WPI; 92-382110/46. 

DR N-PSDB; Q30532. 

PT Water soluble polypeptide(s) strongly bind interferon(s) alpha 

PT and beta - useful as immunosuppressants, for treating autoimmune 

PT diseases and transplant rejection 

PS Claim 2; Fig 1; 58pp; English. 

CC DNA encoding the water-soluble polypeptide with a high affinity for 

CC IFN-alpha and -beta is isolated by PCR, using appropriate 

CC oligonucleotides as primers and cloned cDNA as template. For example, 

CC bacteriophage lambda ZAP, containing the entire coding sequence of 

CC the IFN-alpha and -beta receptor (Q30533), was incubated with oligos 

CC Q30534 and Q30535. R28496 represents the complete receptor. R28495 

CC lacks the transmembrane and cytoplasmic domains. Both forms bind 

CC IFN in the same way as antibodies so are immunosuppressants e.g. for 

CC treating autoimmune diseases and graft rejection. They lack the 

CC toxic side-effects of known immunosuppressants such as steroids. 

SQ Sequence 436 AA; 



A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                    Init.  Opt.
Sequence Name  Description                   Length Score Score   Sig. Frame

**** 5 standard deviations above mean ****
 1. R37449     Autotaxin peptide ATX 48.         10     8     8   5.90     0

**** 4 standard deviations above mean ****
 2. R28495     Sequence of a soulble form of    436     6         4.21     0
 3. R14487     Soluble interferon-alpha/beta    436     6         4.21     0
 4. R29583     Human activin receptor.          513     6         4.21     0
 5. R29581     Mouse activin receptor.          513     6         4.21     0
 6. R42635     Human interferon receptor.       557     6         4.21     0
 7. R28496     Sequence of a soulble form of    557     6         4.21     0
 8. R14488     Complete interferon-alpha/bet    557     6         4.21     0
 9. R11958     Human alpha-interferon recept    557     6         4.21     0

**** 3 standard deviations above mean ****
10. R10607     Peptide with motilin-like act     13     5     5   3.37     0
11. R10606     Peptide with motilin-like act     14     5     5   3.37     0
12. R10605     Peptide with motilin-like act     15     5     5   3.37     0
13. R10601     Peptide with motilin-like act     15     5     5   3.37     0
14. R10604     Peptide with motilin-like act     17     5     5   3.37     0
15. R10603     Peptide with motilin-like act     18     5     5   3.37     0
16. R10602     Peptide with motilin-like act     18     5     5   3.37     0
17. R10600     Peptide with motilin-like act     18     5     5   3.37     0
18. P81032     Sequence (Formula 1) of a 13-     22     5     5   3.37     0
19. R03267     Motilin-like polypeptide.         22     5     5   3.37     0
20. P82050     13-Leu motilin                    22     5     5   3.37     0
21. R04201     N-terminal of deacetoxycephal     22     5     5   3.37     0
22. P92007     Polypeptide with motilin-like     22     5     5   3.37     0
23. R41237     Motilin-like polypeptide #3.      23     5     5   3.37     0
24. R41235     Motilin-like polypeptide #1.      23     5     5   3.37     0
25. R41236     Motilin-like polypeptide #2.      23     5     5   3.37     0
26. R41194     Motilin-like polypeptide gene     23     5     5   3.37     0
27. R05893     Motilin-like peptide.             23     5     5   3.37     0
28. P92005     Polypeptide with motilin-like     23     5     5   3.37     0
29. P92006     Polypeptide with motilin-like     23     5     5   3.37     0
30. R05892     Motilin-like peptide.             25     5     5   3.37     0
31. P83215     Sequence of a 13-Leu analogue     26     5     5   3.37     0
32. R07040     Motilin-like peptide.             28     5     5   3.37     0
33. R07039     Motilin-like peptide.             29     5     5   3.37     0
34. R129U      Fragile X syndrome related pe     36     5     5   3.37     0
35. R05364     [Leu13] motilin with leader p     48     5     5   3.37     0
36. P83216     Sequence of dimer of a 13-Leu     52     5     5   3.37     0
37. R29163     PRP2.                            114     5     5   3.37     0
38. R10981     Motilin/white salmon growth h    114     5     5   3.37     0
39. P80459     Sequence corresp. to the nucl    115     5     5   3.37     0
40. P91380     Porcine prepromotilin.           119     5     5   3.37     0



1. US-08-249-182-7 (1-10) 

R37449 Autotaxin peptide ATX 48. 

ID R37449 standard; peptide; 10 AA.

AC R37449; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 48. 

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic. 

PN US7822043-A. 



[Histogram, lower part: vertical-axis ticks at 1000, 500, 100, 50, 10 and 5; horizontal axes SCORE (0 to 6) and STDEV (0 to 5)]



PARAMETERS

Similarity matrix     Unitary    K-tuple               2
Mismatch penalty      1          Joining penalty      20
Gap penalty           1.00       Window size           5
Gap size penalty      0.05
Cutoff score          0
Randomization group   0

Initial scores to save    40     Alignments to save   15
Optimized scores to save   0     Display context      50





SEARCH STATISTICS

Scores:    Mean           Median         Standard Deviation
           1              3              1.1?

Times:     CPU                           Total Elapsed
           00:00:27.94                   00:00:30.00

Number of residues: 5287517
Number of sequences searched: 42145
Number of scores above cutoff: 4126

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.

The scores below are sorted by initial score.
Significance is calculated based on initial score.



DE CYSTATIN, COLOSTRUM (THIOL PROTEINASE INHIBITOR).
OS BOS TAURUS (BOVINE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; ARTIODACTYLA.
RN [1]
RP SEQUENCE.
RM 85231205
RA HIRADO M., TSUNASAWA S., SAKIYAMA F., NIINOBE M., FUJII S.;
RL FEBS LETT. 186:41-45(1985).
CC -!- FUNCTION: THIS IS A THIOL PROTEINASE INHIBITOR.
CC -!- SIMILARITY: THIS IS A TYPE 2 CYSTATIN.
DR PIR; A01271; UDBO.
DR PROSITE; PS00287; CYSTATIN.
KW THIOL PROTEASE INHIBITOR.
FT ACT_SITE 4 4 REACTIVE SITE.
FT SITE 48 52 SECONDARY AREA OF CONTACT.
FT DISULFID 66 76 BY SIMILARITY.
FT DISULFID 90 110 BY SIMILARITY.
SQ SEQUENCE 112 AA; 12789 MW; 68828 CN;



Initial Score = 5 Optimized Score = 6 Significance = 3.52
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0



X X 
YDVPWNETI
I III II 

ARKGVVSGHNYFLDVELGRTTCTKSQANLDSCPFHNQPHLKREKLCSF9VYVVPHMNTINLVKFSCQD 
50 60 70 80 90 X 100 X 110 

IntelliGenetics

Se* 7 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4 



Results file u249_7a.res made by on Thu 22 Sep 94 10:56:41-PDT.



Query sequence being compared: US-08-249-182-7 (1-10)
Number of sequences searched: 42145
Number of scores above cutoff: 4126

Results of the initial comparison of US-08-249-182-7 (1-10) with:
Data bank : A-GeneSeq 15. all entries 

[Histogram, upper part: vertical axis NUMBER OF SEQUENCES, ticks at 100000, 50000, 10000 and 5000]



Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 



X X 
YDVPWNETI 
III II 

AYKTvIKTPSGEFTLDVPEGTTILDAAEEAGYDLPFSCRAGACSSCLGKVVSGSVDQSEGSFLDDGQHEEGF 
10 X 20 X 30 40 50 60 70



14. US-08-249-182-6 (1-9) 

FER1_EQUAR FERREDOXIN I. 

ID FER1_EQUAR STANDARD; PRT; 95 AA.

AC P00235;

DT 21-JUL-1986 (REL. 01, CREATED) 

DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE) 

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 

DE FERREDOXIN I. 

OS EQUISETUM ARVENSE (FIELD HORSETAIL) (COMMON HORSETAIL).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; EQUISETOPHYTA; SPHENOPSIDA;

OC EQUISETALES; EQUISETACEAE.

RN [1]

RP SEQUENCE.

RM 77249492

RA HASE T., WADA K., MATSUBARA H.;

RL J. BIOCHEM. 82:277-286(1977).

CC -!- FUNCTION: FERREDOXIN ARE IRON-SULFUR PROTEINS THAT TRANSFER 

CC ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS. 

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST. 

CC -!- THIS FERREDOXIN BINDS A SINGLE 2FE-2S CLUSTER. 

DR PIR; A04609; FEEG1F. 

DR PROSITE; PS00197; 2FE2S_FERREDOXIN.

KW ELECTRON TRANSPORT; IRON-SULFUR; CHLOROPLAST. 



FT METAL 38 38 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 43 43 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 46 46 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 76 76 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
SQ SEQUENCE 95 AA; 10098 MW; 47218 CN;







Initial Score = 5 Optimized Score = 5 Significance = 3.52
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X 
YDVPWNETI 
III II 

AYKTVLKTPSGEFTLDVPEGTTILDAAEEAGYDLPFSCRAGACSSCLGKVVSGSVDESEGSFLDDGGMEEGF 
10 X 20 X 30 40 50 60 70 

V 



15. US-08-249-182-6 (1-9) 

CYTCJOVIN CYSTATIN, COLOSTRUM (THIOL PROTEINASE INHIBITOR). 

ID CYTCJOVIN STANDARD; PRT; 112 AA. 

AC P01035; 

DT 21-JUL-1986 (REL. 01, CREATED) 

DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE) 

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 



12. US-08-249-182-6 (1-9)
VKIL_BPP22 KIL PROTEIN.

ID VKIL_BPP22 STANDARD; PRT; 62 AA.
AC P14111;
DT 01-JAN-1990 (REL. 13, CREATED)
DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)
DT 01-NOV-1990 (REL. 16, LAST ANNOTATION UPDATE)
DE KIL PROTEIN.
GN KIL.
OS BACTERIOPHAGE P22.
OC VIRIDAE; DS-DNA NONENVELOPED VIRUSES; SIPHOVIRIDAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 89293845
RA SEMERJIAN A.V., MALLOY D.C., POTEETE A.R.;
RL J. MOL. BIOL. 207:1-13(1989).
CC -!- FUNCTION: P22 KIL IS ESSENTIAL FOR LYTIC GROWTH IN THE ABSENCE OF
CC ABC. EXPRESSION OF P22 KIL CAUSES FILAMENTATION AND CELL DEATH.
DR EMBL; X15637; POP22PL.
DR PIR; S04248; VDBP22.
SQ SEQUENCE 62 AA; 6950 MW; 15686 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.52 
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
II I II 

HT I VPVNGT I LV9QGNREFNKL YEASFPDTKEGNSAA YAH ASS I ANGWEDCQDEDWNRNH 
X 10 20 30 40 50 60 



13. US-08-249-182-6 (1-9) 

FER1_EQUTE FERREDOXIN I.

ID FER1_EQUTE STANDARD; PRT; 95 AA.
AC P00234;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE FERREDOXIN I.
OS EQUISETUM TELMATEIA (GIANT HORSETAIL).
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; EQUISETOPHYTA; SPHENOPSIDA;
OC EQUISETALES; EQUISETACEAE.
RN [1]
RP SEQUENCE.
RM 77249491
RA HASE T., WADA K., MATSUBARA H.;
RL J. BIOCHEM. 82:267-276(1977).
CC -!- FUNCTION: FERREDOXIN ARE IRON-SULFUR PROTEINS THAT TRANSFER
CC ELECTRONS IN A WIDE VARIETY OF METABOLIC REACTIONS.
CC -!- SUBCELLULAR LOCATION: CHLOROPLAST.
CC -!- THIS FERREDOXIN BINDS A SINGLE 2FE-2S CLUSTER.
DR PIR; A00240; FEEQI.
DR PROSITE; PS00197; 2FE2S_FERREDOXIN.
KW ELECTRON TRANSPORT; IRON-SULFUR; CHLOROPLAST.
FT METAL 38 38 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 43 43 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 46 46 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
FT METAL 76 76 IRON-SULFUR (2FE-2S) (BY SIMILARITY).
SQ SEQUENCE 95 AA; 10097 MW; 47161 CN;



DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 375.7 KD PROTEIN ZK112.7 IN CHROMOSOME III.
GN ZK112.7.
OS CAENORHABDITIS ELEGANS.
OC EUKARYOTA; METAZOA; ACOELOMATES; NEMATODA; SECERNENTEA; RHABDITIDA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BRISTOL N2;
RA DU Z.;
RL SUBMITTED (MAY-1993) TO EMBL/GENBANK/DDBJ DATA BANKS.
CC -!- SIMILARITY: TO D.MELANOGASTER FAT TUMOR SUPPRESSOR.
DR EMBL; L14324; CEZK112.
DR WORMPEP; ZK112.7; CE00378.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 3343 AA; 375745 MW; 22105723 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
||   ||||

VTCIQKNSTC9PTLVGDSASRLVSRSSSVIFDLPLKKLTARCFCSSGIDCYDDTTNETI8KT9KINV1TTCD 
2930 2940 2950 2960 2970 2980 2990 

D I DCGPRGKCFHEESSQP1 CRCG9GFESMYSCER ADD 
3000 3010 3020 



11. US-08-249-182-6 (1-9) 

TES1_RAT TESTIN 1 (CMB-22) (FRAGMENT).

ID TES1_RAT STANDARD; PRT; 30 AA.

AC P15242;

DT 01-APR-1990 (REL. 14, CREATED) 

DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE) 

DT 01-APR-1990 (REL. 14, LAST ANNOTATION UPDATE) 

DE TESTIN 1 (CMB-22) (FRAGMENT). 

OS RATTUS NORVEGICUS (RAT). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; RODENTIA. 

RN [1] 

RP SEQUENCE. 

RC STRAIN=SPRAGUE-DAWLEY; TISSUE=SERTOLI CELLS;

RM 90078247

RA CHENG C.Y., GRIMA J., STAHLER M.S., LOCKSHIN R.A.;

RL J. BIOL. CHEM. 264:21386-21393(1989). 

CC -!- FUNCTION: NOT KNOWN. 

CC -!- SUBCELLULAR LOCATION: SERTOLI CELLS SECRETORY PROTEIN. 

DR PIR; A34199; A34199.

KW TESTIS.

FT NON_TER 30 30

SQ SEQUENCE 30 AA; 3731 MW; 4350 CN; 

Initial Score = 5 Optimized Score = 5 Significance = 3.52
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
II III 

TPDPSLDVEWNEWRTKHGKTYNMNEERLKR 
X 10 X 20 30 
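The database entries in this listing use two-letter line codes (ID, AC, DT, DE, OS, OC, RN, RP, RM, RA, RL, CC, DR, KW, FT, SQ) followed by the field text. The sketch below shows one way such an entry could be grouped by line code. It is an illustration only: the function name is hypothetical, and it assumes each record line begins with an upper-case two-letter code and a space, which damaged lines in this scan do not always satisfy.

```python
from collections import defaultdict


def parse_flat_entry(text: str) -> dict[str, list[str]]:
    """Group the lines of one flat-file entry by their two-letter line code."""
    fields: dict[str, list[str]] = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # the scan leaves blank lines between records
        code, _, rest = line.partition(" ")
        # Keep only lines that start with a two-letter upper-case code.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(rest.strip())
    return dict(fields)


if __name__ == "__main__":
    sample = """ID TES1_RAT STANDARD; PRT; 30 AA.

AC P15242;

DT 01-APR-1990 (REL. 14, CREATED)

DE TESTIN 1 (CMB-22) (FRAGMENT).

OS RATTUS NORVEGICUS (RAT).
"""
    entry = parse_flat_entry(sample)
    print(entry["ID"], entry["OS"])
```

Repeated codes such as DT, OC or CC are kept in order as a list, so continuation lines can be joined afterwards if needed.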



600 610 620 630 640 650 X 660



HMLTGDFVGTAKAIAQEVGILPTNLYHYS8EIVDSHV 
670 680 690 700 



9. US-08-249-182-6 (1-9)

CYGS_STRPU SPERACT RECEPTOR PRECURSOR (GUANYLATE CYCLASE) (EC

ID CYGS_STRPU STANDARD; PRT; 1125 AA.

AC P16065;

DT 01-APR-1990 (REL. 14, CREATED) 

DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE) 

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 

DE SPERACT RECEPTOR PRECURSOR (GUANYLATE CYCLASE) (EC 4.6.1.2). 

OS STRONGYLOCENTROTUS PURPURATUS (PURPLE SEA URCHIN) . 

OC EUKARYOTA; METAZOA; ECHINODERMATA; ECHINOZOA; ECHINOIDEA;

OC EUECHINOIDEA. 

RN [1]

RP SEQUENCE FROM N.A. 

RM 89197965 

RA THORPE D.S., GARBERS D.L.; 

RL J. BIOL. CHEM. 264:6545-6549(1989).

CC -!- FUNCTION: IMPLICATED AS A CELL-SURFACE RECEPTOR ON SPERMATOZOA 

CC FOR 'SPERACT' A CHEMOTACTIC PEPTIDE, AND ON VARIOUS OTHER CELLS 

CC AS A RECEPTOR FOR ATRIAL NATRIURETIC PEPTIDE. 

CC -!- CATALYTIC ACTIVITY: GTP = 3',5'-CYCLIC GMP + PYROPHOSPHATE.

CC -!- SUBCELLULAR LOCATION: TYPE I MEMBRANE PROTEIN. 

CC -!- SIMILARITY: TO OTHER GUANYLATE CYCLASES. 

CC -!- SIMILARITY: SOME SIMILARITY WITH CONSERVED REGION OF CATALYTIC 

CC DOMAIN OF PROTEIN KINASES. 

DR EMBL; M22444; SPGUA. 

DR PIR; A33535; OYURCP.

DR PIR; A30856; A30856.

DR PROSITE; PS00452; GUANYLATE_CYCLASES.

KW RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN; PHOSPHORYLATION; LYASE;

KW CGMP SYNTHESIS; SIGNAL.



FT SIGNAL 1 21 POTENTIAL.
FT CHAIN 22 1125 GUANYLATE CYCLASE.
FT DOMAIN 22 510 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 511 531 POTENTIAL.
FT DOMAIN 532 1125 CYTOPLASMIC (POTENTIAL).
FT CARBOHYD 185 185 POTENTIAL.
FT CARBOHYD 409 409 POTENTIAL.
SQ SEQUENCE 1125 AA; 126256 MW; 6471318 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69 
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
III III 

I AVLPETFEM VS I FFSD I VGFTALSAASTP I QVVNLLNDLYTLFDA I I SNYDVYKVET IGDAYMLVSGLPLR 
910 920 930 940 950 X 960 970 

NGDRHAG8I AST AHHLLESVKGF I VPHKPEVFLKLR I 
980 990 1000 1010 



10. US-08-249-182-6 (1-9)

Y0G7_CAEEL HYPOTHETICAL 375.7 KD PROTEIN ZK112.7 IN CHROMOSOM

ID Y0G7_CAEEL STANDARD; PRT; 3343 AA.
AC P34616; 



Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0



X X 
YDVPWNETI
II I III 

YSLSNEGLRVLGFASKSFTKDQVNDDGLKNITSNRATAESDLVFLGLICIYDPPRNETAGAVKKFHQAGINV 
600 610 620 630 640 650 X 660 

HMLTGDFVGT AK A I ASEVG I LPTNLYH YSQE I VDSNV 
670 680 690 700 



8. US-08-249-182-6 (1-9)

ATN1_YEAST SODIUM TRANSPORT ATPASE 1 (EC 3.6.1.-).

ID ATN1_YEAST STANDARD; PRT; 1091 AA.
AC P13587;
DT 01-JAN-1990 (REL. 13, CREATED)
DT 01-JAN-1990 (REL. 13, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE SODIUM TRANSPORT ATPASE 1 (EC 3.6.1.-).
GN ENA1 OR PMR2.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RM 89324047
RA RUDOLPH H.K., ANTEBI A., FINK G.R., BUCKLEY C.M., DORMAN T.E.,
RA LEVITRE J., DAVIDOW L.S., MAO J.-I., MOIR D.T.;
RL CELL 58:133-145(1989).
RN [2]
RP SEQUENCE OF 534-1091 FROM N.A.
RC STRAIN=7305B;
RM 91260670
RA MARTINEZ R., LATREILLE M.-T., MIRANDE M.;
RL MOL. GEN. GENET. 227:149-154(1991).
RN [3]
RP PRELIMINARY SEQUENCE OF 534-1091 FROM N.A.
RC STRAIN=X2180;
RM 89054027
RA MIRANDE M., WALLER J.-P.;
RL J. BIOL. CHEM. 263:18443-18451(1988).
CC -!- FUNCTION: THIS MAGNESIUM-DEPENDENT, ENZYME PROBABLY CATALYZES THE
CC HYDROLYSIS OF ATP COUPLED WITH THE TRANSPORT OF THE CALCIUM.
CC -!- CATALYTIC ACTIVITY: ATP + H(2)O = ADP + ORTHOPHOSPHATE.
CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC -!- SIMILARITY: BELONGS TO THE CATION TRANSPORT ATPASES FAMILY
CC (E1-E2 ATPASES).
DR EMBL; M25489; SCPMR2.
DR EMBL; X58626; SCPMR2G.
DR EMBL; J04186; SCKRS1A.
DR PIR; S05788; PUBYR2.
DR PROSITE; PS00154; ATPASE_E1_E2.
KW HYDROLASE; SODIUM TRANSPORT; TRANSMEMBRANE; PHOSPHORYLATION;
KW MAGNESIUM; ATP-BINDING; MULTIGENE FAMILY.
FT MOD_RES 369 369 PHOSPHORYLATION (BY SIMILARITY).
SQ SEQUENCE 1091 AA; 120357 MW; 6231113 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.69 
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 



X X 
YDVPWNETI
II I III 



DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE) 

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE) 

DE AL1 PROTEIN (ORF CI). 

OS PANICUM STREAK VIRUS. 

OC VIRIDAE; SS-DNA NONENVELOPED VIRUSES; GEMINIVIRIDAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 92268861
RA BRIDDON R.W., LUNNESS P., CHAMBERLAIN L.C., BRUNDISH H.,
RA PINNER M.S., MARKHAM P.G.;
RL J. GEN. VIROL. 73:1041-1047(1992).
CC -!- SIMILARITY: TO AL1 PROTEIN IN OTHER GEMINIVIRUSES.
DR EMBL; X60168; PSGIITDNA.
DR PIR; JQ1552; JQ1552.
SQ SEQUENCE 323 AA; 37026 MW; 566018 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
||   ||||

H ATSREEYLSL VQSSLPYDUATKLN YFEYSASRLFPD I AEPYTNPHPTTEYDLHCNET I EDULKPNIY8VSP 
160 170 180 190 200 X 210 220 



QAYKLLEPSCLSLEQAIADLEULDDTTRHLGEKEREA 
230 240 250 260 



7. US-08-249-182-6 (1-9) 

ATN2_YEAST SODIUM TRANSPORT ATPASE 2 (EC 3.6.1.-).

ID ATN2_YEAST STANDARD; PRT; 1091 AA.
AC Q01896;
DT 01-OCT-1993 (REL. 27, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE SODIUM TRANSPORT ATPASE 2 (EC 3.6.1.-).
GN ENA2.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=DBY746;
RM 93173113
RA GARCIADEBLAS B., RUBIO F., QUINTERO F.J., BANUELOS M.A., HARO R.,
RA RODRIGUEZ-NAVARRO A.;
RL MOL. GEN. GENET. 236:363-368(1993).
CC -!- FUNCTION: THIS MAGNESIUM-DEPENDENT, ENZYME PROBABLY CATALYZES THE
CC HYDROLYSIS OF ATP COUPLED WITH THE TRANSPORT OF SODIUM AND
CC LITHIUM.
CC -!- CATALYTIC ACTIVITY: ATP + H(2)O = ADP + ORTHOPHOSPHATE.
CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC -!- SIMILARITY: BELONGS TO THE CATION TRANSPORT ATPASES FAMILY
CC (E1-E2 ATPASES).
DR EMBL; X67136; SCENA2.
DR PIR; S25007; S25007.
DR PROSITE; PS00154; ATPASE_E1_E2.
KW HYDROLASE; SODIUM TRANSPORT; TRANSMEMBRANE; PHOSPHORYLATION;
KW MAGNESIUM; ATP-BINDING; MULTIGENE FAMILY.
FT MOD_RES 369 369 PHOSPHORYLATION (BY SIMILARITY).
SQ SEQUENCE 1091 AA; 120317 MW; 6218101 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69



Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0

X X 
YDVPWNETI
III I II 

EADPKAYAQHVFRSFDANSDGTLDFKEYVIALHNTSAGKTN8KLEUAFSLYDVDGNGTISKNEVLEIVTAIF 
60 70 80 90 100 110 X 120 

KNISPEDTKHLPEDENTPEKRAEKIUGFFGKKDDDKL 
130 140 150 160 



5. US-08-249-182-6 (1-9) 

FLGL_ECOLI FLAGELLAR HOOK-ASSOCIATED PROTEIN 3 (HAP3) (HOOK-F

ID FLGL_ECOLI STANDARD; PRT; 317 AA.
AC P29744;
DT 01-APR-1993 (REL. 25, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE FLAGELLAR HOOK-ASSOCIATED PROTEIN 3 (HAP3) (HOOK-FILAMENT JUNCTION
DE PROTEIN).
GN FLGL OR FLAT OR FLAU.
OS ESCHERICHIA COLI.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=K12 / CS520;
RA FAHRNER K.A., BLOCK S.M., KRISHNASWAMY S., PARKINSON J.S., BERG H.C.;
RL SUBMITTED (OCT-1993) TO EMBL/GENBANK/DDBJ DATA BANKS.
RN [2]
RP SEQUENCE OF 292-317 FROM N.A.
RC STRAIN=K12;
RM 93078265
RA CASAREGOLA S., JACQ A., LAOUDJ D., MCGURK G., MARGARSON S.,
RA TEMPETE M., NORRIS V., HOLLAND I.B.;
RL J. MOL. BIOL. 228:30-40(1992).
DR EMBL; U02514; U02514.
DR EMBL; X67470; ECGAMS.
DR ECOGENE; EG11545; FLGL.
KW FLAGELLA.
SQ SEQUENCE 317 AA; 34281 MW; 489131 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI
II II II 

LDKTNRGLKNSLNNVLTVRAELGTQLNELESLDSLGSDRALGQTSQMSDLVDVDUNATISSYIMQSTALQAS 
230 240 250 260 270 280 290 300 



YKAFTDM8GLSLFSLSK 
310 



6. US-08-249-182-6 (1-9) 

VAL1_PASV AL1 PROTEIN (ORF CI).

ID VAL1_PASV STANDARD; PRT; 323 AA.
AC Q00338;



RA DIZHOOR A.M., RAY S., KUMAR S., NIEMI G., SPENCER M., BROLLEY D.,
RA WALSH K.A., PHILIPOV P.P., HURLEY J.B., STRYER L.;
RL SCIENCE 251:915-918(1991).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=RETINA;
RM 92070564
RA KUTUZOV M.A., SHMUKLER B.E., SUSLOV O.N., DERGACHEV A.E.,
RA ZARGAROV A.A., ABDULAEV N.G.;
RL FEBS LETT. 293:21-24(1991).
RN [3]
RP SEQUENCE FROM N.A.
RC TISSUE=RETINA;
RM 92335166
RA RAY S., ZOZULYA S., NIEMI G.A., FLAHERTY K.M., BROLLEY D.,
RA DIZHOOR A.M., MCKAY D.B., HURLEY J., STRYER L.;
RL PROC. NATL. ACAD. SCI. U.S.A. 89:5705-5709(1992).
RN [4]
RP SEQUENCE OF 17-35.
RC TISSUE=RETINA;
RM 91184124
RA LAMBRECHT H.G., KOCH K.W.;
RL EMBO J. 10:793-798(1991).
RN [5]
RP MYRISTOYLATION.
RM 92355549
RA DIZHOOR A.M., ERICSSON L.H., JOHNSON R.S., KUMAR S., OLSHEVSKAYA E.,
RA ZOZULYA S., NEUBERT T.A., STRYER L., HURLEY J.B., WALSH K.A.;
RL J. BIOL. CHEM. 267:16033-16036(1992).
RN [6]
RP RETRACTATION ON FUNCTION.
RM 93248555
RA HURLEY J.B., DIZHOOR A.M., STRYER L.;
RL SCIENCE 260:740-740(1993).
RN [7]
RP X-RAY CRYSTALLOGRAPHY.
RM 94061988
RA FLAHERTY K.M., ZOZULYA S., STRYER L., MCKAY D.B.;
RL CELL 75:709-716(1993).
CC -!- FUNCTION: SEEMS TO BE IMPLICATED IN THE PATHWAY FROM RETINAL ROD
CC GUANYLATE CYCLASE TO RHODOPSIN. MAY BE INVOLVED IN THE BLOCKING OF
CC THE PHOSPHORYLATION OF RHODOPSIN.
CC -!- BINDS TWO CALCIUM IONS; ONE WITH HIGH AFFINITY, THE OTHER WITH
CC LOW AFFINITY.
CC -!- PTM: THE N-TERMINAL GLYCINE IS LINKED TO ONE OF FOUR DIFFERENT
CC TYPES OF ACYL GROUPS. THE MOST ABUNDANT IS MYRISTOLEATE (14:1),
CC BUT 14:0, 14:2, AND 12:0 ACYL RESIDUES ARE ALSO PRESENT.
CC -!- SIMILARITY: TO OTHER EF-HAND CALCIUM BINDING PROTEINS, AND
CC MORE SPECIFICALLY TO THE RECOVERIN SUBFAMILY.
DR EMBL; X63322; BTP26CBP.
DR EMBL; S39582; S39582.



DR PIR; A38433; A38433.
DR PIR; S19305; S19305.
DR PIR; A46129; A46129.
DR PROSITE; PS00018; EF_HAND.
KW CALCIUM-BINDING; MYRISTYLATION; VISION.
FT INIT_MET 0 0
FT LIPID 1 1 MYRISTATE.
FT DOMAIN 36 47 ANCESTRAL CALCIUM SITE 1.
FT CA_BIND 73 84 LOW AFFINITY.
FT CA_BIND 109 120 HIGH AFFINITY.
FT DOMAIN 159 170 ANCESTRAL CALCIUM SITE 4.
FT CONFLICT 18 18 L -> 0 (IN REF. 4).
FT CONFLICT 20 20 T -> N (IN REF. 4).
SQ SEQUENCE 201 AA; 23202 MW; 208090 CN;



3. US-08-249-182-6 (1-9)

RECO_MOUSE RECOVERIN (CANCER ASSOCIATED RETINOPATHY PROTEIN)

ID RECO_MOUSE STANDARD; PRT; 201 AA.
AC P34057;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE RECOVERIN (CANCER ASSOCIATED RETINOPATHY PROTEIN) (CAR PROTEIN)
DE (23 KD PHOTORECEPTOR CELL-SPECIFIC PROTEIN).
OS MUS MUSCULUS (MOUSE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=RETINA;
RM 92339508
RA MCGINNIS J.F., STEPANIK P.L., BAEHR W., SUBBARAYA I., LERIOUS V.;
RL FEBS LETT. 302:172-176(1992).
CC -!- FUNCTION: SEEMS TO BE IMPLICATED IN THE PATHWAY FROM RETINAL ROD
CC GUANYLATE CYCLASE TO RHODOPSIN. MAY BE INVOLVED IN THE BLOCKING OF
CC THE PHOSPHORYLATION OF RHODOPSIN.
CC -!- BINDS TWO CALCIUM IONS; ONE WITH HIGH AFFINITY, THE OTHER WITH
CC LOW AFFINITY.
CC -!- SIMILARITY: TO OTHER EF-HAND CALCIUM BINDING PROTEINS, AND
CC MORE SPECIFICALLY TO THE RECOVERIN SUBFAMILY.
DR EMBL; X66196; MM23KDA.
KW CALCIUM-BINDING; MYRISTYLATION; VISION.



FT INIT_MET 0 0 BY SIMILARITY.
FT LIPID 1 1 MYRISTATE (BY SIMILARITY).
FT DOMAIN 36 47 ANCESTRAL CALCIUM SITE 1 (BY SIMILARITY).
FT CA_BIND 73 84 LOW AFFINITY (BY SIMILARITY).
FT CA_BIND 109 120 HIGH AFFINITY (BY SIMILARITY).
FT DOMAIN 159 170 ANCESTRAL CALCIUM SITE 4 (BY SIMILARITY).
SQ SEQUENCE 201 AA; 23275 MW; 204261 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI 
|||  | ||

DSDPKAYAQHVFRSFDANSDGTLDFKEYVIALHMTTAGKPTQKLEWAFSLYDVDGNGTISKNEVLEIVHAIF 
60 70 80 90 100 110 X 120 

KMIKPEDVKLLPDDENTPEKRAEKIWAFFGKKEDDKL 
130 140 150 160 



4. US-08-249-182-6 (1-9)

RECO_BOVIN RECOVERIN (P26).

ID RECO_BOVIN STANDARD; PRT; 201 AA.
AC P21457;
DT 01-MAY-1991 (REL. 18, CREATED)
DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE RECOVERIN (P26).
OS BOS TAURUS (BOVINE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; ARTIODACTYLA.
RN [1]

RP SEQUENCE FROM N.A., AND SEQUENCE OF 5-194. 
RC TISSUE=RETINA; 



500 510 520



2. US-08-249-182-6 (1-9) 

RECO_HUMAN RECOVERIN (CANCER ASSOCIATED RETINOPATHY PROTEIN)

ID RECO_HUMAN STANDARD; PRT; 199 AA.
AC P35243;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE RECOVERIN (CANCER ASSOCIATED RETINOPATHY PROTEIN) (CAR PROTEIN).
GN RCV1.
OS HOMO SAPIENS (HUMAN).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; PRIMATES.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=RETINA;
RM 92392330
RA MURAKAMI A., YAJIMA T., INANA G.;
RL BIOCHEM. BIOPHYS. RES. COMMUN. 187:234-244(1992).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=RETINA;
RM 93272873
RA WIECHMANN A.F., HAMMARBACK J.A.;
RL EXP. EYE RES. 56:463-470(1993).
RN [3]
RP SEQUENCE FROM N.A.
RM 92406381
RA THIRKILL C.E., TAIT R.C., TYLER N.K., ROTH A.M., KELTNER J.L.;
RL INVEST. OPHTHALMOL. VIS. SCI. 33:2768-2772(1992).
CC -!- FUNCTION: SEEMS TO BE IMPLICATED IN THE PATHWAY FROM RETINAL ROD
CC GUANYLATE CYCLASE TO RHODOPSIN. MAY BE INVOLVED IN THE BLOCKING OF
CC THE PHOSPHORYLATION OF RHODOPSIN.
CC -!- BINDS TWO CALCIUM IONS; ONE WITH HIGH AFFINITY, THE OTHER WITH
CC LOW AFFINITY.
CC -!- SIMILARITY: TO OTHER EF-HAND CALCIUM BINDING PROTEINS, AND
CC MORE SPECIFICALLY TO THE RECOVERIN SUBFAMILY.
DR EMBL; S43855; S43855.
DR EMBL; S45545; S45545.
DR EMBL; S62028; S62028.
DR MIM; 179618; TENTH EDITION.
KW CALCIUM-BINDING; MYRISTYLATION; VISION.



FT INIT_MET 0 0 BY SIMILARITY.
FT LIPID 1 1 MYRISTATE (BY SIMILARITY).
FT DOMAIN 36 47 ANCESTRAL CALCIUM SITE 1 (BY SIMILARITY).
FT CA_BIND 73 84 LOW AFFINITY (BY SIMILARITY).
FT CA_BIND 109 120 HIGH AFFINITY (BY SIMILARITY).
FT DOMAIN 159 170 ANCESTRAL CALCIUM SITE 4 (BY SIMILARITY).
SQ SEQUENCE 199 AA; 22999 MW; 201221 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.69
Residue Identity = 66% Matches = 6 Mismatches = 3

Gaps = 0 Conservative Substitutions = 0 

X X 
YDVPWNETI

III I II 

DTDPKAYAQHVFRSFDSNLDGTLDFKEYVIALHMTTAGKTNQKLEHAFSLYDVDGNGTISKNEVLEIVMAIF 
60 70 80 90 100 110 X 120 



KMITPEDVKLLPDDENTPEKRAEKIHKYFGKNDDDKL 
130 140 150 160 



21. INB_HORSE  INTERFERON BETA PRECURSOR.       106     5     5   3.52     0
22. INB_HUMAN  INTERFERON BETA PRECURSOR (FI    187     5     5   3.52     0
23. CUP8_DROME PUPAL CUTICLE PROTEIN EDG-84     188     5     5   3.52     0
24. CHS2_AJECA CHITIN SYNTHASE 2 (EC 2.4.1.1    194     5     5   3.52     0
25. YLH3_CAEEL HYPOTHETICAL 25.6 KD PROTEIN     216     5     5   3.52     0
26. UCR2_TOBAC CYTOCHROME B6-F COMPLEX IRON-    228     5     5   3.52     0
27. UCR1_TOBAC CYTOCHROME B6-F COMPLEX IRON-    228     5     5   3.52     0
28. UCRI_PEA   CYTOCHROME B6-F COMPLEX IRON-    230     5     5   3.52     0
29. SUMT_METIV UROPORPHYRIN-III C-METHYLTRAN    230     5     6   3.52     0
30. GNTR_BACSU GLUCONATE OPERON TRANSCRIPTIO    243     5     5   3.52     0
31. UCRI_SPIOL CYTOCHROME B6-F COMPLEX IRON-    247     5     5   3.52     0
32. HEMA_IAX31 HEMAGGLUTININ PRECURSOR (FRAG    249     5     5   3.52     0
33. PMGY_ECOLI PHOSPHOGLYCERATE MUTASE (EC 5    250     5     5   3.52     0
34. CLCA_PSEPU CHLOROCATECHOL 1,2-DIOXYGENAS    260     5     5   3.52     0
35. YGFD_ECOLI HYPOTHETICAL 29.4 KD PROTEIN     268     5     5   3.52     0
36. TRPA_VIBPA TRYPTOPHAN SYNTHASE ALPHA CHA    268     5     5   3.52     0
37. VG15_BPT4  TAIL CONNECTOR PROTEIN GP15.     272     5     5   3.52     0
38. TRA9_MYCTU PUTATIVE TRANSPOSASE (INSERTI    278     5     5   3.52     0
39. DH47_ARATH DEHYDRIN COR47 (COLD-INDUCED     294     5     5   3.52     0
40. XYL1_PICST NAD(P)H-DEPENDENT XYLOSE REDU    318     5     5   3.52     0



1. US-08-249-182-6 (1-9)

VG01_BPP22 PORTAL PROTEIN (PROTEIN GP1).

ID VG01_BPP22 STANDARD; PRT; 724 AA.

AC P26744; 

DT 01-AUG-1992 (REL. 23, CREATED) 

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE) 

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE) 

DE PORTAL PROTEIN (PROTEIN GP1). 

GN 1. 

OS BACTERIOPHAGE P22. 

OC VIRIDAE; DS-DNA NONENVELOPED VIRUSES; SIPHOVIRIDAE.

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-5. 

RM 91306435 

RA EPPLER K., WYCKOFF E., GOATES J., PARR R., CASJENS S.;

RL VIROLOGY 183:519-538(1991). 

CC -!- FUNCTION: REQUIRED FOR SUCCESSFUL CONDENSATION OF DNA WITHIN THE 

CC CAPSID. GP1 IS A MINOR STRUCTURAL PROTEIN. THE PORTAL PROTEIN IS 

CC PRESENT AS A SINGLE RING-SHAPED DODECAMER LOCATED AT THE POINT 

CC WHERE TAILS ATTACH. IT IS THROUGH THIS RING THAT DNA IS THOUGHT 

CC TO ENTER THE PROHEAD. 

CC -!- SUBUNIT: HOMODODECAMER. 

DR EMBL; M59749; POP22PAC.

DR PIR; C40474; Z1BP22. 

KW METAL-BINDING; LATE PROTEIN. 

FT INIT_MET 0 0

FT METAL 149 149 POTENTIAL. 

FT METAL 154 154 POTENTIAL. 

FT METAL 171 171 POTENTIAL. 

FT METAL 176 176 POTENTIAL. 

SQ SEQUENCE 724 AA; 82611 MW; 2488846 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.86 
Residue Identity = 77% Matches = 7 Mi snatches = 2 

Gaps = 0 Conservative Substitutions = 0 

                                                  X       X
                                                  YDVPWNETI
                                                  |||| | ||
TEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLM
430 440 450 460 470 480 490

AEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQ






PARAMETERS

Similarity matrix          Unitary     K-tuple               2
Mismatch penalty           1           Joining penalty       20
Gap penalty                1.00        Window size           5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40          Alignments to save    15
Optimized scores to save   0           Display context       50



SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
            2       3         0.85

Times:      CPU            Total Elapsed
            00:00:50.95    00:01:02.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 4523

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                         Init. Opt.
     Sequence Name Description                    Length Score Score  Sig. Frame

               **** 5 standard deviations above mean ****
  1. VG01_BPP22   PORTAL PROTEIN (PROTEIN GP1).    724    7    7   5.86   0
               **** 4 standard deviations above mean ****
  2. RECO_HUMAN   RECOVERIN (CANCER ASSOCIATED     199    6    6   4.69   0
  3. RECO_MOUSE   RECOVERIN (CANCER ASSOCIATED     201    6    6   4.69   0
  4. RECO_BOVIN   RECOVERIN (P26).                 201    6    6   4.69   0
  5. FLGL_ECOLI   FLAGELLAR HOOK-ASSOCIATED PRO    317    6    6   4.69   0
  6. VAL1_PASV    AL1 PROTEIN (ORF C1).            323    6    6   4.69   0
  7. ATN2_YEAST   SODIUM TRANSPORT ATPASE 2 (EC   1091    6    6   4.69   0
  8. ATN1_YEAST   SODIUM TRANSPORT ATPASE 1 (EC   1091    6    6   4.69   0
  9. CYGS_STRPU   SPERACT RECEPTOR PRECURSOR (G   1125    6    6   4.69   0
 10. YOG7_CAEEL   HYPOTHETICAL 375.7 KD PROTEIN   3343    6    6   4.69   0
               **** 3 standard deviations above mean ****
 11. TES1_RAT     TESTIN 1 (CMB-22) (FRAGMENT).     30    5    5   3.52   0
 12. VKIL_BPP22   KIL PROTEIN.                      62    5    5   3.52   0
 13. FER1_EQUTE   FERREDOXIN I.                     95    5    5   3.52   0
 14. FER1_EQUAR   FERREDOXIN I.                     95    5    5   3.52   0
 15. CYTC_BOVIN   CYSTATIN, COLOSTRUM (THIOL PR    112    5    6   3.52   0
 16. RS24_STRPU   40S RIBOSOMAL PROTEIN S24.       130    5    5   3.52   0
 17. CYT1_MAIZE   CYSTATIN I PRECURSOR (CORN KE    135    5    5   3.52   0
 18. LEPA_PSEFL   LEPA PROTEIN (FRAGMENT).         161    5    5   3.52   0
 19. VGG_BPG4     MAJOR SPIKE PROTEIN (G PROTEI    177    5    5   3.52   0
 20. INB_MOUSE    INTERFERON BETA PRECURSOR.       182    5    5   3.52   0




FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_6s.res made by on Thu 22 Sep 94 10:11:13-PDT.

Query sequence being compared: US-08-249-182-6 (1-9)
Number of sequences searched: 36000
Number of scores above cutoff: 4523

Results of the initial comparison of US-08-249-182-6 (1-9) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) plotted against initial score; plot not reproduced]



ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 07-Jun-1990 #sequence_revision 07-Jun-1990 #text_change 30-Sep-1993
ACCESSIONS A34199; PC1250
REFERENCE A92740
#authors Cheng, C.Y.; Grima, J.; Stahler, M.S.; Lockshin, R.A.; Bardin, C.W.
#journal J. Biol. Chem. (1989) 264:21386-21393
#title Testins are structurally related Sertoli cell proteins whose
secretion is tightly coupled to the presence of germ cells.
#cross-references MUID:90078247
#accession A34199
##molecule_type protein
##residues 1-30 ##label CHE
SUMMARY #length 30 #checksum 6050
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.22
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

     X       X
     YDVPWNETI
      || |||
TPDPSLDVEWNEWRTKHGKTYNHNEERLKR
     X  10   X    20        30



15. US-08-249-182-6 (1-9)
JU0054 hypothetical adh1 protein - Clostridium acetobutyl

ENTRY JU0054 #type fragment
TITLE hypothetical adh1 protein - Clostridium acetobutylicum (fragment)
ORGANISM #formal_name Clostridium acetobutylicum
DATE 31-Mar-1990 #sequence_revision 31-Mar-1990 #text_change 30-Sep-1993
ACCESSIONS JU0054
REFERENCE A91610
#authors Youngleson, J.S.; Jones, W.A.; Jones, D.T.; Woods, D.R.
#journal Gene (1989) 78:355-364
#title Molecular analysis and nucleotide sequence of the adh1 gene
encoding an NADPH-dependent butanol dehydrogenase in the
gram-positive anaerobe Clostridium acetobutylicum.
#cross-references MUID:89378762
#accession JU0054
##molecule_type DNA
##residues 1-52 ##label YOU
##note the amino acid sequence is not shown in this paper
GENETICS
#gene adh1
SUMMARY #length 52 #checksum 6532
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.22
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

           X       X
           YDVPWNETI
            ||  |||
DLIGLDVCLAIMDVLFNETGDSKYRASSILRKYVRAGHLGRKSGKGFYDYSK
        10 X      20        30        40        50

IntelliGenetics



cyclase catalytic domain homology; protein kinase homology
KEYWORDS carbon-oxygen lyase; cGMP synthesis; glycoprotein; hormone
receptor; membrane protein; phosphorus-oxygen lyase
FEATURE
1-21 #domain signal sequence #status predicted #label SIG\
22-1125 #protein speract receptor #status predicted #label MAT\
22-510 #domain extracellular #status predicted #label EXT\
509-530 #domain transmembrane #status predicted #label TMM\
532-1125 #domain intracellular #status predicted #label CYT\
573-841 #domain protein kinase homology #label KIN\
892-1093 #domain guanylate cyclase catalytic domain homology #label CAT
SUMMARY #length 1125 #molecular-weight 126256 #checksum 5278
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||   |||
IAVLPETFEMVSIFFSDIVGFTALSAASTPIQVVNLLNDLYTLFDAIISNYDVYKVETIGDAYHLVSGLPLR
910 920 930 940 950 X 960 970

NGDRHAGQIASTAHHLLESVKGFIVPHKPEVFLKLRI
980 990 1000 1010



13. US-08-249-182-6 (1-9)
PC1251 testin II - rat (fragment)

ENTRY PC1251 #type fragment
TITLE testin II - rat (fragment)
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 30-Sep-1993
ACCESSIONS PC1251
REFERENCE PC1250
#authors Cheng, C.Y.; Morris, I.; Bardin, C.W.
#journal Biochem. Biophys. Res. Commun. (1993) 191:224-231
#title Testins are structurally related to the mouse cysteine
proteinase precursor but devoid of any
protease/anti-protease activity.
#accession PC1251
##molecule_type protein
##residues 1-19 ##label CHE
SUMMARY #length 19 #checksum 4922
SEQUENCE

Initial Score = 5 Optimized Score = 5 Significance = 4.22
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

        X       X
        YDVPWNETI
         || |||
AAPTPDPSLDVEWNEHRTK
        10      X



14. US-08-249-182-6 (1-9)
A34199 testin I - rat (fragment)

ENTRY A34199 #type fragment
TITLE testin I - rat (fragment)



TITLE Ca2+-transporting ATPase (EC 3.6.1.38) PMR2 - yeast
(Saccharomyces cerevisiae)
ALTERNATE_NAMES calcium pump PMR2
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 30-Jun-1991 #sequence_revision 30-Jun-1991 #text_change 05-May-1994
ACCESSIONS S05788; B30990
REFERENCE A30990
#authors Rudolph, H.K.; Antebi, A.; Fink, G.R.; Buckley, C.M.; Dorman,
T.E.; LeVitre, J.; Davidow, L.S.; Mao, J.I.; Moir, D.T.
#journal Cell (1989) 58:133-145
#title The yeast secretory pathway is perturbed by mutations in
PMR1, a member of a Ca(2+) ATPase family.
#cross-references MUID:89324047
#accession S05788
##molecule_type DNA
##residues 1-1091 ##label RUD
##cross-references EMBL:M25489
GENETICS
#gene LISTA:PMR2
#map_position 4R
CLASSIFICATION #superfamily Na+/K+-transporting ATPase alpha chain
KEYWORDS ATP; calcium transport; hydrolase; membrane protein
FEATURE
369 #active_site Asp #status predicted
SUMMARY #length 1091 #molecular-weight 120356 #checksum 4741
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  || | |||
YSLSNEGLRVLGFASKSFTKDQVNDDQLKNITSNRATAESDLVFLGLIGIYDPPRNETAGAVKKFHQAGINV
600 610 620 630 640 650 X 660

HMLTGDFVGTAKAIAQEVCILPTNLYHYSQEIVDSMV
670 680 690 700



12. US-08-249-182-6 (1-9)
OYURCP speract receptor precursor - sea urchin (Strongylo

ENTRY OYURCP #type complete
TITLE speract receptor precursor - sea urchin (Strongylocentrotus
purpuratus)
ALTERNATE_NAMES guanylate cyclase, membrane-bound
CONTAINS guanylate cyclase (EC 4.6.1.2)
ORGANISM #formal_name Strongylocentrotus purpuratus #common_name
purple urchin
DATE 31-Mar-1993 #sequence_revision 31-Mar-1993 #text_change 31-Dec-1993
ACCESSIONS A33535; A30856
REFERENCE A33535
#authors Thorpe, D.S.; Garbers, D.L.
#journal J. Biol. Chem. (1989) 264:6545-6549
#title The membrane form of guanylate cyclase. Homology with a
subunit of the cytoplasmic form of the enzyme.
#cross-references MUID:89197965
#accession A33535
##molecule_type mRNA
##residues 1-1125 ##label THO
##cross-references GB:J04693
CLASSIFICATION #superfamily membrane-bound guanylate cyclase; guanylate

                                                  X       X
                                                  YDVPWNETI
                                                   |||| ||
TPGIVAGYPARMMIKLLYQLTEPEEPSHYISRLPKLAKVPNVVQEHVKPRSDVPWSETVEELDWREALAILR
680 690 700 710 720 730 740 750

HRPARTSELVTQWLQLIGRFTAAHPDKRALTLLYRWF
760 770 780



10. US-08-249-182-6 (1-9)
S25007 Ca2+-transporting ATPase (EC 3.6.1.38) ENA2 - yeas

ENTRY S25007 #type complete
TITLE Ca2+-transporting ATPase (EC 3.6.1.38) ENA2 - yeast
(Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 07-May-1993 #sequence_revision 07-May-1993 #text_change 28-May-1993
ACCESSIONS S25007; S30898
REFERENCE S25007
#authors Rodriguez-Navarro, A.
#submission submitted to the EMBL Data Library, June 1992
#accession S25007
##molecule_type DNA
##residues 1-1091 ##label ROD
##cross-references EMBL:X67136
REFERENCE S30898
#authors Garciadeblas, B.; Rubio, F.; Quintero, F.J.; Banuelos, M.A.;
Haro, R.; Rodriguez-Navarro, A.
#journal Mol. Gen. Genet. (1993) 236:363-368
#title Differential expression of two genes encoding isoforms of the
ATPase involved in sodium efflux in Saccharomyces
cerevisiae.
#accession S30898
##molecule_type DNA
##residues 1-97;752-754;1038-1091 ##label RO2
##cross-references EMBL:X67136
GENETICS
#gene ENA2
#map_position 4
CLASSIFICATION #superfamily Na+/K+-transporting ATPase alpha chain
KEYWORDS calcium transport; hydrolase; ion transport; membrane protein
SUMMARY #length 1091 #molecular-weight 120316 #checksum 5270
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  || | |||
YSLSNEGLRVLGFASKSFTKDQVNDDQLKNITSNRATAESDLVFLGLIGIYDPPRNETAGAVKKFHQAGINV
600 610 620 630 640 650 X 660

HMLTGDFVGTAKAIAQEVGILPTNLYHYSQEIVDSMV
670 680 690 700



11. US-08-249-182-6 (1-9)
PWBYR2 Ca2+-transporting ATPase (EC 3.6.1.38) PMR2 - yeas

ENTRY PWBYR2 #type complete



C1 protein - Panicum streak virus

ENTRY JQ1552 #type complete
TITLE C1 protein - Panicum streak virus
ORGANISM #formal_name Panicum streak virus
DATE 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 30-Sep-1993
ACCESSIONS JQ1552
REFERENCE JQ1549
#authors Briddon, R.W.; Lunness, P.; Chamberlin, L.C.L.; Pinner, M.S.;
Brundish, H.; Markham, P.G.
#journal J. Gen. Virol. (1992) 73:1041-1047
#title The nucleotide sequence of an infectious insect-transmissible
clone of the geminivirus Panicum streak virus.
#contents Isolate Kenya
#accession JQ1552
##molecule_type DNA
##residues 1-323 ##label BRI
##cross-references EMBL:X60168
CLASSIFICATION #superfamily tomato golden mosaic virus AL1 protein
SUMMARY #length 323 #molecular-weight 37026 #checksum 1817
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  ||   ||||
HATSREEYLSLVSSSLPYDWATKLNYFEYSASRLFPDIAEPYTNPHPTTEYDLHCNETIEDWLKPNIYSVSP
160 170 180 190 200 X 210 220

QAYKLLEPSCLSLEQAIADLEHLDDTTRMLSEKEREA
230 240 250 260



9. US-08-249-182-6 (1-9)
B47521 putative RNA-dependent RNA polymerase fusion prote

ENTRY B47521 #type complete
TITLE putative RNA-dependent RNA polymerase fusion protein -
giardiavirus GLV
ORGANISM #formal_name giardiavirus, GLV
DATE 21-Jan-1994; #sequence_revision 21-Jan-1994; #text_change 21-Jan-1994
ACCESSIONS B47521
REFERENCE A47521
#authors Wang, A.L.; Yang, H.M.; Shen, K.A.; Wang, C.C.
#journal Proc. Natl. Acad. Sci. U.S.A. (1993) 90:8595-8599
#title Giardiavirus double-stranded RNA genome encodes a capsid
polypeptide and a gag-pol-like fusion protein by a
translation frameshift.
#cross-references MUID:93391401
#contents host= Giardia lamblia
#accession B47521
##status preliminary
##molecule_type nucleic acid
##residues 1-1057 ##label WAN
##cross-references NCBIN:137593; NCBIP:137595
##note sequence extracted from NCBI backbone
SUMMARY #length 1057 #molecular-weight 120098 #checksum 72
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3



Zargarov, A.A.; Abdulaev, N.G.
#journal FEBS Lett. (1991) 293:21-24
#title P26 - calcium binding protein from bovine retinal
photoreceptor cells.
#cross-references MUID:92070564
#accession S19305
##status preliminary
##residues 1-202 ##label KUT
##cross-references EMBL:X63322
SUMMARY #length 202 #molecular-weight 23333 #checksum 1961
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||  | ||
EADPKAYAQHVFRSFDANSDGTLDFKEYVIALHMTSAGKTNQKLEWAFSLYDVDGNGTISKNEVLEIVTAIF
60 70 80 90 100 110 120 130

KMISPEDTKHLPEDENTPEKRAEKIHGFFGKKDDDKL
140 150 160



7. US-08-249-182-6 (1-9)
A38433 recoverin - bovine

ENTRY A38433 #type complete
TITLE recoverin - bovine
ORGANISM #formal_name Bos primigenius taurus #common_name cattle
DATE 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 30-Sep-1993
ACCESSIONS A38433
REFERENCE A38433
#authors Dizhoor, A.M.; Ray, S.; Kumar, S.; Niemi, G.; Spencer, M.;
Brolley, D.; Walsh, K.A.; Philipov, P.P.; Hurley, J.B.;
Stryer, L.
#journal Science (1991) 251:915-918
#title Recoverin: a calcium sensitive activator of retinal rod
guanylate cyclase.
#cross-references MUID:91157004
#accession A38433
##status preliminary
##molecule_type mRNA
##residues 1-202 ##label DIZ
##cross-references GB:M95858; GB:M77094
SUMMARY #length 202 #molecular-weight 23333 #checksum 1961
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||  | ||
EADPKAYAQHVFRSFDANSDGTLDFKEYVIALHMTSAGKTNQKLEWAFSLYDVDGNGTISKNEVLEIVTAIF
60 70 80 90 100 110 120 130

KMISPEDTKHLPEDENTPEKRAEKIHGFFGKKDDDKL
140 150 160



8. US-08-249-182-6 (1-9) 



Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||  | ||
DSDPKAVAQHVFRSFDANSDGTLDFKEYVIALHMTTAGKPTQKLEWAFSLYDVDGNGTISKNEVLEIVHAIF
60 70 80 90 100 110 120 130

KMIKPEDVKLLPDDENTPEKRAEKIWAFFGKKEDDKL
140 150 160



5. US-08-249-182-6 (1-9)
A46129 recoverin=calcium sensor - bovine

ENTRY A46129 #type complete
TITLE recoverin=calcium sensor - bovine
ORGANISM #formal_name Bos primigenius taurus #common_name cattle
DATE 21-Sep-1993; #sequence_revision 21-Sep-1993; #text_change 21-Sep-1993
ACCESSIONS A46129
REFERENCE A46129
#authors Ray, S.; Zozulya, S.; Niemi, G.A.; Flaherty, K.M.; Brolley,
D.; Dizhoor, A.M.; McKay, D.B.; Hurley, J.; Stryer, L.
#journal Proc. Natl. Acad. Sci. U.S.A. (1992) 89:5705-5709
#title Cloning, expression, and crystallization of recoverin, a
calcium sensor in vision.
#cross-references MUID:92335166
#contents retinas
#accession A46129
##status preliminary
##molecule_type mRNA
##residues 1-202 ##label RAY
##cross-references NCBIN:108412; NCBIP:108413
##note sequence extracted from NCBI backbone
SUMMARY #length 202 #molecular-weight 23333 #checksum 1961
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||  | ||
EADPKAYAQHVFRSFDANSDGTLDFKEYVIALHMTSAGKTNQKLEWAFSLYDVDGNGTISKNEVLEIVTAIF
60 70 80 90 100 110 120 130

KMISPEDTKHLPEDENTPEKRAEKIHGFFGKKDDDKL
140 150 160



6. US-08-249-182-6 (1-9)
S19305 calcium-binding protein P26 - bovine

ENTRY S19305 #type complete
TITLE calcium-binding protein P26 - bovine
ORGANISM #formal_name Bos primigenius taurus #common_name cattle
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change 22-Nov-1993
ACCESSIONS S19305
REFERENCE S19305
#authors Kutuzov, M.A.; Shmukler, B.E.; Suslov, O.N.; Dergachev, A.E.;



3. US-08-249-182-6 (1-9)
JC1227 recoverin - human

ENTRY JC1227 #type complete
TITLE recoverin - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 04-Mar-1994
ACCESSIONS JC1227
REFERENCE JC1227
#authors Murakami, A.; Yajima, T.; Inana, G.
#journal Biochem. Biophys. Res. Commun. (1992) 187:234-244
#title Isolation of human retinal genes: recoverin cDNA and gene.
#cross-references MUID:92392330
#accession JC1227
##molecule_type mRNA
##residues 1-200 ##label MUR
COMMENT This protein modulates guanylate cyclase activity.
GENETICS
#gene GDB:RCV1
#map_position 17
#introns 127/3; 165/1
FEATURE
37-47 #domain calcium-binding (EF-hand) #label CB1\
73-84 #domain calcium-binding (EF-hand) #label CB2\
108-122 #domain calcium-binding (EF-hand) #label CB3
SUMMARY #length 200 #molecular-weight 23130 #checksum 7845
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 5.28
Residue Identity = 66% Matches = 6 Mismatches = 3
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||  | ||
DTDPKAYAQHVFRSFDSNLDGTLDFKEYVIALHMTTAGKTNQKLEHAFSLYDVDGNGTISKNEVLEIVMAIF
60 70 80 90 100 110 120 130

KMITPEDVKLLPDDENTPEKRAEKINKYFGKNDDDKL
140 150 160



4. US-08-249-182-6 (1-9)
S21155 23K protein - mouse

ENTRY S21155 #type complete
TITLE 23K protein - mouse
ORGANISM #formal_name Mus musculus #common_name house mouse
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change 22-Nov-1993
ACCESSIONS S21155
REFERENCE S21155
#authors McGinnis, J.F.; Stepanik, P.L.; Baehr, W.; Subbaraya, I.;
Lerious, V.
#journal FEBS Lett. (1992) 302:172-176
#title Cloning and sequencing of the 23 kDa mouse photoreceptor
cell-specific protein.
#cross-references MUID:92339508
#accession S21155
##status preliminary
##residues 1-202 ##label MCG
SUMMARY #length 202 #molecular-weight 23406 #checksum 526
SEQUENCE



#title Identification, purification, and partial sequence analysis
of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329
##status preliminary
##molecule_type protein
##residues 1-114 ##label STR
##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
NCBIP:78509; NCBIP:78508; NCBIP:78503
##note sequence extracted from NCBI backbone
SUMMARY #length 114 #checksum 7335
SEQUENCE

Initial Score = 9 Optimized Score = 9 Significance = 8.44
Residue Identity = 100% Matches = 9 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||||||||
RDIEHLTSLDFFRVNSNQTVFVGYGPTFKGGQPLWITATKSPPFENINLYYDVPWNETIPEEVTXPNYLQAE
30 40 50 60 70 X 80 90

VSYPAFKPXLDVYKWHVAAN
100 110



2. US-08-249-182-6 (1-9)
Z1BP22 gene 1 protein - phage P22

ENTRY Z1BP22 #type complete
TITLE gene 1 protein - phage P22
ORGANISM #formal_name phage P22
#note host Salmonella typhimurium
DATE 30-Jun-1992 #sequence_revision 30-Jun-1992 #text_change 08-Apr-1994
ACCESSIONS C40474
REFERENCE A40474
#authors Eppler, K.; Wyckoff, E.; Goates, J.; Parr, R.; Casjens, S.
#journal Virology (1991) 183:519-538
#title Nucleotide sequence of the bacteriophage P22 genes required
for DNA packaging.
#cross-references MUID:91306435
#accession C40474
##molecule_type DNA
##residues 1-725 ##label EPP
##cross-references GB:M59749
GENETICS
#gene 1
CLASSIFICATION #superfamily phage P22 gene 1 protein
SUMMARY #length 725 #molecular-weight 82742 #checksum 9976
SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 6.33
Residue Identity = 77% Matches = 7 Mismatches = 2
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                  |||| | ||
TEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLM
430 440 450 460 470 X 480 490

AEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQ
500 510 520 530
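
The Matches, Mismatches and Residue Identity figures in these alignment blocks can be reproduced by hand. The sketch below is only an illustration and not the FastDB implementation: it assumes that, for an ungapped alignment under the unitary matrix, each identical residue pair counts as one match, and that the identity percentage is truncated (7 of 9 is printed as 77%). The function names are hypothetical.

```python
def score_ungapped(query: str, window: str):
    """Count identical positions between two equal-length peptides.

    Returns (matches, mismatches, identity_percent); the percentage is
    truncated, which is an assumption made to mirror the printed 77%/66%/55%.
    """
    if len(query) != len(window):
        raise ValueError("ungapped comparison needs equal lengths")
    matches = sum(q == s for q, s in zip(query, window))
    mismatches = len(query) - matches
    identity = 100 * matches // len(query)
    return matches, mismatches, identity


def best_window(query: str, subject: str):
    """Slide the query along the subject; return (offset, window, matches)
    for the first window with the highest match count."""
    best = (0, subject[:len(query)], -1)
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        matches = score_ungapped(query, window)[0]
        if matches > best[2]:
            best = (offset, window, matches)
    return best


if __name__ == "__main__":
    query = "YDVPWNETI"
    # Fragment of the phage P22 gene 1 protein around the aligned region.
    fragment = "SIVNDIYDVPRNVTITLEDG"
    print(best_window(query, fragment))
    print(score_ungapped(query, "YDVPRNVTI"))
```

Applied to the P22 window above, this gives 7 matches and 2 mismatches, in line with the printed statistics.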



     Sequence Name Description                    Length Score Score  Sig. Frame

  1. A42329       autotaxin - human (fragments)    114    9    9   8.44   0

The list of other best scores is:



                                                         Init. Opt.
     Sequence Name Description                    Length Score Score  Sig. Frame

               **** 6 standard deviations above mean ****
  2. Z1BP22       gene 1 protein - phage P22       725    7    7   6.33   0
               **** 5 standard deviations above mean ****
  3. JC1227       recoverin - human                200    6    6   5.28   0
  4. S21155       23K protein - mouse              202    6    6   5.28   0
  5. A46129       recoverin=calcium sensor - bo    202    6    6   5.28   0
  6. S19305       calcium-binding protein P26 -    202    6    6   5.28   0
  7. A38433       recoverin - bovine               202    6    6   5.28   0
  8. JQ1552       C1 protein - Panicum streak v    323    6    6   5.28   0
  9. B47521       putative RNA-dependent RNA po   1057    6    6   5.28   0
 10. S25007       Ca2+-transporting ATPase (EC    1091    6    6   5.28   0
 11. PWBYR2       Ca2+-transporting ATPase (EC    1091    6    6   5.28   0
 12. OYURCP       speract receptor precursor -    1125    6    6   5.28   0
               **** 4 standard deviations above mean ****
 13. PC1251       testin II - rat (fragment)        19    5    5   4.22   0
 14. A34199       testin I - rat (fragment)         30    5    5   4.22   0
 15. JU0054       hypothetical adh1 protein - C     52    5    5   4.22   0
 16. VDBP22       kil protein - phage P22           62    5    5   4.22   0
 17. FEEQ1F       ferredoxin I - field horsetai     95    5    5   4.22   0
 18. FEEQ1        ferredoxin I - horsetail (Equ     95    5    5   4.22   0
 19. UDBO         cystatin - bovine                112    5    6   4.22   0
 20. S36733       malate dehydrogenase (NADP+)     125    5    5   4.22   0
 21. A43907       ribosomal protein S24 - sea u    130    5    5   4.22   0
 22. S27239       cysteine proteinase inhibitor    135    5    5   4.22   0
 23. A45708       immunoreactive epitope (overt    144    5    5   4.22   0
 24. S36680       lepA protein - Pseudomonas fl    161    5    5   4.22   0
 25. S04479       Interferon beta-1 - Human        166    5    5   4.22   0
 26. A23537       gnt operon regulatory protein    169    5    5   4.22   0
 27. ZGBPG4       gene G protein - phage G4        177    5    5   4.22   0
 28. S02020       interferon beta precursor - m    182    5    5   4.22   0
 29. IVMSB        interferon beta precursor - m    182    5    5   4.22   0
 30. IVHOB1       interferon beta-1 precursor -    186    5    5   4.22   0
 31. IVHUB1       interferon beta-1 precursor -    187    5    5   4.22   0
 32. B49773       ecdysone-dependent cuticle pr    188    5    5   4.22   0
 33. H45189       chitin synthase=HcCHS2 gene p    195    5    5   4.22   0
 34. S38667       ribonuclease S3 - Peruvian to    195    5    5   4.22   0
 35. S34833       stylar protein - Peruvian tom    205    5    5   4.22   0
 36. S40720       hypothetical protein - Caenor    216    5    5   4.22   0
 37. S39545       plastoquinol--plastocyanin re    228    5    5   4.22   0
 38. S25312       plastoquinol--plastocyanin re    228    5    5   4.22   0
 39. S23735       plastoquinol--plastocyanin re    228    5    5   4.22   0
 40. S26199       plastoquinol--plastocyanin re    230    5    5   4.22   0




1. US-08-249-182-6 (1-9)
A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments
TITLE autotaxin - human (fragments)
ORGANISM #formal_name Homo sapiens #common_name man
DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change 05-May-1993
ACCESSIONS A42329
REFERENCE A42329
#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.
#journal J. Biol. Chem. (1992) 267:2524-2529



[Histogram, lower part: frequency axis 100, 50, 10; horizontal axes SCORE 0 to 8 and STDEV 0 to 7; plot not reproduced]



PARAMETERS

Similarity matrix          Unitary     K-tuple               2
Mismatch penalty           1           Joining penalty       20
Gap penalty                1.00        Window size           5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

Initial scores to save     40          Alignments to save    15
Optimized scores to save   0           Display context       50



SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
            1       3         0.95

Times:      CPU            Total Elapsed
            00:01:25.89    00:01:50.00

Number of residues: 20816057
Number of sequences searched: 70848
Number of scores above cutoff: 3789

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.
Cut-off raised to 5.

The scores below are sorted by initial score.
Significance is calculated based on initial score. 
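
The relation between the initial score and the significance column can be checked roughly against the search statistics. The sketch below assumes, and the printout does not state this, that significance is the number of standard deviations by which the initial score exceeds the mean score. Because the printed mean (1) and standard deviation (0.95) are rounded, the recomputed values only approximate the listed ones. All names are hypothetical.

```python
def significance(initial_score: float, mean: float, std_dev: float) -> float:
    """Standard deviations above the mean (an assumed definition)."""
    return (initial_score - mean) / std_dev


# (initial score, significance as printed in the PIR 41 listing)
REPORTED = [(9, 8.44), (7, 6.33), (6, 5.28), (5, 4.22)]


def compare(mean: float = 1.0, std_dev: float = 0.95):
    """Pair each printed significance with the value recomputed from the
    rounded mean and standard deviation of the search statistics."""
    return [
        (score, printed, round(significance(score, mean, std_dev), 2))
        for score, printed in REPORTED
    ]


if __name__ == "__main__":
    for score, printed, estimate in compare():
        print(f"score {score}: printed {printed:.2f}, recomputed {estimate:.2f}")
```

The recomputed values differ from the printed ones by about 0.02 or less, which is consistent with the mean having been rounded for display.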



A 100% similar sequence to the query sequence was found:



Init. Opt. 



PS Disclosure; Fig 14; 96pp; English.
CC The sequence encodes IFN-beta(IFN-beta(2-7)-IFN-alpha2(1-5)-
CC (IFN-beta(9-56)-IFN-alpha1(7-54)) a modified interferon-beta which has
CC increased biological activity compared to natural IFN-beta, and
CC which is more effective in the treatment of viral or neoplastic
CC diseases, or immunosuppressed or immunodeficient conditions.
SQ Sequence 165 AA;
SQ 7 A; 12 R; 10 N; 7 D; 0 B; 3 C; 8 Q; 11 E; 0 Z; 6 G; 7 H;
SQ 11 I; 21 L; 8 K; 6 M; 10 F; 4 P; 11 S; 8 T; 2 W; 8 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                      |||||
RISPSSCLMDRHDFGFPQEEFDGNQFQKAPAILTIYEMLQNIFAIFRSDSSSTGWNETIVENLLANVYHQIN
30 40 50 60 70 X 80 X 90

HLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYL
100 110 120 130
100 110 120 130 

IntelliGenetics



FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_6p.res made by on Thu 22 Sep 94 10:38:59-PDT.

Query sequence being compared: US-08-249-182-6 (1-9)
Number of sequences searched: 70848
Number of scores above cutoff: 3789

Results of the initial comparison of US-08-249-182-6 (1-9) with:
Data bank : PIR 41, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 500 to 100000) plotted against initial score; plot not reproduced]



HLKTCLEEKLEKEDFTRGKLHSSLHLKRYYGRILHYL 
100 110 120 130 



14. US-08-249-182-6 (1-9)
P50280 Protein sequence encoding modified interferon-beta

ID P50280 standard; protein; 165 AA.
AC P50280;
DT 09-DEC-1991 (first entry)
DE Protein sequence encoding modified interferon-beta.
KW Interferon-beta; IFNX403; virucide; antitumor; immunostimulant.
OS Synthetic.
PN EP-131816-A.
PD 23-JAN-1985.
PF 28-JUN-1984; 107498.
PR 01-JUL-1983; GB-017880.
PA (SEAR ) SEARLE G D & CO.
PI Bell LD; Boseley PG; Smith JC; Houghton M.
DR WPI; 85-020165/04.
DR N-PSDB; N50307.
PT New modified beta-interferon(s) - useful as antiviral,
PT anti-neoplastic and immunomodulatory agents.
PS Disclosure; Fig 22; 96pp; English.
CC The sequence encodes IFN-beta(IFN-beta(9-56)-IFN-alpha2(7-53)
CC (Leu17-Cys17) a modified interferon-beta which has
CC increased biological activity compared to natural IFN-beta, and
CC which is more effective in the treatment of viral or neoplastic
CC diseases, or immunosuppressed or immunodeficient conditions.
SQ Sequence 165 AA;
SQ 6 A; 12 R; 10 N; 4 D; 0 B; 3 C; 7 Q; 12 E; 0 Z; 8 G; 7 H;
SQ 11 I; 22 L; 10 K; 5 M; 11 F; 1 P; 11 S; 9 T; 2 W; 9 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                      |||||
RKISLFSCLKDRHDFGFPQEEFGNGFQKAETILTIYEMLQNIFAIFRQDSSSTGWNETIVENLLANVYHQIN
30 40 50 60 70 X 80 X 90

HLKTVLEEKLEKEDFTRGKLHSSLHLKRYYGRILHYL
100 110 120 130



15. US-08-249-182-6 (1-9)
P50273 Protein sequence encoding modified interferon-beta

ID P50273 standard; protein; 165 AA.
AC P50273;
DT 09-DEC-1991 (first entry)
DE Protein sequence encoding modified interferon-beta.
KW Interferon-beta; IFNX410; virucide; antitumor; immunostimulant.
OS Synthetic.
PN EP-131816-A.
PD 23-JAN-1985.
PF 28-JUN-1984; 107498.
PR 01-JUL-1983; GB-017880.
PA (SEAR ) SEARLE G D & CO.
PI Bell LD; Boseley PG; Smith JC; Houghton M.
DR WPI; 85-020165/04.
DR N-PSDB; N50300.
PT New modified beta-interferon(s) - useful as antiviral,



k\* re ju j j/ / « mm 1 was usee to oigesi, i»ne i/wr sequences in Lne region 

CC of significant handicaps (see N30153, N30154, N30158, N30159), and
CC the restriction fragments were ligated to form hybrid DNA.
SQ Sequence 165 AA;
SQ 7 A; 12 R; 9 N; 6 D; 0 B; 3 C; 10 Q; 11 E; 0 Z; 6 G; 7 H;
SQ 12 I; 20 L; 10 K; 6 M; 9 F; 3 P; 10 S; 9 T; 2 W; 8 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

                                                  X       X
                                                  YDVPWNETI
                                                      |||||
RISPFSCLKDRHDFGFPQEIKSLQQFQKEDAALTIYEMLQNIFAIFRQDSSSTGWNETIVENLLANVYHQIN
30 40 50 60 70 X 80 X 90

HLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYL
100 110 120 130



13. US-0B-249-182-6 (1-9) 

P50207 Modified hunan interferon UFN-457). 



ID P50207 standard; protein; 165 AA. 

AC P50207; 

DT 01-JAN-1980 (first entry) 

DE Modified hunan interferon (IFN-457). 

KH Interferon; antitumor; virucide! innunostinulant; 

KW protein engineering; ss. 

OS Hono sapiens. 

PN EP-146413-A. 

PD 26-JUN-1985. 

PF 20-DEC-1984; 308984. 

PR 21-DEC-1983; GB-034102. 

PA (SEAR ) SEARLE G D & CO. 

PI Bell LD» Snith JC> Porter AGr Adair JR; 

DR WPI; 85-154456/26. 

DR N-PSDB; N50214. 

PT New interferon with modified cysteine pattern - useful for

PT altered antiviral and antiproliferative effects, stability etc.

PS Disclosure; Page 54; 62pp; English.

CC The IFN is obtained by expression from a microorganism which has been

CC transformed by recombinant DNA techniques. The modified IFN may

CC allow selective properties to be shown, e.g. in the alteration of

CC cellular membranes, as virucidal, immunomodulatory, antitumor and an

CC antiproliferative agent. It has increased stability which results

CC in improved recovery during production, increased storage life and

CC prolonged activity. The IFN may be human-alpha + human-beta, human-

CC beta + human-alpha or human-beta + human-gamma. Cysteine deletions

CC or substitutions permit the formation of disulfide linkages at

CC positions corresponding to linkages in a 2nd IFN. See also

CC N50207-13 and P50192-97 and P50206.

SQ Sequence 165 AA;

SQ 6 A; 11 R; 11 N; 6 D; 0 B; 4 C; 12 Q; 13 E; 0 Z; 5 G; 5 H;

SQ 11 I; 23 L; 11 K; 4 M; 9 F; 2 P; 9 S; 7 T; 3 W; 9 Y; 4 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

NGRLEYCLKDRMNFDIPEEIKQLQQFQKEDAALTIYEMLSNIFAIFRQDSSSTGWNETIVENLLANVYHQIN
30 40 50 60 70 X 80 X 90 



PA (SEAR ) SEARLE G D & CO. 

PI Bell LD; Boseley PG; Smith JC; Houghton M. 

DR WPI; 85-020165/04. 

DR N-PSDB; N50308. 

PT New modified beta-interferon(s) - useful as antiviral,

PT anti-neoplastic and immunomodulatory agents.

PS Disclosure; Fig 21; 96pp; English.

CC The sequence encodes IFN-beta(IFN-beta(1-56)-IFN-alpha2(1-53)

CC (Leu16-Cys16) a modified interferon-beta which has

CC increased biological activity compared to natural IFN-beta, and

CC which is more effective in the treatment of viral or neoplastic

CC diseases, or immunosuppressed or immunodeficient conditions.

SQ Sequence 164 AA;

SQ 6 A; 12 R; 9 N; 5 D; 0 B; 4 C; 8 Q; 12 E; 0 Z; 7 G; 7 H;

SQ 11 I; 21 L; 10 K; 5 M; 10 F; 2 P; 10 S; 10 T; 2 W; 8 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

RKISLFSCLKDRHDFGFPQEEFGNQFQKAETILTIYEMLQNIFAIFRQDSSSTGWNETIVENLLANVYHSIN
30 40 50 60 70 X 80 90 

HIKTVLEEKLEKEDFTRGKLHSSLHLKRYYGRILHYL 
100 110 120 130 



12. US-08-249-182-6 (1-9) 

P30227 Sequence of hybrid interferon HuIFN-alpha-61A-beta

ID P30227 standard; Protein; 165 AA. 

AC P30227; 

DT 25-MAY-1992 (first entry)

DE Sequence of hybrid interferon HuIFN-alpha-61A-beta-1

DE composed of AAs 1-41 of HuIFN-alpha-61A and AAs 43-166 of

DE HuIFN-beta-1.

KW Hybrid interferon; antiviral; therapy; cancer; tumour.

OS Homo sapiens.

PN WO8302461-A.

PD 21-JUL-1983.

PF 18-JAN-1983; 900607.

PR 19-JAN-1982; US-340782.

PR 03-FEB-1983; US-463574.

PR 15-JUL-1985; US-755265.

PA (CETU-) CETUS CORP.

PI Mark DF, Creasey AA;

DR WPI; 83-723186/30.

DR N-PSDB; N30160.

PT Multiclass hybrid interferon polypeptide(s) - with restricted

PT antiviral and cell growth regulatory activities

PS Example; Fig 23; 61pp; English.

CC The inventors claim a multiclass hybrid interferon polypeptide and a

CC DNA unit having a nucleotide sequence which encodes it. Pref. the

CC AA sequence consists of alpha and beta interferons. Pref. IF1 is

CC (i) the 1-73 AA seq. of HuIFN-alpha-1 (and IF2 is the 74-166 AA seq.

CC of HuIFN-beta-1) (see N30155, P30222); or (ii) the 1-41 AA seq. of

CC HuIFN-alpha-61A (and IF2 is the 43-166 AA seq. of HuIFN-beta-1) (see

CC N30160, P30227). Alternatively IF1 is the amino terminal end of a

CC beta-IF and IF2 is the carboxy terminal of an alpha-IF (esp. the

CC 1-73 seq. of HuIFN-beta-1 and the 74-167 seq. of HuIFN-alpha-1

CC resp.) (see N30156, P30223). In the examples plasmids pGW5 and

CC pDM101/trp/beta-1 and p-alpha-61A were used (see N30151, N30152,



P80055 Sequence of human interferon (huIFN) alpha-61A-bet

ID P80055 standard; protein; 161 AA.

AC P80055;

DT 17-NOV-1990 (first entry)

DE Sequence of human interferon (huIFN) alpha-61A-beta-1 hybrid

KW Alpha-beta hybrid interferon; multi-class hybrid interferon;

KW antitumour; antiviral; therapy.

OS Homo sapiens.

PN US4758428-A.

PD 19-JUL-1988.

PF 15-JUL-1985; 755265.

PR 19-JAN-1983; CA-419758.

PR 15-JUL-1985; US-755265.

PA (CETU) Cetus Corp.

PI Mark DF, Creasey AA;

DR WPI; 88-219882/31.

DR N-PSDB; N80052.

PT Multi-class hybrid interferon polypeptide(s) -

PT having sequence from interferon-alpha-1 and sequence from

PT interferon-beta-1 for restricted activity

PS Example; Fig 23; 24pp; English.

CC Multi-class hybrid interferon polypeptides having an AA sequence composed

CC of 2 distinct subsequences are claimed. The plasmids used in the

CC construction of huIFN-alpha-61A-beta-1 hybrid are plasmids palpha61A and

CC pDM101/trp/beta-1. The hybrid gene was constructed by taking advantage of

CC homologies between huIFN alpha-61A and huIFN beta-1 at around AA 40 of

CC both proteins. The sequence 5'-proximal to the DdeI restriction enzyme

CC cutting site of the huIFN alpha-61A DNA is ligated to the DNA sequence

CC 3'-proximal to the site of huIFN beta-1, to create a fusion of the

CC 2 genes while preserving the translational reading frame of both genes.

CC This hybrid interferon is denoted huIFN alpha-61A-beta-1. The amino

CC terminal portion of this polypeptide starting with methionine is composed

CC of the sequence 1-41 of huIFN alpha-61A and the carboxy terminal portion

CC is composed of AAs 47-166 of huIFN beta-1.

SQ Sequence 161 AA;

SQ 7 A; 12 R; 10 N; 6 D; 0 B; 3 C; 9 Q; 10 E; 0 Z; 6 G; 7 H;

SQ 11 I; 20 L; 8 K; 6 M; 9 F; 3 P; 10 S; 9 T; 2 W; 8 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

ASMGRISPFSCLKDRHDFGFPQLQQFQKEDAALTIYEMLQNIFAIFRSDSSSTGWNETIVENLLANVYHQIN
20 30 40 50 60 70 80 90 

HLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYL 
100 110 120 



11. US-08-249-182-6 (1-9) 

P50281 Protein sequence encoding modified interferon-beta

ID P50281 standard; protein; 164 AA.

AC P50281;

DT 09-DEC-1991 (first entry)

DE Protein sequence encoding modified interferon-beta.

KW Interferon-beta; IFNX406; virucide; antitumor; immunostimulant.

OS Synthetic. 

PN EP-131816-A. 

PD 23-JAN-1985. 

PF 28-JUN-1984; 107498. 



YDVPWNETI
||  |||

PEVESSPLDLSQKKEKQSEYEQQVVKSIKPQKSEPQPYSQTYAKAPIWESYDFDWNEDDAKFILPAPYRLTK
30 40 50 60 70 X 80 90



ADEIVLGSKIVKLRTIIETAIKTQNYSALPEAVFELD
100 110 120 130 



9. US-08-249-182-6 (1-9)

P60691 Sequence of the region coding for a hybrid protein 



ID P60691 standard; protein; 161 AA. 

AC P60691; 

DT 06-AUG-1991 (first entry)

DE Sequence of the region coding for a hybrid protein

DE comprising human interferon-alpha-61A and interferon-beta-1

DE from plasmid p-alpha-61A and plasmid pDM101/trp/beta-1

KW Plasmid pDM101/trp/beta-1; human; interferon-beta-1; virucide;

KW immunostimulant; plasmid p-alpha-61A; chimeric protein;

KW chimeric gene; hybrid protein; ss.

OS Homo sapiens. 

PN US4569908-A. 

PD 11-FEB-1986.

PF 03-FEB-1983; 463574.

PR 19-JAN-1982; US-340782. 

PR 19-JAN-1983; CA-419758. 

PR 03-FEB-1983; US-463574. 

PA (CETU ) CETUS CORP. 

PI Mark DF, Creasey AA;

DR WPI; 86-061770/09. 

DR N-PSDB; N60657. 

PT DNA producing hybrid interferon polypeptide - which has activity 

PT restricted to cell growth regulatory or antiviral activity 

PS Disclosure; Fig. 23; 25pp; English.

CC The N-terminal end coding region of the HuIFN-alpha-61A DNA is

CC fused to the DNA coding for the C-terminal end region of

CC HuIFN-beta-1 in such a way that the translational reading

CC frame of the 2 proteins are preserved and the resulting

CC protein being expressed from this hybrid gene will have the

CC AA sequence of HuIFN-alpha-61A at its N-terminal portion and

CC the AA sequence of HuIFN-beta-1 at its C-terminal portion.

CC The chimeric protein may be used to treat tumors and cancers

CC without immunosuppression side effects. They may also be used

CC to treat encephalomyocarditis virus infection, rabies and other

CC viral zoonoses and arbo virus infections. See also N60652-6

CC and P60686-90.

SQ Sequence 161 AA;

SQ 7 A; 12 R; 9 N; 6 D; 0 B; 3 C; 9 Q; 10 E; 0 Z; 6 G; 7 H;

SQ 11 I; 20 L; 9 K; 6 M; 9 F; 3 P; 10 S; 9 T; 2 W; 8 Y; 5 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

AQMGRISPFSCLKDRHDFGFPQLQQFQKEDAALTIYEMLQNIFAIFRSDSSSTGWNETIVENLLANVYHQIN
20 30 40 50 60 70 80 90 



HLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYL 
100 110 120 



CC the vector. (D) Prodn. of a yeast or non-yeast polypeptide or its
CC deriv. by culturing the transformed yeast. The polypeptides are 
CC hormones, antiviral and anticancer peptides, enzymes and interferon.
SQ Sequence 155 AA;

SQ 6 A; 10 R; 11 N; 5 D; 0 B; 3 C; 10 Q; 13 E; 0 Z; 5 G; 5 H;

SQ 11 I; 21 L; 11 K; 3 M; 8 F; 1 P; 8 S; 7 T; 3 W; 9 Y; 5 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

NGRLEYCLKDRHNFDIPEEIKQLQQFSKEDAALTIYEMLQNIFAIFRSDSSSTGWNETIVEMLLANVYHSIN
20 30 40 50 60 X 70 X 80 

HLKTVLEEKLEKEDFTRGKLHSSLHLKRYYGRILHYL 
90 100 110 120 



8. US-08-249-182-6 (1-9)

R29179 Astrovirus serotype A2 clone A43351-encoded polype

ID R29179 standard; Protein; 156 AA.

AC R29179;

DT 08-APR-1993 (first entry)

DE Astrovirus serotype A2 clone A43351-encoded polypeptide.

KW Sequence-independent single primer amplification; SISPA;

KW viral gastroenteritis.

OS Astrovirus.

PN WO9220803-A.

PD 26-NOV-1992.

PF 20-MAY-1992; U04276.

PR 20-MAY-1991; US-702731.

PA (GENE-) GENELABS INC.

PA (STRD ) UNIV LELAND STANFORD JUNIOR.

PI Greenberg HB, Kim JP, Matsui SM, Reyes GR;

DR WPI; 92-415781/50.

DR N-PSDB; Q31811.

PT Recombinant Astrovirus polynucleotide(s) (e.g. cDNA) - and

PT corresponding polypeptide antigens and antibodies, useful in

PT diagnosis of Astrovirus infection and as vaccines

PS Claim 3; Fig 5; 85pp; English.

CC The cell lines A1 LLCMK2 and LLCMK2 (i.e. monkey kidney cell lines

CC infected with Astrovirus type 1 and uninfected cells, respectively)

CC were cultured and RNA was extracted. The RNA was used to generate

CC cDNA which was amplified by the SISPA method. The amplified cDNA was

CC cloned into lambda gt11, packaged and used to infect E.coli KM392.

CC The cDNA library was screened using astrovirus-immunised rabbit sera

CC to obtain ten clones containing Astroviral coding sequences (i.e.

CC clones A35, A43, A39, A1, A2, A11, A13, A14, A21 and A33). The

CC clones can be used as probes to obtain a complete set of

CC overlapping genomic cDNA clones. The sequence of A43351 is made up

CC of the overlapping clones A43, A35 and A1. The sequence of the

CC A43351 polypeptide referred to as SEQ ID NO 14 in the claims

CC consists of amino acids 7 to 150.

CC See Q31806-Q31816 and Q37634.

SQ Sequence 156 AA;

SQ 15 A; 6 R; 2 N; 7 D; 0 B; 1 C; 10 Q; 17 E; 0 Z; 4 G; 0 H;

SQ 9 I; 11 L; 19 K; 0 M; 6 F; n P; u S; 9 T; 2 W; 7 Y; 7 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0 



PD 13-APR-1983. 

PF 01-SEP-1982; 024871.

PR 03-OCT-1981; GB-029937.

PR 26-MAR-1982; GB-008988.

PR 01-SEP-1982; GB-024871.

PA (CIBA ) CIBA GEIGY AG.

PI Meyer F;

DR WPI; 83-37194K/16.

DR N-PSDB; N30101.

PT Human interferon polypeptide - produced by host cells, esp.

PT E.coli transformed with recombinant DNA contg. sequence

PT obtainable from human lymphoblastoid cells

PS Claim 42; Page 108-9 and Fig 5; 142pp; German.

CC The inventors claim DNA encoding a polypeptide with human

CC lymphoblastoid interferon activity, hosts which have been

CC transformed with the DNA and the polypeptide. Variants of the DNA

CC sequence are also claimed. The polypeptides are useful as immuno-

CC modulators, esp. as antiviral, antitumour and anticancer agents.

SQ Sequence 155 AA;

SQ 6 A; 10 R; 11 N; 5 D; 0 B; 3 C; 10 Q; 13 E; 0 Z; 5 G; 5 H;

SQ 11 I; 21 L; 11 K; 3 M; 8 F; 1 P; 8 S; 7 T; 3 W; 9 Y; 5 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
    |||||

NGRLEYCLKDRMNFDIPEEIKQLQQFQKEDAALTIYEMLQNIFAIFRQDSSSTGWNETIVENLLANVYHQIN
20 30 40 50 60 X 70 X 80 



HLKTVLEEKLEKEDFTRGKLMSSLHLKRYYGRILHYL 
90 100 110 120 



7. US-08-249-182-6 (1-9) 

P40124 Sequence encoded by the cDNA insert of the recombi 

ID P40124 standard; Protein; 155 AA.

AC P40124;

DT 12-FEB-1992 (first entry)

DE Sequence encoded by the cDNA insert of the recombinant plasmid

DE CG-pBR 322/HLycIFN-beta1.

KW Yeast expression vector; promoter.

OS Homo sapiens. 

PN EP-100561-A.

PD 15-FEB-1984. 

PF 08-AUG-1983; 107804.

PR 09-AUG-1982; GB-022883.

PR 31-DEC-1982; GB-037026.

PR 02-JUN-1983; GB-015145.

PR 14-JUL-1983; GB-019099.

PA (CIBA ) CIBA GEIGY AG.

PI Hinnen A, Meyhack B, Meyer F;

DR WPI; 84-044243/08.

DR N-PSDB; N40108.

PT Acid phosphatase promoting DNA fragment - for expressing

PT peptide(s) in yeasts which are more easily cultured than E.coli

PS Example; Fig 11; 166pp; English.

CC The inventors claim: (A) DNA fragment consisting of yeast acid

CC phosphatase promoter and flanking sequences or its mutants which

CC retain the promoter function. (B) Hybrid vector consisting of a

CC yeast acid phosphatase promoter and a yeast or non-yeast polypeptide

CC coding region controlled by the promoter. (C) A yeast transformed by



10 20 30 40 X 50 



5. US-08-249-182-6 (1-9) 

R29174 Astrovirus serotype A1 clone A35-encoded polypepti

ID R29174 standard; Protein; 75 AA.

AC R29174;

DT 06-APR-1993 (first entry)

DE Astrovirus serotype A1 clone A35-encoded polypeptide.

KW Sequence-independent single primer amplification; SISPA;

KW viral gastroenteritis.

OS Astrovirus.

PN WO9220803-A.

PD 26-NOV-1992.

PF 20-MAY-1992; U04276.

PR 20-MAY-1991; US-702731.

PA (GENE-) GENELABS INC.

PA (STRD ) UNIV LELAND STANFORD JUNIOR.

PI Greenberg HB, Kim JP, Matsui SM, Reyes GR;

DR WPI; 92-415781/50.

DR N-PSDB; Q31806.

PT Recombinant Astrovirus polynucleotide(s) (e.g. cDNA) - and

PT corresponding polypeptide antigens and antibodies, useful in

PT diagnosis of Astrovirus infection and as vaccines

PS Claim 3; Page 50-51; 85pp; English.

CC The cell lines A1 LLCMK2 and LLCMK2 (i.e. monkey kidney cell lines

CC infected with Astrovirus type 1 and uninfected cells, respectively)

CC were cultured and RNA was extracted. The RNA was used to generate

CC cDNA which was amplified by the SISPA method. The amplified cDNA was

CC cloned into lambda gt11, packaged and used to infect E.coli KM392.

CC The cDNA library was screened using astrovirus-immunised rabbit sera

CC to obtain ten clones containing Astroviral coding sequences (i.e.

CC clones A35, A43, A39, A1, A2, A11, A13, A14, A21 and A33). The

CC clones can be used as probes to obtain a complete set of

CC overlapping genomic cDNA clones.

CC See also Q31807-Q31816 and Q37434.

SQ Sequence 75 AA;

SQ 5 A; 2 R; 1 N; 5 D; 0 B; 0 C; 5 Q; 7 E; 0 Z; 2 G; 0 H;

SQ 7 I; 4 L; 8 K; 0 M; 2 F; 6 P; 6 S; 4 T; 2 W; 5 Y; 4 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0



X X
YDVPWNETI
||  |||

SEYEQQVVKSIKPQKSEPQPYSQTYGKAPIWESYDFDWNEDDAKFILPAPYRLTKADEIVLGSKIVKLRTII
10 20 30 X 40 X 50 60 70 



ETA 



6. US-08-249-182-6 (1-9) 

P30180 Sequence of a polypeptide with human lymphoblastoi

ID P30180 standard; Protein; 155 AA.

AC P30180;

DT 14-JUN-1992 (first entry)

DE Sequence of a polypeptide with human lymphoblastoid interferon

DE activity encoded by plasmid CG-pBR 322/HLycIFN-beta1.

KW Immunomodulator; antiviral agent; antitumor; anticancer.

OS Homo sapiens.



CC ced-4 sequences. These peptides were used to construct degenerate
CC primers and probes.
SQ Sequence 30 AA;

SQ 1 A; 4 R; 1 N; 3 D; 0 B; 1 C; 2 Q; 4 E; 0 Z; 0 G; 0 H;

SQ 2 I; 3 L; 0 K; 0 M; 0 F; 0 P; 1 S; 3 T; 1 W; 0 Y; 4 V;



Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X X
YDVPWNETI
 ||   |||

DDVVQEETIRWAQELRLRCLVTTRDVEISN
X 10 20 30 
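
The two SQ lines in each record list how many times each residue occurs in the entry, so a garbled count can be cross-checked against the displayed sequence. The sketch below is an illustration only, not part of the FastDB output. It assumes the residue order seen on the SQ lines (A R N D B C Q E Z G H, then I L K M F P S T W Y V), and it assumes the R42780 sequence reads DDVVQEETIRWAQELRLRCLVTTRDVEISN, that is, with W at position 11, which is what the printed counts of 1 W and 0 M imply. The function names are hypothetical.

```python
from collections import Counter

# Residue order assumed from the SQ composition lines of the printout.
RESIDUE_ORDER = "ARNDBCQEZGHILKMFPSTWYV"


def composition(sequence: str) -> dict[str, int]:
    """Count each residue code, reporting zero for codes that do not occur."""
    counts = Counter(sequence.upper())
    return {residue: counts.get(residue, 0) for residue in RESIDUE_ORDER}


def format_sq_lines(sequence: str) -> list[str]:
    """Render the counts as two SQ lines of eleven residues each."""
    comp = composition(sequence)
    lines = []
    for chunk in (RESIDUE_ORDER[:11], RESIDUE_ORDER[11:]):
        body = "; ".join(f"{comp[residue]} {residue}" for residue in chunk)
        lines.append(f"SQ {body};")
    return lines


if __name__ == "__main__":
    # Assumed reading of the R42780 consensus peptide (30 AA).
    r42780 = "DDVVQEETIRWAQELRLRCLVTTRDVEISN"
    for line in format_sq_lines(r42780):
        print(line)
```

If the counts printed by this sketch disagree with the SQ lines of a record, either the composition line or the sequence line is likely to carry a scanning error.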



4. US-08-249-182-6 (1-9)
R27732 SalF20.5R.

ID R27732 standard; peptide; 59 AA.

AC R27732;

DT 09-MAR-1993 (first entry)

DE SalF20.5R.

KW Virus vector; vaccinia virus; papillomavirus; HPV;

KW immunotherapeutic.

OS Vaccinia virus.

PN WO9216636-A.

PD 01-OCT-1992.

PF 10-MAR-1992; G00424.

PR 14-MAR-1991; GB-005383.

PA (IMMU ) IMMUNOLOGY LTD.

PI Boursnell MEG, Inglis SC, Munro AJ;

DR WPI; 92-349219/42.

DR N-PSDB; Q29392.

PT Recombinant virus vectors encoding human papillomavirus proteins 

PT - for treating and vaccinating against HPV infections and 

PT conditions caused by them, such as cervical cancer

PS Disclosure; Fig 10; 83pp; English.

CC To make a recombinant virus vector comprising human papillomavirus

CC genes inserted into the vaccinia virus genome, neutral sites

CC for insertion must be utilised such that replicative ability is not

CC adversely affected. The neutral sites are identified by analysing

CC the viral genome to identify ORFs which are likely to encode

CC functional genes and selecting sites between such ORFs or within

CC sequences for non-functional genes. One such neutral site is site B,

CC present in an intergenic region between SalF20R and SalF20.5R. It is

CC placed 70 bases upstream of SalF20.5R to avoid promoter elements

CC associated with that gene. Also there is no transcription

CC termination signal with which site B could interfere and hence the

CC sequence is suitable as a neutral insertion site. HPV DNA sequences

CC may be inserted at this site; e.g. those encoding E6 or E7 of HPV 16

CC and 18 or mutants of these proteins. The recombinant virus vector may 

CC be used immunotherapeutically to activate cells of the immune system 

CC against HPV. See also R27723-43. 

SQ Sequence 59 AA;

SQ 3 A; 1 R; 7 N; 4 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 3 H;

SQ 8 I; 6 L; 1 K; 1 M; 3 F; 0 P; 1 S; 4 T; 1 W; 6 Y; 9 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34
Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0



X X
YDVPWNETI
|| | ||



ID R42781 standard; Protein; 30 AA.

AC R42781;

DT 28-APR-1994 (first entry)

DE Consensus peptide, (ced-4)

KW Long-distance homology; evolution; nematode;

KW hybridisation; lower organism; structural homologue;

KW Alzheimer's disease; cell death gene; PCR; polymerase chain reaction;

KW ciona intestinalis; echinoderm; lamprey; puffer fish;

KW mammal; probe.

OS Synthetic.

PN WO9320237-A.

PD 14-OCT-1993.

PF 01-APR-1993; U03102.

PR 01-APR-1992; US-861458.

PA (CAMB-) CAMBRIDGE NEUROSCIENCE INC.

PI Johnson CD, Marchionni MA;

DR WPI; 93-336943/42.

PT Long-distance homology cloning of genes from lower organisms -

PT used to identify DNA that codes for evolutionary conserved

PT aminoacid sequences

PS Disclosure; Fig 25; 188pp; English.

CC Sequences (R42771-87) show conserved stretches within deduced

CC ced-4 sequences. These peptides were used to construct degenerate

CC primers and probes.

SQ Sequence 30 AA;

SQ 1 A; 4 R; 1 N; 4 D; 0 B; 2 C; 2 Q; 3 E; 0 Z; 0 G; 0 H;

SQ 3 I; 3 L; 0 K; 0 M; 0 F; 0 P; 0 S; 3 T; 1 W; 0 Y; 3 V;

Initial Score = 5 Optimized Score = 5 Significance = 4.34

Residue Identity = 55% Matches = 5 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
 ||   |||

DDVVQDETIRWAQELRLRCLITTRDVEICN

X 10 20 30 
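
The Matches, Mismatches and Residue Identity figures above can be reproduced from an ungapped alignment by comparing the query with the equally long window of the database sequence beneath it. The sketch below is an illustration under stated assumptions, not the FastDB implementation: it assumes no gaps, and it assumes the percentage is truncated rather than rounded, since 5 matches out of 9 residues is printed as 55%. The function name is hypothetical.

```python
def residue_identity(query: str, window: str) -> tuple[int, int, int]:
    """Return (matches, mismatches, percent identity) for an ungapped pair.

    The percentage is truncated to a whole number, which is an assumption
    made to agree with the printed value of 55% for 5 matches in 9 residues.
    """
    if len(query) != len(window):
        raise ValueError("query and window must have the same length")
    if not query:
        raise ValueError("sequences must not be empty")
    matches = sum(q == s for q, s in zip(query, window))
    mismatches = len(query) - matches
    percent = 100 * matches // len(query)
    return matches, mismatches, percent


if __name__ == "__main__":
    query = "YDVPWNETI"
    # First nine residues of the R42781 consensus peptide shown above.
    print(residue_identity(query, "DDVVQDETI"))
    # The query against itself, as in the R37448 hit.
    print(residue_identity(query, query))
```

The same arithmetic gives 60% for 6 matches and 50% for 5 matches against the ten-residue query US-08-249-182-5.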



3. US-08-249-182-6 (1-9) 

R42780 Consensus peptide, (ced-4)

ID R42780 standard; Protein; 30 AA.

AC R42780;

DT 28-APR-1994 (first entry)

DE Consensus peptide, (ced-4)

KW Long-distance homology; evolution; nematode;

KW hybridisation; lower organism; structural homologue;

KW Alzheimer's disease; cell death gene; PCR; polymerase chain reaction;

KW ciona intestinalis; echinoderm; lamprey; puffer fish;

KW mammal; probe.

OS Synthetic.

PN WO9320237-A.

PD 14-OCT-1993.

PF 01-APR-1993; U03102.

PR 01-APR-1992; US-861458.

PA (CAMB-) CAMBRIDGE NEUROSCIENCE INC.

PI Johnson CD, Marchionni MA;

DR WPI; 93-336943/42.

PT Long-distance homology cloning of genes from lower organisms -

PT used to identify DNA that codes for evolutionary conserved

PT aminoacid sequences

PS Disclosure; Fig 25; 188pp; English.

CC Sequences (R42771-87) show conserved stretches within deduced 



22. P50196  Modified human interferon (IF    166   5   5   4.34   0

23. P50195  Modified human interferon (IF    166   5   5   4.34   0
24. P50194  Modified human interferon (IF    166   5   5   4.34   0
25. P50193  Modified human interferon (IF    166   5   5   4.34   0
26. P50192  Modified human interferon (IF    166   5   5   4.34   0
27. P50279  Protein sequence encoding syn    166   5   5   4.34   0
28. P50278  Protein sequence encoding mod    166   5   5   4.34   0
29. P50277  Protein sequence encoding mod    166   5   5   4.34   0
30. P50276  Protein sequence encoding mod    166   5   5   4.34   0
31. P50275  Protein sequence encoding mod    166   5   5   4.34   0
32. P50274  Protein sequence encoding mod    166   5   5   4.34   0
33. P50272  Protein sequence encoding mod    166   5   5   4.34   0
34. P50271  Protein sequence encoding mod    166   5   5   4.34   0
35. P50270  Protein sequence encoding mod    166   5   5   4.34   0
36. P50262  Sequence encoded by the seque    166   5   5   4.34   0
37. P50591  Sequence of the interferon mu    166   5   5   4.34   0
38. P50032  Sequence of new modified huma    166   5   5   4.34   0
39. P50031  Sequence of new modified huma    166   5   5   4.34   0
40. P50030  Sequence of new modified huma    166   5   5   4.34   0



1. US-08-249-182-6 (1-9) 

R37448 Autotaxin peptide ATX 47. 

ID R37448 standard; peptide; 9 AA. 

AC R37448; 

DT 22-JUL-1993 (first entry) 

DE Autotaxin peptide ATX 47. 

KW Cell motility stimulating; cancer metastasis; antibody; detection; 

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins. 

OS Synthetic. 

PN US7822043-A. 

PD 01-JAN-1993. 

PF 17-JAN-1992; 822043. 

PR 17-JAN-1992; US-822043. 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE. 

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M.

DR WPI; 93-085861/10. 

PT Motility stimulating protein named autotaxin - useful in cancer 

PT diagnosis and therapy 

PS Example; Page 33; 36pp; English. 

CC The sequence is that of autotaxin peptide ATX 47. It may be used to 

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the 

CC presence of autotaxin. The level of autotaxin in tissue or body 

CC fluids can be used to predict disease outcomes and/or choice of 

CC therapy which may also include autotaxin inhibitors. Autotaxin 

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 9 AA; 

SQ 0 A; 0 R; 1 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 0 G; 0 H;

SQ 1 I; 0 L; 0 K; 0 M; 0 F; 1 P; 0 S; 1 T; 1 W; 1 Y; 1 V;

Initial Score = 9 Optimized Score = 9 Significance = 7.81
Residue Identity = 100% Matches = 9 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X X
YDVPWNETI
|||||||||
YDVPWNETI
X X



2. US-08-249-182-6 (1-9) 



PARAMETERS 



Similarity matrix         Unitary     K-tuple               2
Mismatch penalty          1           Joining penalty       20
Gap penalty               1.00        Window size           5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40          Alignments to save    15
Optimized scores to save  0           Display context       50






SEARCH STATISTICS 




Scores:   Mean          Median          Standard Deviation
          0             1               1.15

Times:    CPU           Total Elapsed
          00:00:27.93   00:00:30.00

Number of residues:             5287517
Number of sequences searched:   42145
Number of scores above cutoff:  3921





Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
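
The Significance column appears to express how many standard deviations the initial score lies above the mean of all scores in the search. The sketch below assumes that reading (a plain z-score) and uses the rounded mean and standard deviation printed under SEARCH STATISTICS. It is an illustration, not the FastDB source, and the function name is hypothetical. Because the printed statistics are rounded, the results land close to, but not exactly on, the reported 4.34 and 7.81.

```python
def significance(score: float, mean: float, std_dev: float) -> float:
    """Number of standard deviations by which `score` exceeds `mean`."""
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / std_dev


if __name__ == "__main__":
    # Rounded values as printed in the search statistics (assumed inputs).
    mean, std_dev = 0.0, 1.15
    for initial_score in (5, 9):
        print(initial_score, round(significance(initial_score, mean, std_dev), 2))
```

With these inputs an initial score of 5 falls a little over four standard deviations above the mean, which agrees with the "4 standard deviations above mean" marker in the score list.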

A 100% identical sequence to the query sequence was found:

                                                  Init.  Opt.
Sequence Name Description                 Length  Score  Score  Sig.   Frame

 1. R37448  Autotaxin peptide ATX 47.          9      9      9  7.81       0

The list of other best scores is:

                                                  Init.  Opt.
Sequence Name Description                 Length  Score  Score  Sig.   Frame


            **** 4 standard deviations above mean ****
 2. R42781  Consensus peptide, (ced-4)        30   5   5   4.34   0
 3. R42780  Consensus peptide, (ced-4)        30   5   5   4.34   0
 4. R27732  SalF20.5R.                        59   5   5   4.34   0
 5. R29174  Astrovirus serotype A1 clone      75   5   5   4.34   0
 6. P30180  Sequence of a polypeptide wit    155   5   5   4.34   0
 7. P40124  Sequence encoded by the cDNA     155   5   5   4.34   0
 8. R29179  Astrovirus serotype A2 clone     156   5   5   4.34   0
 9. P60691  Sequence of the region coding    161   5   5   4.34   0
10. P80055  Sequence of human interferon     161   5   5   4.34   0
11. P50281  Protein sequence encoding mod    164   5   5   4.34   0
12. P30227  Sequence of hybrid interferon    165   5   5   4.34   0
13. P50207  Modified human interferon (IF    165   5   5   4.34   0
14. P50280  Protein sequence encoding mod    165   5   5   4.34   0
15. P50273  Protein sequence encoding mod    165   5   5   4.34   0
16. P20025  Sequence encoded by human fib    166   5   5   4.34   0
17. P30222  Sequence of hybrid interferon    166   5   5   4.34   0
18. P30219  Sequence of interferon (HuIFN    166   5   5   4.34   0
19. P40626  Sequence of interferon-beta-S    166   5   5   4.34   0
20. P50206  Modified human interferon (IF    166   5   5   4.34   0
21. P50197  Modified human interferon (IF    166   5   5   4.34   0



Results file u249_6a.res made by on Thu 22 Sep 94 10:51 541-PDT. 



Query sequence being compared: US-08-249-182-6 (1-9)
Number of sequences searched: 42145
Number of scores above cutoff: 3921

Results of the initial comparison of US-08-249-182-6 (1-9) with:
Data bank : A-GeneSeq 15, all entries

[Histogram of initial scores: sequence count on a log axis (5 to 100000) against SCORE 0 to 7 and STDEV 1 to 7]



FT DISULFID ji jt BY SIMILARITY.

SQ SEQUENCE 61 AA; 7033 MW; 16901 CN;



Initial Score = 5 Optimized Score = 5 Significance = 3.49
Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X X
PEEVTRPNYL
||   |  ||

FTCFTTPSDTSETCPDGQNICYEKRWNSHQGVEIKGCVASCPEFESRFRYLLCCRIDNCNK 
10 20 30 40 X 50 60 



15. US-08-249-182-5 (1-10)

NUCC_SECCE NADH-PLASTOQUINONE OXIDOREDUCTASE 49 KD SUBUNIT, C



ID NUCC_SECCE STANDARD; PRT; 90 AA.

AC P27758; 

DT 01-AUG-1992 (REL. 23, CREATED) 

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE) 

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)

DE NADH-PLASTOQUINONE OXIDOREDUCTASE 49 KD SUBUNIT, CHLOROPLAST

DE (EC 1.6.5.3) (FRAGMENT).

GN NDHH.

OS SECALE CEREALE (RYE).

OG CHLOROPLAST.

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE;

OC CYPERALES; GRAMINEAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 90037033

RA PROMBONA A., SUBRAMANIAN A.R.;

RL J. BIOL. CHEM. 264:19060-19065(1989). 

CC -!- FUNCTION: TRANSFER OF ELECTRONS FROM NADH TO THE RESPIRATORY 
CC CHAIN. THE IMMEDIATE ELECTRON ACCEPTOR FOR THE ENZYME IS BELIEVED 

CC TO BE PLASTOQUINONE. COMPONENT OF THE IRON-SULFUR (IP) FRAGMENT OF

CC THE ENZYME. 

CC -!- CATALYTIC ACTIVITY: NADH + PLASTOQUINONE = NAD(+) + PLASTOQUINOL. 

CC -!- SIMILARITY: TO OTHER COMPLEX I 49 KD SUBUNITS AND TO E.COLI

CC FORMATE HYDROGENLYASE (FHL) SUBUNIT HYCE.

DR EMBL; X14557; CHSCRPS3.

DR PIR; C34435; C34435.

DR PROSITE; PS00535; COMPLEX1_49K.

KW OXIDOREDUCTASE; NAD; PLASTOQUINONE; CHLOROPLAST; IRON-SULFUR; 4FE-4S.

FT NON_TER 1 1

FT NON_TER 90 90

SQ SEQUENCE 90 AA; 10509 MW; 42181 CN;



Initial Score = 5 Optimized Score = 5 Significance = 3.49
Residue Identity = 50% Matches = 5 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
PEEVTRPNYL
   |||  ||

IVTLDGEDVIDCEPILGYLHRGHEKIAENRTIISYLPYVTRHDYLATMFTEAITVNAPEFLENIQIPQRASY 
10 20 30 X 40 X 50 60 70 



IRVIMLELSRIASHLLHL
80 90 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences



AC P05517; 

DT 01-NOV-1988 (REL. 09, CREATED)

DT 01-NOV-1988 (REL. 09, LAST SEQUENCE UPDATE)

DT 01-MAY-1991 (REL. 18, LAST ANNOTATION UPDATE)

DE 139 KD CRYSTAL PROTEIN (DELTA ENDOTOXIN) (CRYSTALINE ENTOMOCIDAL

DE PROTOXIN).

GN CRYA4.

OS BACILLUS THURINGIENSIS (SUBSP. KURSTAKI).

OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=HD-2;

RM 88203216

RA BRIZZARD B.L., WHITELEY H.R.;

RL NUCLEIC ACIDS RES. 16:2723-2723(1988). 

CC -!- FUNCTION: PROMOTES COLLOIDOSMOTIC LYSIS BY BINDING TO THE MIDGUT 
CC EPITHELIAL CELLS OF INSECTS. 

CC -!- TOXIC SEGMENT OF THE PROTEIN IS LOCATED IN THE N-TERMINUS. 

CC -!- DEVELOPMENTAL STAGE: THE CRYSTAL PROTEIN IS PRODUCED DURING 
CC SPORULATION AND IS ACCUMULATED BOTH AS AN INCLUSION AND AS PART 

CC OF THE SPORE COAT. 

DR EMBL; X06711; BTCRYA4.

DR PIR; S00873; S00873.

KW TOXIN; SPORULATION.

SQ SEQUENCE 1228 AA; 139629 MW; 7697478 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
PEEVTRPNYL
||   ||||

GIYLEPIHGVPTVRFNFTNPQNISDRGTANYSQPYESPGLQLKDSETELPPETTERPNYESYSHRLSHIGII 
400 410 420 430 440 450 460 470 

LQSRVNVPVYSWTHRSADRTNTIGPNRITQIPMVKASE
480 490 500 



14. US-08-249-182-5 (1-10) 

TXW2_NAJHH WEAK TOXIN CM-2.

ID TXW2_NAJHH STANDARD; PRT; 61 AA.

AC P01415;

DT 21-JUL-1986 (REL. 01, CREATED)

DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)

DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)

DE WEAK TOXIN CM-2.

OS NAJA HAJE HAJE (EGYPTIAN COBRA).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; REPTILIA;

OC LEPIDOSAURIA; SERPENTES.

RN [1]

RP SEQUENCE.

RC TISSUE=VENOM;

RM 79045337

RA JOUBERT F.J., TALJAARD N.;

RL EUR. J. BIOCHEM. 90:359-367(1978).

CC -!- LD(50) IS 16.1 MG/KG BY SUBCUTANEOUS INJECTION.

DR PIR; A01685; T2NJ2Y.

DR PROSITE; PS00272; SNAKE_TOXIN.

KW VENOM; TOXIN.

FT DISULFID 3 21 BY SIMILARITY. 

FT DISULFID 14 37 BY SIMILARITY. 

FT DISULFID 41 53 BY SIMILARITY. 




12. US-08-249-182-5 (1-10)

K6PF_RABIT 6-PHOSPHOFRUCTOKINASE, MUSCLE TYPE (EC 2.7.1.11) (

ID K6PF_RABIT STANDARD; PRT; 779 AA.

AC P00511;

DT 21-JUL-1986 (REL. 01, CREATED)

DT 01-OCT-1989 (REL. 12, LAST SEQUENCE UPDATE)

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)

DE 6-PHOSPHOFRUCTOKINASE, MUSCLE TYPE (EC 2.7.1.11) (PHOSPHOFRUCTOKINASE

DE 1) (PHOSPHOHEXOKINASE).

OS ORYCTOLAGUS CUNICULUS (RABBIT).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; LAGOMORPHA.

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=MUSCLE;

RM 87166033

RA LEE C.P., KAO M.C., FRENCH B.A., PUTNEY S.D., CHANG S.H.;

RL J. BIOL. CHEM. 262:4195-4199(1987).

RN [2]

RP SEQUENCE.

RM 84219739

RA POORMAN R.A., RANDOLPH A., KEMP R.G., HEINRIKSON R.L.;

RL NATURE 309:467-469(1984).

CC -!- CATALYTIC ACTIVITY: ATP + D-FRUCTOSE 6-PHOSPHATE = ADP +

CC D-FRUCTOSE 1,6-BISPHOSPHATE.

CC -!- PATHWAY: KEY CONTROL STEP OF GLYCOLYSIS.

CC -!- SUBUNIT: HOMOTETRAMER.

CC -!- COFACTOR: REQUIRES MAGNESIUM ION.

CC -!- ENZYME REGULATION: ALLOSTERIC ENZYME ACTIVATED BY ADP, AMP, OR

CC FRUCTOSE BISPHOSPHATE AND INHIBITED BY ATP OR CITRATE.

DR PIR; A00604; KIRBF.

DR PIR; A26550; A26550.

DR PROSITE; PS00433; PHOSPHOFRUCTOKINASE.

KW KINASE; TRANSFERASE; GLYCOLYSIS; DUPLICATION; ALLOSTERIC ENZYME;

KW PHOSPHORYLATION; MAGNESIUM; MULTIGENE FAMILY.



FT INIT_MET      0      0
FT REPEAT        1    401       APPROXIMATE.
FT REPEAT      402    779       APPROXIMATE.
FT MOD_RES     774    774       PHOSPHORYLATION.
FT CONFLICT    268    268       R -> S (IN REF. 2).
FT CONFLICT    479    507       MISSING (IN REF. 2).
FT CONFLICT    558    558       S -> I (IN REF. 2).
FT CONFLICT    565    565       MISSING (IN REF. 2).
SQ SEQUENCE 779 AA; 85072 MW; 3033380 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 
PEEVTRPNYL 

II II II 

GRLRA AHNL VKRG I TNLCV I GCDGSLTG ADTFRSEHSDLLSDLQK AGK I TAEEATRSSYLNI VGLVGS IDND 
100 110 120 130 140 X 150 X 160 

FCGTDMTIGTDSALHRITEIVDAITTTAQSHQRTFVLE 
170 180 190 200 



13. US-08-249-182-5 (1-10) 

CRYX.BACTK 139 KD CRYSTAL PROTEIN (DELTA ENDOTOXIN) (CRYSTALI 



DR PROSITE; PS00727; AP_NUCLEASE_F1_2.
DR PROSITE; PS00728; AP_NUCLEASE_F1_3.
KW DNA REPAIR; ENDONUCLEASE; HYDROLASE; NUCLEASE; NUCLEAR PROTEIN.
FT DOMAIN 428 679 AP ENDONUCLEASE.
SQ SEQUENCE 679 AA; 74662 MW; 2298057 CN;

Initial Score = 6 Optimized Score = 7 Significance = 4.65
Residue Identity = 72% Matches = 8 Mismatches = 2
Gaps = 1 Conservative Substitutions = 0

X X 
PEEVTR-PNYL 
llllll I I 

P ADKEFNLK I CSHNV AGLRAHLKKDGLQL I DLEEPDI FCLSETKCAND8LPEEVTRLPGYHPYHLCMPGGYA 
420 430 440 450 460 470 480 490 

GVAIYSK I HP I HVE YGI GNEEFDD VGRM ITAE YEKFYL I 
500 510 520 530 



10. US-08-249-182-5 (1-10)

G13A_DICDI CELL SURFACE GLYCOPROTEIN GP138A PRECURSOR.

ID G13A_DICDI STANDARD; PRT; 730 AA.
AC P34115;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE CELL SURFACE GLYCOPROTEIN GP138A PRECURSOR.
GN FUSA.
OS DICTYOSTELIUM DISCOIDEUM (SLIME MOLD).
OC EUKARYOTA; PROTOZOA; SARCOMASTIGOPHORA; SARCODINA; RHIZOPODA;
OC EUMYCETOZOA; DICTYOSTELIA.
RN [1]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RM 93193972
RA FANG H., HIGA N., SUZUKI K., AIBA K., URUSHIHARA H., YANAGISAWA K.;
RL DEV. BIOL. 156:201-208(1993).
CC -!- FUNCTION: INVOLVED IN THE SEXUAL CELL FUSION OF D. DISCOIDEUM.
DR EMBL; D12383; DDGP138A.



DR DICTYDB; DD03014; FUSA.
KW GLYCOPROTEIN; SIGNAL; MULTIGENE FAMILY.
FT SIGNAL        1     20
FT CHAIN        21    730       CELL SURFACE GLYCOPROTEIN GP138A.
FT CARBOHYD     58     58       POTENTIAL.
FT CARBOHYD     89     89       POTENTIAL.
FT CARBOHYD    124    124       POTENTIAL.
FT CARBOHYD    198    198       POTENTIAL.
FT CARBOHYD    224    224       POTENTIAL.
FT CARBOHYD    392    392       POTENTIAL.
FT CARBOHYD    420    420       POTENTIAL.
FT CARBOHYD    435    435       POTENTIAL.
FT CARBOHYD    482    482       POTENTIAL.
FT CARBOHYD    498    498       POTENTIAL.
FT CARBOHYD    523    523       POTENTIAL.
FT CARBOHYD    534    534       POTENTIAL.
FT CARBOHYD    596    596       POTENTIAL.
FT CARBOHYD    605    605       POTENTIAL.
FT CARBOHYD    614    614       POTENTIAL.
FT CARBOHYD    620    620       POTENTIAL.
FT CARBOHYD    621    621       POTENTIAL.
FT CARBOHYD    630    630       POTENTIAL.
SQ SEQUENCE 730 AA; 80960 MW; 3075521 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X 10 
PEEVTRPNYL 
II I III 

NTFSYECIDFSNIADFSDYYNDYEQYLPNIDNAPLLTGVDIYQSVVVGDIPESYCRINYLSLYYNQLNGTVP 
330 340 350 360 370 X 380 X 390 

SCIQCLGGVKGGDIVLPNPFLNFNKTTEPYCPTFKIDE 
400 410 420 430 



11. US-08-249-182-5 (1-10)

G13B_DICDI CELL SURFACE GLYCOPROTEIN GP138B PRECURSOR.

ID G13B_DICDI STANDARD; PRT; 734 AA.
AC P34116;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE CELL SURFACE GLYCOPROTEIN GP138B PRECURSOR.
GN FUSB.
OS DICTYOSTELIUM DISCOIDEUM (SLIME MOLD).
OC EUKARYOTA; PROTOZOA; SARCOMASTIGOPHORA; SARCODINA; RHIZOPODA;
OC EUMYCETOZOA; DICTYOSTELIA.
RN [1]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RM 93193972
RA FANG H., HIGA H., SUZUKI K., AIBA K., URUSHIHARA H., YANAGISAWA K.;
RL DEV. BIOL. 156:201-208(1993).
CC -!- FUNCTION: INVOLVED IN THE SEXUAL CELL FUSION OF D. DISCOIDEUM.
DR EMBL; D13884; DDGP138B.
DR DICTYDB; DD03015; FUSB.
KW GLYCOPROTEIN; SIGNAL; MULTIGENE FAMILY.



FT SIGNAL        1     20
FT CHAIN        21    734       CELL SURFACE GLYCOPROTEIN GP138B.
FT CARBOHYD     58     58       POTENTIAL.
FT CARBOHYD     89     89       POTENTIAL.
FT CARBOHYD    124    124       POTENTIAL.
FT CARBOHYD    198    198       POTENTIAL.
FT CARBOHYD    224    224       POTENTIAL.
FT CARBOHYD    392    392       POTENTIAL.
FT CARBOHYD    420    420       POTENTIAL.
FT CARBOHYD    435    435       POTENTIAL.
FT CARBOHYD    482    482       POTENTIAL.
FT CARBOHYD    498    498       POTENTIAL.
FT CARBOHYD    523    523       POTENTIAL.
FT CARBOHYD    596    596       POTENTIAL.
FT CARBOHYD    605    605       POTENTIAL.
FT CARBOHYD    614    614       POTENTIAL.
FT CARBOHYD    621    621       POTENTIAL.
FT CARBOHYD    630    630       POTENTIAL.
SQ SEQUENCE 734 AA; 81063 MW; 3108617 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.65
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X 10 
PEEVTRPNYL 

II I III 

NTFSYACIDFSNLAYFYDYYNEYEQYLPNIDNAPLLNEIYISESVVVGDIPESYCRINYLGLNYNQLNGTAP 
330 340 350 360 370 X 380 X 390 



SCILCLGGNRGGDIVLPNPLLNFNKTSEPYCPTFKIDS 



ID R29583 standard; Protein; 513 AA. 

AC R29583;
DT 19-APR-1993 (first entry)
DE Human activin receptor.
KW Activin receptor; mouse; Xenopus; human; extracellular; ligand binding;
KW hydrophobic; trans-membrane; intracellular; receptor; domain;
KW serine kinase-like; activity; probe; superfamily; secretion signal;
KW golgi membrane; diagnosis; treatment; activin-dependent tumour; brain;
KW neuron; abortion; twinning; wound healing; TGF-beta; immune response;
KW liver regeneration.
OS Homo sapiens.
PN WO9220793-A.
PD 26-NOV-1992.
PF 08-MAY-1992; U03825.
PR 10-MAY-1991; US-698709.
PR 09-OCT-1991; US-773229.
PA (SALK ) SALK INST BIOLOGICAL STUDIES.
PI Mathews LS, Vale WW;
DR WPI; 92-415771/50.
DR N-PSDB; Q31912.
PT New member of activin-transforming growth factor beta
PT super-family - for diagnosis and treatment of cancer and
PT disorders of the immune, reproductive or central nervous system
PS Disclosure; Page 40; 68pp; English.
CC The sequences given in R29581-83 represent activin receptors from
CC mouse, Xenopus and human respectively. Each of these proteins
CC comprise three distinct domains; an extracellular, ligand binding
CC domain, a hydrophobic, trans-membrane domain and an intracellular,
CC receptor domain having serine kinase-like activity. The DNA sequences
CC encoding these proteins can be used as probes for the identification
CC of additional members of this superfamily of receptor molecules. The
CC proteins may further comprise a second hydrophobic domain at the amino
CC terminal which comprises a secretion signal sequence which promotes
CC the intracellular transport of the initially expressed receptor
CC protein across the golgi membrane. These receptor proteins can be
CC used to develop agents for the diagnosis and/or treatment of eg.
CC activin-dependent tumours, for enhancing the survivial of brain
CC neurons, for inducing abortion or twinning in livestock, for
CC stimulating wound healing, for suppression of growth of TGF-beta
CC sensitive tumours, for suppressing immune response, for promoting
CC liver regeneration and for stimulating some immune responses.
SQ Sequence 513 AA;
SQ 38 A; 21 R; 20 N; 28 D; 0 B; 19 C; 18 Q; 37 E; 0 Z; 32 G; 15 H;
SQ 28 I; 46 L; 32 K; 16 M; 20 F; 30 P; 25 S; 24 T; 11 W; 17 Y; 36 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
I) II II 

INCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYFPEHEVTflPTSNPVTPKPPYYNILLYSLVPLNLIAGIV 
90 100 110 120 130 X 140 X 150 

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL 
160 170 180 190 



5. US-08-249-182-7 (1-10) 

R29581 Mouse activin receptor. 

ID R29581 standard; Protein; 513 AA.
AC R29581;
DT 19-APR-1993 (first entry)
DE Mouse activin receptor.
KW Activin receptor; mouse; Xenopus; human; extracellular; ligand binding;
KW hydrophobic; trans-membrane; intracellular; receptor; domain;
KW serine kinase-like; activity; probe; superfamily; secretion signal;
KW golgi membrane; diagnosis; treatment; activin-dependent tumour; brain;
KW neuron; abortion; twinning; wound healing; TGF-beta; immune response;
KW liver regeneration.
OS Mus musculus.
PN WO9220793-A.
PD 26-NOV-1992.
PF 08-MAY-1992; U03825.
PR 10-MAY-1991; US-698709.
PR 09-OCT-1991; US-773229.
PA (SALK ) SALK INST BIOLOGICAL STUDIES.
PI Mathews LS, Vale WW;
DR WPI; 92-415771/50.
DR N-PSDB; Q31910.
PT New member of activin-transforming growth factor beta
PT super-family - for diagnosis and treatment of cancer and
PT disorders of the immune, reproductive or central nervous system
PS Disclosure; Page 42-45; 68pp; English.
CC The sequences given in R29581-83 represent activin receptors from
CC mouse, Xenopus and human respectively. Each of these proteins
CC comprise three distinct domains; an extracellular ligand binding
CC domain, a hydrophobic, transmembrane domain and an intracellular,
CC receptor domain having serine kinase-like activity. The DNA sequences
CC encoding these proteins can be used as probes for the identification
CC of additional members of this superfamily of receptor molecules. The
CC proteins may further comprise a second hydrophobic domain at the amino
CC terminal which comprises a secretion signal sequence which promotes
CC the intracellular transport of the initially expressed receptor
CC protein across the golgi membrane. These receptor proteins can be
CC used to develop agents for the diagnosis and/or treatment of eg.
CC activin-dependent tumours, for enhancing the survivial of brain
CC neurons, for inducing abortion or twinning in livestock, for
CC stimulating wound healing, for suppression of growth of TGF-beta
CC sensitive tumours, for suppressing immune response, for promoting
CC liver regeneration and for stimulating some immune responses.
SQ Sequence 513 AA;
SQ 38 A; 22 R; 20 N; 28 D; 0 B; 19 C; 17 Q; 38 E; 0 Z; 32 G; 15 H;
SQ 29 I; 46 L; 31 K; 16 M; 20 F; 30 P; 25 S; 24 T; 11 W; 17 Y; 35 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 

II II II 

INCYDRTDCIEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV 
90 100 110 120 130 X 140 X 150 

ICAFWVYRHHKHAYPPVLVPT8DPGPPPPSPLLGLKPL 
160 170 180 190 



6. US-08-249-182-7 (1-10)

R42635 Human interferon receptor.

ID R42635 standard; Protein; 557 AA.
AC R42635;
DT 20-APR-1994 (first entry)
DE Human interferon receptor.
KW IFN-R; extracellular domain; monoclonal antibody; viral infection;
KW cell proliferation; allograft rejection; systemic lupus erythematosus;
KW psoriasis; multiple sclerosis; Behcet's Disease; aplastic anaemia;
KW immunodeficiency; measles virus; interferon-alpha-beta.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Domain 1..436
FT /label= extracellular_domain
FT /note= "soluble, immunogenic form of IFN-R"
PN EP-563487-A.
PD 06-OCT-1993.
PF 31-MAR-1992; 400902.
PR 31-MAR-1992; EP-400902.
PA (EUBI-) LAB EURO BIOTECHNOLOGIE SA.
PI Benoit P, Maguire D, Meyer F, Plavec I, Tovey MG;
DR WPI; 93-312951/40.
DR P-PSDB; R42635.
PT Monoclonal antibody to human interferon type-I receptor - having
PT neutralising activity against human type I interferon, used for
PT therapy and diagnosis
PS Disclosure; Fig 3; 21pp; English.
CC Monoclonal antibodies produced against soluble forms of the human
CC interferon alpha-beta receptor based on the full-length human IFN-R
CC sequence are claimed. The antibodies are useful for treatment and
CC prophylaxis of disorders involving cell proliferation and/or viral
CC infection.
SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 44 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 42 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 
II llll 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSHKIGVYSPVHCIKTTVENELPPPENIEVSV8N9NYVLKHDY 
180 190 200 210 220 230 240 250 



TYANMTFQVQHLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



7. US-08-249-182-7 (1-10) 

R28496 Sequence of a soluble form of the interferon (IFN)

ID R28496 standard; Protein; 557 AA.
AC R28496;
DT 31-MAR-1993 (first entry)
DE Sequence of a soluble form of the interferon (IFN) receptor
DE with a high affinity for IFN-alpha and -beta.
KW Interferon receptor; alpha-interferon; beta-interferon.
OS Synthetic.
PN WO9218626-A.
PD 29-OCT-1992.
PF 17-APR-1991; F00318.
PR 17-APR-1991; WO-F00318.
PA (EUBI-) LAB EURO BIOTECHNOLOGIE.
PI Eid P, Gresser I, Lutfalla G, Meyer F, Mogensen KE,
PI Tovey M, Uze G;
DR WPI; 92-382110/46.
DR N-PSDB; Q30533.
PT Water soluble polypeptide(s) strongly bind interferon(s) alpha
PT and beta - useful as immunosuppressants, for treating autoimmune
PT diseases and transplant rejection
PS Claim 3; Fig 2; 58pp; English.
CC DNA encoding the water-soluble polypeptide with a high affinity for
CC IFN-alpha and -beta is isolated by PCR, using appropriate
CC oligonucleotides as primers and cloned cDNA as template. For example,
CC bacteriophage lambda ZAP, containing the entire coding sequence of
CC the IFN-alpha and -beta receptor (Q30533), was incubated with oligos
CC Q30534 and Q30535. R28496 represents the complete receptor. R28495
CC lacks the transmembrane and cytoplasmic domains. Both forms bind
CC IFN in the same way as antibodies so are immunosuppressants e.g. for
CC treating autoimmune diseases and graft rejection. They lack the
CC toxic side-effects of known immunosuppressants such as steroids.
SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 45 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 41 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 
II llll 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNaNYVLKHDY 
180 190 200 210 220 230 240 250 

TYANMTFfiVQHLHAFLKRNPGNHLYKUKaiPDCENVKT 
260 270 280 



8. US-08-249-182-7 (1-10)

R14488 Complete interferon-alpha/beta receptor.

ID R14488 standard; Protein; 557 AA.
AC R14488;
DT 16-JAN-1992 (first entry)
DE Complete interferon-alpha/beta receptor.
KW IFN; autoimmune disease; graft rejection; histocompatibility.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Domain 437..457
FT /label= transmembrane
FT Domain 458..557
FT /label= cytoplasmic
PN FR2657881-A.
PD 09-AUG-1991.
PF 05-FEB-1990; 001298.
PR 05-FEB-1990; FR-001298.
PA (EUBI-) LAB EURO BIOTECHNO.
PI Eid P, Gresser I, Lutfalla G, Meyer F, Mogensen KE;
PI Tovey MG, Uze G;
DR WPI; 91-319778/44.
DR N-PSDB; Q14240.
PT New water-soluble polypeptide(s) with affinity for IFN-alpha and
PT beta - used to treat e.g. lupus erythematosus, Behcet's disease,
PT aplastic anaemia, diabetes mellitus, rheumatoid arthritis, etc.
PS Disclosure; Page 47; 52pp; French.
CC The invention covers derivatives of the interferon-alpha and/or beta
CC receptor obtained by deleting the transmembrane and cytoplasmic domains
CC of the native receptor or by substitution. Potentially immunogenic
CC epitopes are eliminated and the deriv. can be secreted from
CC transformed cells. Soluble derivs. block the activity of IFN alpha/beta
CC and can be used to treat autoimmune diseases or to inhibit graft
CC rejection. See also Q14239.
SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 44 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 42 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 
il III) 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSHKIGVYSPVHCIKTTVENELPPPENIEVSV8NQNYVLKHDY 
180 190 200 210 220 230 240 250 

TYANMTFQVMJLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



9. US-08-249-182-7 (1-10) 

R11958 Human alpha-interferon receptor protein.

ID R11958 standard; Protein; 557 AA.
AC R11958;
DT 18-JUL-1991 (first entry)
DE Human alpha-interferon receptor protein.
KW Human alpha IFN; IFN agonists; antiviral; antitumour agent;
KW drug targetting.
OS Homo sapiens.
FH Key Location/Qualifiers
FT Peptide 1..27
FT /label= signal peptide
PN WO9105862-A.
PD 02-MAY-1991.
PF 19-OCT-1990; F00758.
PR 20-OCT-1989; FR-013770.
PA (CNRS ) CNRS CENT NAT RECH SCI.
PI Mogensen KE, Uze G, Lutfalla G, Gresser I;
DR WPI; 91-148740/20.
DR N-PSDB; Q11701.
PT New human alpha-interferon receptor protein - useful for testing
PT interferon agonists and in treatment or diagnosis
PS Disclosure; fig 4; 30pp; French.
CC This recombinant human alpha interferon (IFN) receptor protein is
CC useful for the testing of IFN agonists and for treatment and diag-
CC nosis of viral diseases and tumours. Antibodies raised against
CC this protein can be used for blocking the receptor when required,
CC eg where overexpression of alpha-IFN is harmful. The Abs are
CC also useful for eg drug targetting. Variants of the protein,
CC having residue 164 (Thr) replaced by Arg and an Asp inserted
CC between residues 479 and 460, are also useful.
SQ Sequence 557 AA;
SQ 26 A; 13 R; 35 N; 25 D; 0 B; 11 C; 23 Q; 42 E; 0 Z; 20 G; 9 H;
SQ 43 I; 44 L; 44 K; 6 M; 26 F; 26 P; 51 S; 36 T; 13 W; 22 Y; 42 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.21
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 
II III) 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQN8NYVLKHDY 
180 190 200 210 220 230 240 250 

TYANMTFQV8WLHAFLKRNPGNHLYKHKQIPDCENVKT 
260 270 280 



10. US-08-249-182-7 (1-10) 

R10607 Peptide with motilin-like activity (J).

ID R10607 standard; Protein; 13 AA.
AC R10607;
DT 18-APR-1991 (first entry)
DE Peptide with motilin-like activity (J).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.
SQ Sequence 13 AA;
SQ 0 A; 1 R; 0 N; 0 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 

It t II 
FVPIFTYGELKRK
X 10 
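
The SQ composition lines of a record can be cross-checked against the sequence itself. The following is a minimal Python sketch, assuming the 13-residue peptide of entry R10607 reads FVPIFTYGELKRK; the function name `residue_composition` is hypothetical and is not part of FastDB or of the database record.

```python
from collections import Counter


def residue_composition(sequence: str) -> Counter:
    """Count how often each one-letter residue code occurs in a peptide."""
    return Counter(sequence.upper())


# Assumed reading of the 13 AA peptide listed for entry R10607.
peptide = "FVPIFTYGELKRK"
counts = residue_composition(peptide)

# The per-residue counts should add up to the stated sequence length.
print(len(peptide), sum(counts.values()))

# Individual counts can be compared with the SQ lines (e.g. F, K, Q).
print(counts["F"], counts["K"], counts["Q"])
```

A residue that does not occur in the peptide simply reports a count of zero, which mirrors the "0 A; 0 N; ..." items in the composition lines.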



11. US-08-249-182-7 (1-10) 

R10606 Peptide with motilin-like activity (I).

ID R10606 standard; Protein; 14 AA.
AC R10606;
DT 18-APR-1991 (first entry)
DE Peptide with motilin-like activity (I).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.
SQ Sequence 14 AA;
SQ 0 A; 1 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0



X X 
VPPFENIELY 
II I II 



rvrir i i uc.i_urr\nr\ 
X 10 



12. US-08-249-182-7 (1-10) 

R10605 Peptide with motilin-like activity (H).

ID R10605 standard; Protein; 15 AA.
AC R10605;
DT 18-APR-1991 (first entry)
DE Peptide with motilin-like activity (H).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.
SQ Sequence 15 AA;
SQ 0 A; 2 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 1 E; 0 Z; 1 G; 0 H;
SQ 1 I; 1 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 

II I tl 
FVPIFTYGELQRKRK
X 10 



13. US-08-249-182-7 (1-10) 

R10601 Peptide with motilin-like activity (D).

ID R10601 standard; Protein; 15 AA.
AC R10601;
DT 18-APR-1991 (first entry)
DE Peptide with motilin-like activity (D).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.
SQ Sequence 15 AA;
SQ 0 A; 1 R; 0 N; 0 D; 0 B; 0 C; 2 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 2 L; 0 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 

I! I II 
FVPIFTYGELQRLQE
X 10 



14. US-08-249-182-7 (1-10) 

R10604 Peptide with motilin-like activity (G).

ID R10604 standard; Protein; 17 AA.
AC R10604;
DT 18-APR-1991 (first entry)
DE Peptide with motilin-like activity (G).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.
SQ Sequence 17 AA;
SQ 0 A; 2 R; 0 N; 0 D; 0 B; 0 C; 2 Q; 1 E; 0 Z; 1 G; 0 H;
SQ 1 I; 2 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0



X X 
VPPFENIELY 

II I II 
FVPIFTYGELQRLQKRK
X 10 



15. US-08-249-182-7 (1-10) 

R10603 Peptide with motilin-like activity (F). 

ID R10603 standard; Protein; 18 AA. 

AC R10603; 

DT 18-APR-1991 (first entry) 

DE Peptide with motilin-like activity (F).
KW Motilin; activity; gastrointestinal disorder; drug.
OS Synthetic.
PN J02311495-A.
PD 27-DEC-1990.
PF 24-MAY-1989; 128911.
PR 24-MAY-1989; JP-128911.
PA (SANW ) SANWA KAGAKU KENKYUSHO.
DR WPI; 91-047299/07.
PT Polypeptide(s) with motilin-like activity - used as active
PT component of drug for treating gastrointestinal disorder
PS Disclosure; Page 4; 7pp; Japanese.
CC Compared with motilin, the peptide chain is considerably shorter.
CC Chemical synthesis is easy and cheap. The peptide is used
CC as active component in a drug for treating gastrointestinal disorders.
CC See also R10598-R10611.

SQ Sequence 18 AA;
SQ 0 A; 2 R; 0 N; 0 D; 0 B; 0 C; 2 Q; 2 E; 0 Z; 1 G; 0 H;
SQ 1 I; 2 L; 2 K; 0 M; 2 F; 1 P; 0 S; 1 T; 0 W; 1 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.37
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY

II I II
FVPIFTYGELQRLQEKRK
X 10

> 0 <

0| |0 IntelliGenetics

> 0 <

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_7p.res made by on Thu 22 Sep 94 10:44:49-PDT.

Query sequence being compared: US-08-249-182-7 (1-10)
Number of sequences searched: 70848
Number of scores above cutoff: 3987

Results of the initial comparison of US-08-249-182-7 (1-10) with:
Data bank : PIR 41, all entries

[Histogram: NUMBER OF SEQUENCES (vertical axis, 0 to 100000) plotted against SCORE (0 to 7) and STDEV (-1 to 5)]


PARAMETERS

Similarity matrix          Unitary    K-tuple                  2
Mismatch penalty           1          Joining penalty          20
Gap penalty                1.00       Window size              5
Gap size penalty           0.05
Cutoff score               0
Randomization group        0
Initial scores to save     40         Alignments to save       15
Optimized scores to save   0          Display context          50




SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           2       3         0.98

Times:     CPU            Total Elapsed
           00:01:25.99    00:01:36.00

Number of residues: 20816057
Number of sequences searched: 70848
Number of scores above cutoff: 3987

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
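
The listing states only that significance is based on the initial score. One reading consistent with the "standard deviations above mean" banners is a simple z-score; the sketch below assumes that reading, and the function name `significance` is hypothetical rather than taken from FastDB.

```python
def significance(initial_score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean.

    Assumption: significance = (score - mean) / standard deviation.
    """
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


# Rounded values as printed under SEARCH STATISTICS.
MEAN, STDEV = 2.0, 0.98

for score in (8, 6, 5):
    print(score, round(significance(score, MEAN, STDEV), 2))
```

With the rounded mean of 2 this gives 6.12, 4.08 and 3.06, close to the listed 6.11, 4.07 and 3.05; the small gap would be explained if the unrounded mean were slightly above 2, but that is an inference, not something the printout states.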



The list of best scores is:

                                                 Init.  Opt.
Sequence Name  Description               Length  Score  Score  Sig.  Frame

**** 6 standard deviations above mean ****



#note sequence extracted from NCBI backbone
SUMMARY #length 114 #checksum 7335
SEQUENCE



Initial Score = 8 Optimized Score = 8 Significance = 6.11 
Residue Identity = 80% Matches = 8 Mismatches = 2 

Gaps = 0 Conservative Substitutions = 0 

X 10 
VPPFENIELY 
Illlll II 

DITLVPGTLGRDIEHLTSLDFFRVNSM9TVFVGYGPTFKCG9PLHITATKSPPFENINLYYDVPWNETIPEE 
20 30 40 50 60 X 70 X 80 

VTXPNYLQAEVSYPAFKPXLDVYKWHVAAN 
90 100 110 
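
The Matches, Mismatches and Residue Identity figures for an ungapped alignment can be recomputed by comparing the query with the aligned window of the database sequence position by position. This is a minimal Python sketch, assuming the query VPPFENIELY is aligned against the window SPPFENINLY of the sequence above; the function name `identity_stats` is hypothetical, and the sketch does not handle gaps.

```python
def identity_stats(query: str, target_window: str) -> tuple[int, int, float]:
    """Return (matches, mismatches, percent identity) for an ungapped alignment."""
    if len(query) != len(target_window):
        raise ValueError("ungapped comparison needs equal-length strings")
    matches = sum(q == t for q, t in zip(query, target_window))
    mismatches = len(query) - matches
    return matches, mismatches, 100.0 * matches / len(query)


# Query from the search and the assumed aligned window of the hit.
matches, mismatches, identity = identity_stats("VPPFENIELY", "SPPFENINLY")
print(matches, mismatches, identity)
```

For this pair the sketch reports 8 matches, 2 mismatches and 80.0 percent identity, which agrees with the figures printed for the alignment.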



2. US-08-249-182-7 (1-10)

R3HS12 ribosomal protein HS12 - Haloarcula marismortui

ENTRY            R3HS12 #type complete
TITLE            ribosomal protein HS12 - Haloarcula marismortui
ALTERNATE_NAMES  ribosomal protein El. 3
ORGANISM         #formal_name Haloarcula marismortui
DATE             31-Mar-1991 #sequence_revision 31-Mar-1991 #text_change
                 30-Jun-1993
ACCESSIONS       S00183; C24304
REFERENCE        S00182
   #authors      Kimura, J.; Arndt, E.; Kimura, M.
   #journal      FEBS Lett. (1987) 224:65-70
   #title        Primary structures of three highly acidic ribosomal proteins
                 S6, S12 and S15 from the archaebacterium Halobacterium
                 marismortui.
   #cross-references MUID:88055606
   #accession    S00183
      ##molecule_type protein
      ##residues 1-147 ##label KIM
      ##note     the source is designated as Halobacterium marismortui
REFERENCE        A24304
   #authors      Shoham, M.; Dijk, J.; Reinhardt, R.; Wittmann-Liebold, B.
   #journal      FEBS Lett. (1986) 204:323-330
   #title        Purification and characterization of ribosomal proteins from
                 the 30 S subunit of the extreme halophile Halobacterium
                 marismortui.
   #accession    C24304
      ##molecule_type protein
      ##residues 1-11, 'E\13-14r'IM6-18,'r, 20-21 ##label SHO
      ##note     the source is designated as Halobacterium marismortui
CLASSIFICATION   #superfamily rat ribosomal protein S19
KEYWORDS         protein biosynthesis; ribosome
SUMMARY          #length 147 #molecular-weight 16438 #checksum 4777
SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X 10 
VPPFENIELY 
III I II 

ATLYDVPPEELIEALTETLADEDWAEFTKTGVDRELPPESEDFWTRRAASLLRKVAVDGPVGVNA 
X 10 X 20 30 40 50 60 



 1. A42329  autotaxin - human (fragments)         114     8     8  6.11  0

**** 4 standard deviations above mean ****

 2. R3HS12  ribosomal protein HS12 - Halo          147     6     6  4.07  0
 3. S31985  rasC protein - slime mold (Di  [illegible]     6     6  4.07  0
 4. S15960  hypothetical protein 1 - yeas          233     6     6  4.07  0
 5. J61743  hypothetical 33.6K protein -           287     6     6  4.07  0
 6. S33518  probable nucleotide-binding p          369     6     6  4.07  0
 7. S37563  actin-related protein - yeast          420     6     6  4.07  0
 8. A49193  type II activin receptor ActR          513     6     6  4.07  0
 9. S27258  activin receptor type II - ra          513     6     6  4.07  0
10. S18908  activin receptor precursor -           513     6     6  4.07  0
11. S22345  activin receptor - human               513     6     6  4.07  0
12. A39896  activin receptor precursor -           513     6     6  4.07  0
13. JQ1486  activin receptor II precursor          513     6     6  4.07  0
14. S17112  interferon alpha/beta recepto          545     6     6  4.07  0
15. A32694  interferon alpha receptor pre          557     6     6  4.07  0
16. E35216  FPD5 protein - fowlpox virus           791     6     6  4.07  0
17. S16133  dimethylglycine dehydrogenase          857     6     6  4.07  0
18. S37310  protoporphyrin IX magnesium c       13[?]7     6     6  4.07  0
19. JQ1917  polyprotein - parsnip yellow          3027     6     6  4.07  0

**** 3 standard deviations above mean ****

20. B36558  H+/K+-exchanging ATPase (EC 3           17     5     5  3.05  0
21. A24228  H+/K+-exchanging ATPase (EC 3           17     5     5  3.05  0
22. A60313  motilin - dog                           22     5     5  3.05  0
23. S00189  motilin - dog                           22     5     5  3.05  0
24. HSPG    motilin - pig                           22     5     5  3.05  0
25. S30335  alcohol/aldehyde oxidoreducta           41     5     5  3.05  0
26. S01587  hypothetical protein 50 - liv           50     5     5  3.05  0
27. A05031  hypothetical protein 50 - liv           50     5     5  3.05  0
28. A27210  hypothetical protein (CO 1 1 5'         59     5     5  3.05  0
29. WHVZK8  K8 protein - vaccinia virus (           64     5     5  3.05  0
30. J81197  hypothetical 7.6K protein (re           69     5     5  3.05  0
31. N2EP2D  long neurotoxin 2 - black mam           72     5     5  3.05  0
32. N2EP10  long neurotoxin 1 - black mam           72     5     5  3.05  0
33. S31018  gene 73 protein - Mycobacteri           85     5     5  3.05  0
34. S00740  hypothetical protein - Methan           96     5     5  3.05  0
35. S06415  beta-2-microglobulin - rat              99     5     5  3.05  0
36. HGRBB2  beta-2-microglobulin - rabbit           99     5     5  3.05  0
37. PC1125  c-ros protein - mouse (fragme          100     5     5  3.05  0
38. Z2BPT9  gene 49.2 protein - phage T4           106     5     5  3.05  0
39. A37203  lens fiber membrane major int          112     5     5  3.05  0
40. A30329  promotilin precursor - human           114     5     5  3.05  0




1. US-08-249-182-7 (1-10)
A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments

TITLE autotaxin - human (fragments)

ORGANISM #formal_name Homo sapiens #common_name man

DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change
08-May-1993
ACCESSIONS A42329
REFERENCE A42329

#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.

#journal J. Biol. Chem. (1992) 267:2524-2529

#title Identification, purification, and partial sequence analysis
of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329

##status preliminary

##molecule_type protein

##residues 1-114 ##label STR

##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;



S31985 rasC protein - slime mold (Dictyostelium discoideu

ENTRY S31985 #type complete
TITLE rasC protein - slime mold (Dictyostelium discoideum)
ORGANISM #formal_name Dictyostelium discoideum
DATE 03-Mar-1994; #sequence_revision 03-Mar-1994; #text_change
03-Mar-1994
ACCESSIONS S31985
REFERENCE S31985
#authors Daniel, J.M.; Bush, J.; Cardelli, J.; Spiegelman, G.B.;
Weeks, G.
#submission submitted to the EMBL Data Library, December 1992
#accession S31985
##status preliminary
##residues 1-189 ##label DAN
##cross-references EMBL:Z18926
SUMMARY #length 189 #molecular-weight 21496 #checksum 4479

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
III I II

GFLIVYSIISRASFEAVTTFREQILRVKDLSTYPIVIIGNKADLPDKDRKVPPMEGKELAKSFGAPFLETSA
80 90 100 110 120 130 140

KSRVNVEEAFFTLVREIKRWNQNPQNEEMLPPKKRGCI
150 160 170 180



4. US-08-249-182-7 (1-10) 

S15960 hypothetical protein 1 - yeast (Saccharomyces kluy

ENTRY S15960 #type complete

TITLE hypothetical protein 1 - yeast (Saccharomyces kluyveri)
plasmid pSKL

ORGANISM #formal_name Saccharomyces kluyveri

DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
21-Nov-1993
ACCESSIONS S15960
REFERENCE S15960

#authors Hishinuma, F.; Hirai, K.

#journal Mol. Gen. Genet. (1991) 226:97-106

#title Genome organization of the linear plasmid, pSKL, isolated
from Saccharomyces kluyveri.
#cross-references MUID:91238725
#accession S15960

##status preliminary
##residues 1-233 ##label HIS
##cross-references EMBL:X54850
SUMMARY #length 233 #molecular-weight 27705 #checksum 5808

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II IIII

FSYDFEPEMELFVSASVFNNKEVYEDQLSKAKEIGYFYKYLDIDSESSLEVPLKENIETDFGFDPNWGEN
130 140 150 160 170 X 180 X 190

ELCEEYERKFFRSFEFNKGIEGVKELDEVVRRMGGI
200 210 220 230



5. US-08-249-182-7 (1-10) 

JQ1743 hypothetical 33.6K protein - rabbit fibroma virus

ENTRY JQ1743 #type complete
TITLE hypothetical 33.6K protein - rabbit fibroma virus
ALTERNATE_NAMES H2R protein; protein kinase homolog
ORGANISM #formal_name rabbit fibroma virus, Shope fibroma virus
DATE 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change
30-Sep-1993
ACCESSIONS JQ1743
REFERENCE JQ1741
#authors Massung, R.F.; McFadden, G.; Moyer, R.W.
#journal J. Gen. Virol. (1992) 73:2903-2911
#title Nucleotide sequence analysis of a unique near-terminal region
of the tumorigenic poxvirus, Shope fibroma virus.
#accession JQ1743
##molecule_type DNA
##residues 1-287 ##label MAS
SUMMARY #length 287 #molecular-weight 33561 #checksum 2007

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
IIII II

LDMALDCCTGLVNVYKYTKKPYEHLSSVNFLVDEDYTTKILCHGLEKTLSVPPFKNINMLAYRPRRILLOIF
120 130 140 150 160 170 180 190

SKYTEKDDVYSLGVVLWEIFTGCVPFDHATSTDIYDAV
200 210 220



6. US-08-249-182-7 (1-10)

S33518 probable nucleotide-binding protein - Acholeplasma

ENTRY S33518 #type complete
TITLE probable nucleotide-binding protein - Acholeplasma laidlawii
ORGANISM #formal_name Acholeplasma laidlawii
DATE 03-Mar-1994; #sequence_revision 03-Mar-1994; #text_change
03-Mar-1994
ACCESSIONS S33518
REFERENCE S33518
#authors Boyer, M.J.; Jarhede, T.K.; Tegman, V.; Wieslander, A.
#submission submitted to the EMBL Data Library, June 1993
#accession S33518
##status preliminary
##residues 1-369 ##label BOY
##cross-references EMBL:Z22875
SUMMARY #length 369 #molecular-weight 42141 #checksum 5061

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0



X X 
VPPFENIELY 



10 20 X 30 40 50 60 70 



TSTLLDKLK 
80 



7. US-08-249-182-7 (1-10) 

S37563 actin-related protein - yeast (Saccharomyces cerev

ENTRY S37563 #type complete
TITLE actin-related protein - yeast (Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 08-Dec-1993; #sequence_revision 08-Dec-1993; #text_change
08-Dec-1993
ACCESSIONS S37563
REFERENCE S37563
#authors Harata, M.; Karwan, A.; Wintersberger, U.
#submission submitted to the EMBL Data Library, September 1993
#accession S37563
##status preliminary
##residues 1-420 ##label HAR
##cross-references EMBL:X75317
SUMMARY #length 420 #molecular-weight 47649 #checksum 8418

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0



X 10 
VPPFENIELY 
II II II 

RTKPSGVNKSDKKVTPTEEKEQEAVSKSTSPAANSADTPNETGKRPLEEEKPPKENNELIGLADLVYSSIMS 
260 270 280 290 300 X 310 X 320 



SDVDLRATLAHNVVLTGGTSSIPGLSDRLNTELNKILP 
330 340 350 360 



8. US-08-249-182-7 (1-10) 

A49193 type II activin receptor ActRII - rat (fragment)

ENTRY A49193 #type fragment

TITLE type II activin receptor ActRII - rat (fragment)

ORGANISM #formal_name Rattus norvegicus #common_name Norway rat

DATE 19-Dec-1993; #sequence_revision 19-Dec-1993; #text_change
29-Jan-1994
ACCESSIONS A49193
REFERENCE A49193

#authors Feng, Z.M.; Madigan, M.B.; Chen, C.L.

#journal Endocrinology (1993) 132:2593-2600

#title Expression of type II activin receptor genes in the male and
female reproductive tissues of the rat.
#cross-references MUID:93279247
#accession A49193

##status preliminary
##molecule_type nucleic acid
##residues 1-513 ##label FEN
##cross-references NCBIN:133008; NCBIP:133009
##note sequence extracted from NCBI backbone

SUMMARY #length 513 #checksum 89

SEQUENCE 



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0



X 10 
VPPFENIELY 
II II II 

INCYDRTDCIEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLRLIAGIV 
90 100 110 120 130 X 140 X 150 

ICAFWVYRHHKMAYPPVLVPTGDPGPPPPSPLLGLKPL 
160 170 180 190 



9. US-08-249-182-7 (1-10) 

S27258 activin receptor type II - rat

ENTRY S27258 #type complete
TITLE activin receptor type II - rat
ORGANISM #formal_name Rattus norvegicus #common_name Norway rat
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
22-Nov-1993
ACCESSIONS S27258
REFERENCE S27258
#authors Shinozaki, H.; Ito, I.; Hasegawa, Y.; Nakamura, K.; Igarashi,
S.; Nakamura, M.; Miyamoto, K.; Eto, Y.; Ibuki, Y.;
Minegishi, T.
#journal FEBS Lett. (1992) 312:53-56
#title Cloning and sequencing of a rat type II activin receptor.
#accession S27258
##status preliminary
##residues 1-513 ##label SHI
SUMMARY #length 513 #molecular-weight 57903 #checksum 9412

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCIEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLHLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL
160 170 180 190



10. US-08-249-182-7 (1-10) 

S18908 activin receptor precursor - human

ENTRY S18908 #type complete
TITLE activin receptor precursor - human
ORGANISM #formal_name Homo sapiens #common_name man
DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
22-Nov-1993
ACCESSIONS S18908
REFERENCE S18908
#authors Geiser, A.G.
#submission submitted to the EMBL Data Library, December 1991
#accession S18908
##status preliminary
##residues 1-513 ##label GEI
##cross-references EMBL:X62381
SUMMARY #length 513 #molecular-weight 57847 #checksum 205

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07

Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL
160 170 180 190



11. US-08-249-182-7 (1-10) 

S22345 activin receptor - human

ENTRY S22345 #type complete

TITLE activin receptor - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 22-Nov-1993; #sequence_revision 22-Nov-1993; #text_change
22-Nov-1993
ACCESSIONS S22345
REFERENCE S22345

#authors Matzuk, M.M.; Bradley, A.

#journal Biochim. Biophys. Acta (1992) 1130:105-108

#title Cloning of the human activin receptor cDNA reveals high
evolutionary conservation.
#cross-references MUID:92182002
#accession S22345

##status preliminary
##residues 1-513 ##label MAT
##cross-references EMBL:X63128
SUMMARY #length 513 #molecular-weight 57847 #checksum 205

SEQUENCE 



Initial Score = 6 
Residue Identity = 60% 
Gaps = 0 



Optimized Score = 6 Significance = 4.07
Matches = 6 Mismatches = 4 

Conservative Substitutions = 0 



X 10 
VPPFENIELY 
II II II 

INCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV 
90 100 110 120 130 X 140 X 150 



ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL 
160 170 180 190 



12. US-08-249-182-7 (1-10)

A39896 activin receptor precursor - mouse

ENTRY A39896 #type complete

TITLE activin receptor precursor - mouse

ORGANISM #formal_name Mus musculus #common_name house mouse

DATE 24-Jan-1992 #sequence_revision 24-Jan-1992 #text_change
18-Jun-1993
ACCESSIONS A39896
REFERENCE A39896

#authors Mathews, L.S.; Vale, W.W.

#journal Cell (1991) 65:973-982

#title Expression cloning of an activin receptor, a predicted
transmembrane serine kinase.
#cross-references MUID:91256317
#accession A39896

##status preliminary

##molecule_type mRNA

##residues 1-513 ##label MAT

##cross-references GB:M65287
SUMMARY #length 513 #molecular-weight 57889 #checksum 23

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCIEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL
160 170 180 190



13. US-08-249-182-7 (1-10) 

JQ1486 activin receptor II precursor - human

ENTRY JQ1486 #type complete

TITLE activin receptor II precursor - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change
09-Jun-1994
ACCESSIONS JQ1486
REFERENCE JQ1486

#authors Donaldson, C.J.; Mathews, L.S.; Vale, W.W.

#journal Biochem. Biophys. Res. Commun. (1992) 184:310-316

#title Molecular cloning and binding properties of the human type II
activin receptor.

#cross-references MUID:92231944

#contents Testis

#accession JQ1486
##molecule_type mRNA
##residues 1-513 ##label DON
COMMENT This protein binds activin A.
CLASSIFICATION #superfamily protein kinase homology
KEYWORDS glycoprotein; membrane protein

FEATURE
1-19 #domain signal sequence #status predicted #label SIGN
20-513 #protein activin receptor II #status predicted #label MAT\
136-161 #domain transmembrane #label TM1\
199-484 #domain protein kinase homology #label KIN\
43,66,333 #binding_site carbohydrate (Asn) (covalent) #status predicted

SUMMARY #length 513 #molecular-weight 57847 #checksum 205

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL
160 170 180 190



14. US-08-249-182-7 (1-10)

S17112 interferon alpha/beta receptor - human

ENTRY S17112 #type complete

TITLE interferon alpha/beta receptor - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 21-Nov-1993; #sequence_revision 21-Nov-1993; #text_change
21-Nov-1993
ACCESSIONS S17112
REFERENCE S17112

#authors Lutfalla, G.; Gardiner, X.Y.Z.; Proudhon, D.; Vielh, E.;
Mogensen, X.Y.Z.; Uze, G.

#submission submitted to the EMBL Data Library, July 1991
#description The structure of the human interferon alpha/beta receptor
gene.

#accession S17112

##status preliminary

##residues 1-545 ##label LUT

##cross-references EMBL:X60459
SUMMARY #length 545 #molecular-weight 62169 #checksum 672

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
II IIII

RIENIYSRHKIYKLSPETTYCLKVKAALLTSHKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKMDY 
180 190 200 210 220 230 240 250 

TYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 
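
The Matches, Mismatches and Residue Identity figures reported for each alignment can be checked by hand. The following is a minimal sketch, assuming an ungapped comparison (Gaps = 0) in which a position counts as a match only when the two residues are identical; the function name `ungapped_identity` is hypothetical and is not part of the search program. The target window is the ten-residue stretch of the interferon receptor sequence that sits under the query in the alignment above.

```python
from typing import Tuple


def ungapped_identity(query: str, target: str) -> Tuple[int, int, float]:
    """Count identical and differing positions in an ungapped alignment.

    Returns (matches, mismatches, percent identity over the query length).
    """
    if len(query) != len(target):
        raise ValueError("ungapped comparison needs equal-length sequences")
    matches = sum(q == t for q, t in zip(query, target))
    mismatches = len(query) - matches
    return matches, mismatches, 100.0 * matches / len(query)


QUERY = "VPPFENIELY"          # US-08-249-182-7 (1-10)
TARGET_WINDOW = "LPPPENIEVS"  # residues aligned under the query

matches, mismatches, identity = ungapped_identity(QUERY, TARGET_WINDOW)
print(f"Matches = {matches}  Mismatches = {mismatches}  "
      f"Residue Identity = {identity:.0f}%")
```

With a unitary similarity matrix, every identical position contributes equally, which is consistent with the initial score of 6 reported for six matching residues.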



15. US-08-249-182-7 (1-10) 

A32694 interferon alpha receptor precursor - human

ENTRY A32694 #type complete

TITLE interferon alpha receptor precursor - human

ORGANISM #formal_name Homo sapiens #common_name man

DATE 22-Jun-1990 #sequence_revision 22-Jun-1990 #text_change
30-Sep-1993
ACCESSIONS A32694
REFERENCE A32694

#authors Uze, G.; Lutfalla, G.; Gresser, I.

#journal Cell (1990) 60:225-234

#title Genetic transfer of a functional human interferon alpha
receptor into mouse cells: cloning and expression of its
cDNA.

#cross-references MUID:90124632
#accession A32694

##status preliminary
##molecule_type mRNA
##residues 1-557 ##label UZE
##cross-references GB:J03171
SUMMARY #length 557 #molecular-weight 63525 #checksum 7035

SEQUENCE



Initial Score = 6 Optimized Score = 6 Significance = 4.07
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
II IIII

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKHDY
180 190 200 210 220 230 240 250

TYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKT
260 270 280

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_7s.res made by on Thu 22 Sep 94 10:35:07-PDT.

Query sequence being compared: US-08-249-182-7 (1-10)
Number of sequences searched: 36000
Number of scores above cutoff: 3802

Results of the initial comparison of US-08-249-182-7 (1-10) with:
Data bank : Swiss-Prot 28, all entries

[Histogram of initial scores: NUMBER OF SEQUENCES (logarithmic axis, 10 to 100000) plotted against SCORE (0 to 4) and STDEV (-1 to 3)]



PARAMETERS

Similarity matrix         Unitary    K-tuple              2
Mismatch penalty          1          Joining penalty      20
Gap penalty               1.00       Window size          5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40         Alignments to save   15
Optimized scores to save  0          Display context      50





SEARCH STATISTICS

Scores:    Mean    Median    Standard Deviation
           2       3         0.88

Times:     CPU            Total Elapsed
           00:00:50.95    00:00:56.00

Number of residues: 12496420
Number of sequences searched: 36000
Number of scores above cutoff: 3802

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.
Cut-off raised to 5.

The scores below are sorted by initial score.
Significance is calculated based on initial score.

A 100% identical sequence to the query sequence was not found.
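
The report states only that significance is calculated from the initial score. The group headings ("4 standard deviations above mean", "3 standard deviations above mean") suggest that it is the number of standard deviations by which a score exceeds the mean of all scores. The sketch below works through that reading; the formula and the function name `significance` are assumptions, not something the report confirms. The printed mean (2) is rounded, so the results only approximate the listed values of 4.57 and 3.42.

```python
def significance(score: float, mean: float, std_dev: float) -> float:
    """Return how many standard deviations a score lies above the mean.

    Assumed reading of the report's "Significance" column.
    """
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / std_dev


MEAN = 2.0      # printed mean of the initial scores (rounded in the report)
STD_DEV = 0.88  # printed standard deviation

for initial_score in (6, 5):
    z = significance(initial_score, MEAN, STD_DEV)
    print(f"initial score {initial_score}: about {z:.2f} standard deviations above the mean")
```

The small gap between these figures and the listed ones is what would be expected if the program used an unrounded mean slightly below 2.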



The list of best scores is:

                                                    Init. Opt.
Sequence Name   Description                    Length Score Score Sig.  Frame

**** 4 standard deviations above mean ****

 1. RS12_HALMA  30S RIBOSOMAL PROTEIN HS12 (E   147  6  6  4.57  0
 2. RASC_DICDI  RAS-LIKE PROTEIN RASC.          189  6  6  4.57  0
 3. YK84_CAEEL  HYPOTHETICAL 43.0 KD PROTEIN    378  6  6  4.57  0
 4. AVR2_MOUSE  ACTIVIN RECEPTOR TYPE II PREC   513  6  6  4.57  0
 5. AVR2_HUMAN  ACTIVIN RECEPTOR TYPE II PREC   513  6  6  4.57  0
 6. INAR_HUMAN  INTERFERON-ALPHA RECEPTOR PRE   557  6  6  4.57  0
 7. VD05_FOWP1  92.6 KD PROTEIN.                791  6  6  4.57  0
 8. POLG_PYFV1  GENOME POLYPROTEIN (CONTAINS:  3027  6  6  4.57  0

**** 3 standard deviations above mean ****

 9. MOTI_CANFA  MOTILIN.                         22  5  5  3.42  0
10. nunn..unine  nunn-ucrcnvcni HLV/unuL ucmvn  ti  3  J.1C  V
11. Alro_UtNBt  ATP SYNTHASE PROTEIN 8 (EC 3.    59  5  5  3.42  0
12. T /l\D_VAttv  HYPOTHETICAL 7.5 KD PROTEIN.   64  5  5  3.42  0
13. NXL3_DENPO  LONG NEUROTOXIN 3 (TOXIN VN2)    72  5  5  3.42  0
14. NXL2_DENPO  LONG NEUROTOXIN 2 (NEUROTOXIN    72  5  5  3.42  0
15. NXL1_DENPO  LONG NEUROTOXIN 1 (NEUROTOXIN    72  5  5  3.42  0
16. VG73_BPML5  GENE 73 PROTEIN (GP73).          85  5  5  3.42  0
17. E310_ADE02  EARLY E3B 10.4 KD PROTEIN PRE    91  5  5  3.42  0
18. YNI1_METIV  HYPOTHETICAL PROTEIN IN NIFH2    96  5  5  3.42  0
19. B2MG_RABIT  BETA-2-MICROGLOBULIN.            99  5  5  3.42  0
20. Y492_BPT4   HYPOTHETICAL 12.6 KD PROTEIN    106  5  5  3.42  0
21. MIP_CHICK   LENS FIBER MAJOR INTRINSIC PR   112  5  5  3.42  0
22. MOTI_HUMAN  MOTILIN PRECURSOR.              115  5  5  3.42  0
23. B2MG_CYPCA  BETA-2-MICROGLOBULIN PRECURSO   116  5  5  3.42  0
24. DDADC       BETA-2-MICROGLOBULIN.           116  5  5  3.42  0
25. MOTI_PIG    MOTILIN PRECURSOR.              119  5  5  3.42  0
26. B2MG_RAT    BETA-2-MICROGLOBULIN PRECURSO   119  5  5  3.42  0
27. I AT". ^PIG  INTER-ALPHA-TRYPSIN INHIBITOR  123  5  5  3.42  0
28. MOTI_RABIT  MOTILIN PRECURSOR.              133  5  5  3.42  0
29. YCP4_SYNPY  HYPOTHETICAL 16.1 KD PROTEIN    140  5  5  3.42  0
30. YBEA_ECOLI  HYPOTHETICAL 17.3 KD PROTEIN    155  5  5  3.42  0
31. PETD_SCEOB  CYTOCHROME B6-F COMPLEX SUBUN   160  5  5  3.42  0
32. PETD_PROHO  CYTOCHROME B6-F COMPLEX SUBUN   160  5  5  3.42  0
33. HPPK_BACSU  2-AMINO-4-HYDROXY-6-HYDROXYME   162  5  5  3.42  0
34. DESS_MYXXA  DEVELOPMENT-SPECIFIC PROTEIN    173  5  5  3.42  0
35. DEST_MYXXA  DEVELOPMENT-SPECIFIC PROTEIN    175  5  5  3.42  0
36. YOP8_YEREN  YOP8 PROTEIN PRECURSOR.         182  5  5  3.42  0
37. NRFG_ECOLI  NRFG PROTEIN.                   198  5  5  3.42  0
38. CAT4_AGRTU  CHLORAMPHENICOL ACETYLTRANSFE   209  5  5  3.42  0
39. CYB6_WHEAT  CYTOCHROME B6 (EC 1.10.99.1).   215  5  5  3.42  0
40. CYB6_TOBAC  CYTOCHROME B6 (EC 1.10.99.1).   215  5  5  3.42  0


1. US-08-249-182-7 (1-10) 

RS12_HALMA 30S RIBOSOMAL PROTEIN HS12 (E1.3).

ID RS12_HALMA STANDARD; PRT; 147 AA.
AC P19952;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE 30S RIBOSOMAL PROTEIN HS12 (E1.3).
OS HALOARCULA MARISMORTUI (HALOBACTERIUM MARISMORTUI).
OC PROKARYOTA; MENDOSICUTES; ARCHAEBACTERIA; HALOBACTERIALES;
OC HALOBACTERIACEAE.
RN [1]
RP SEQUENCE.
RM 88055606
RA KIMURA J., ARNDT E., KIMURA M.;
RL FEBS LETT. 224:65-70(1987).
RN [2]
RP SEQUENCE OF 1-21.
RA SHOHAM M., DIJK J., REINHARDT R., WITTMANN-LIEBOLD B.;
RL FEBS LETT. 204:323-330(1986).
CC -!- SIMILARITY: BELONGS TO THE S19E FAMILY OF RIBOSOMAL PROTEINS.
DR PIR; S00183; R3HS12.
DR PROSITE; PS00628; RIBOSOMAL_S19E.
KW RIBOSOMAL PROTEIN.
FT CONFLICT 12 12 I -> E (IN REF. 2).
FT CONFLICT 15 15 L -> I (IN REF. 2).
FT CONFLICT 19 19 L -> I (IN REF. 2).
SQ SEQUENCE 147 AA; 16438 MW; 101142 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
III I II

ATLYDVPPEELIEALTETLADEDWAEFTKTGVDRELPPEQEDFWTRRAASLLRKVAVDGPVGVNA
X 10 X 20 30 40 50 60 



2. US-08-249-182-7 (1-10) 

RASC_DICDI RAS-LIKE PROTEIN RASC.

ID RASC_DICDI STANDARD; PRT; 189 AA.
AC P32253;
DT 01-OCT-1993 (REL. 27, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE RAS-LIKE PROTEIN RASC.
GN RASC.
OS DICTYOSTELIUM DISCOIDEUM (SLIME MOLD).
OC EUKARYOTA; PROTOZOA; SARCOMASTIGOPHORA; SARCODINA; RHIZOPODA;
OC EUMYCETOZOA; DICTYOSTELIA.
RN [1]
RP SEQUENCE FROM N.A.
RA DANIEL J.M., BUSH J., CARDELLI J., SPIEGELMAN G.B., WEEKS G.;
RL ONCOGENE 9:501-508(1994).
CC -!- FUNCTION: RAS PROTEINS BIND GDP/GTP AND POSSESS INTRINSIC GTPASE
CC ACTIVITY.
DR EMBL; Z18926; DDRASCA.
DR PIR; S31985; S31985.
DR DICTYDB; DD05036; RASC.
KW GTP-BINDING; PRENYLATION; LIPOPROTEIN.
FT NP_BIND 11 18 GTP (BY SIMILARITY).
FT NP_BIND 58 62 GTP (BY SIMILARITY).
FT NP_BIND 117 120 GTP (BY SIMILARITY).
FT DOMAIN 33 41 EFFECTOR REGION (BY SIMILARITY).
FT LIPID 186 186 GERANYL-GERANYL (BY SIMILARITY).
SQ SEQUENCE 189 AA; 21496 MW; 181053 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
VPPFENIELY
III I II

GFLIVYSIISRASFEAVTTFREQILRVKDLSTYPIVIIGNKADLPDKDRKVPPHEGKELAKSFGAPFLETSA
80 90 100 110 120 130 140

KSRVNVEEAFFTLVREIKRWNQNPQNEEMLPPKKRGCI
150 160 170 180



3. US-08-249-182-7 (1-10) 

YK84_CAEEL HYPOTHETICAL 43.0 KD PROTEIN C30A5.4 IN CHROMOSOME

ID YK84_CAEEL STANDARD; PRT; 378 AA.
AC P34350;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 43.0 KD PROTEIN C30A5.4 IN CHROMOSOME III.
GN C30A5.4.
OS CAENORHABDITIS ELEGANS.
OC EUKARYOTA; METAZOA; ACOELEMATES; NEMATODA; SECERNENTEA; RHABDITIDA.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=BRISTOL N2;
RA ANDERSON K.;
RL SUBMITTED (FEB-1993) TO EMBL/GENBANK/DDBJ DATA BANKS.
DR EMBL; L10990; CEC30A5.
DR WORMPEP; C30A5.4; CE00095.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 378 AA; 43031 MW; 750062 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II IIII

AHQHCFDGYWVLSGRKLITIFSAPMYCNFYKNTGCVLKVDETLGIQTVAFVPESENIEKIIEEHNRVyDISI
310 320 330 340 350 X 360 X 370

DCIE



4. US-08-249-182-7 (1-10)

AVR2_MOUSE ACTIVIN RECEPTOR TYPE II PRECURSOR (ACTR-II) (EC 2

ID AVR2_MOUSE STANDARD; PRT; 513 AA.
AC P27038;
DT 01-AUG-1992 (REL. 23, CREATED)
DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE ACTIVIN RECEPTOR TYPE II PRECURSOR (ACTR-II) (EC 2.7.1.-).
OS MUS MUSCULUS (MOUSE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RM 91256317
RA MATHEWS L.S., VALE W.W.;
RL CELL 65:973-982(1991).
CC -!- FUNCTION: RECEPTOR FOR ACTIVIN A, ACTIVIN B, AND INHIBIN A.
CC INVOLVED IN TRANSMEMBRANE SIGNALING.
CC -!- SUBCELLULAR LOCATION: TYPE I MEMBRANE PROTEIN.
CC -!- TISSUE SPECIFICITY: BRAIN, TESTIS, INTESTINE, LIVER, AND KIDNEY.
CC -!- SIMILARITY: WITH THE CONSERVED CATALYTIC DOMAINS OF SER/THR-
CC PROTEIN KINASES.
DR EMBL; M65287; MMACTR.
DR PIR; A39896; A39896.
DR PROSITE; PS00107; PROTEIN_KINASE_ATP.
DR PROSITE; PS00108; PROTEIN_KINASE_ST.
KW RECEPTOR; TRANSFERASE; SERINE/THREONINE-PROTEIN KINASE; ATP-BINDING;
KW TRANSMEMBRANE; GLYCOPROTEIN; SIGNAL.
FT SIGNAL 1 19 POTENTIAL.
FT CHAIN 20 513 ACTIVIN RECEPTOR TYPE II.
FT DOMAIN 20 135 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 136 161 POTENTIAL.
FT DOMAIN 162 513 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 198 484 CATALYTIC.
FT NP_BIND 198 206 ATP (BY SIMILARITY).
FT BINDING 219 219 ATP (BY SIMILARITY).
FT ACT_SITE 322 322 BY SIMILARITY.
FT CARBOHYD 43 43 POTENTIAL.
FT CARBOHYD 66 66 POTENTIAL.
SQ SEQUENCE 513 AA; 57889 MW; 1345779 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCIEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLHLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL



5. US-08-249-182-7 (1-10) 

AVR2_HUMAN ACTIVIN RECEPTOR TYPE II PRECURSOR (ACTR-II) (EC 2

ID AVR2_HUMAN STANDARD; PRT; 513 AA.
AC P27037;
DT 01-AUG-1992 (REL. 23, CREATED)
DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE ACTIVIN RECEPTOR TYPE II PRECURSOR (ACTR-II) (EC 2.7.1.-).
OS HOMO SAPIENS (HUMAN).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; PRIMATES.
RN [1]
RP SEQUENCE FROM N.A.
RC TISSUE=TESTIS;
RM 92182002
RA MATZUK M.M., BRADLEY A.;
RL BIOCHIM. BIOPHYS. ACTA 1130:105-108(1992).
RN [2]
RP SEQUENCE FROM N.A.
RC TISSUE=TESTIS;
RM 92231944
RA DONALDSON C.J., MATHEWS L.S., VALE W.W.;
RL BIOCHEM. BIOPHYS. RES. COMMUN. 184:310-316(1992).
RN [3]
RP SEQUENCE FROM N.A.
RC TISSUE=MAMMARY;
RA GEISER A.G.;
RL SUBMITTED (DEC-1991) TO EMBL/GENBANK/DDBJ DATA BANKS.
CC -!- FUNCTION: RECEPTOR FOR ACTIVIN A, ACTIVIN B, AND INHIBIN A.
CC INVOLVED IN TRANSMEMBRANE SIGNALING.
CC -!- SUBCELLULAR LOCATION: TYPE I MEMBRANE PROTEIN.
CC -!- SIMILARITY: WITH THE CONSERVED CATALYTIC DOMAINS OF SER/THR-
CC PROTEIN KINASES.
DR EMBL; X63128; HSACTREC.
DR EMBL; X62381; HSACTR.
DR EMBL; N93415; HSACTIIA.
DR PIR; S18908; S18908.
DR PIR; JQ1486; JQ1486.
DR PIR; S22345; S22345.
DR MIM; 102576; TENTH EDITION.
DR PROSITE; PS00107; PROTEIN_KINASE_ATP.
DR PROSITE; PS00108; PROTEIN_KINASE_ST.
KW RECEPTOR; TRANSFERASE; SERINE/THREONINE-PROTEIN KINASE; ATP-BINDING;
KW TRANSMEMBRANE; GLYCOPROTEIN; SIGNAL.
FT SIGNAL 1 19 POTENTIAL.
FT CHAIN 20 513 ACTIVIN RECEPTOR TYPE II.
FT DOMAIN 20 135 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 136 161 POTENTIAL.
FT DOMAIN 162 513 CYTOPLASMIC (POTENTIAL).
FT DOMAIN 198 484 CATALYTIC.
FT NP_BIND 198 206 ATP (BY SIMILARITY).
FT BINDING 219 219 ATP (BY SIMILARITY).
FT ACT_SITE 322 322 BY SIMILARITY.
FT CARBOHYD 43 43 POTENTIAL.
FT CARBOHYD 66 66 POTENTIAL.
SQ SEQUENCE 513 AA; 57847 MW; 1347089 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
VPPFENIELY
II II II

INCYDRTDCVEKKDSPEVYFCCCEGNMCNEKFSYFPEMEVTQPTSNPVTPKPPYYNILLYSLVPLMLIAGIV
90 100 110 120 130 X 140 X 150

ICAFWVYRHHKMAYPPVLVPTQDPGPPPPSPLLGLKPL
160 170 180 190



6. US-08-249- 182-7 (1-10) 

1NAR_HUMAN INTERFERON-ALPHA RECEPTOR PRECURSOR (IFN-ALPHA-REC 

ID INAR.HUMAN STANDARD; PRTf 557 AA. 

AC P17181; 

DT 01 -AUG- 1990 (REL. 15, CREATED) 

DT 01-AUG-1990 (REL. 15. LAST SEQUENCE UPDATE) 

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE) 

DE INTERFERON-ALPHA RECEPTOR PRECURSOR (IFN-ALPHA-REC). 

GN IFNAR. 

OS HOMO SAPIENS (HUMAN) . 

oc eukaryota; metazoa; chord at a; vertebrata; tetrapoda; mammalia; 

OC EUTHERIA; PRIMATES. 

RN [1] 

RP SEQUENCE FROM N.A. 

RN 90124632 

RA UZE G. r LUTFALLA G., GRESSER I.; 

RL CELL 60:225-234(1990). 

CC -!- FUNCTION: RECEPTOR FOR INTERFERON ALPHA. 

CC -!- SUBCELLULAR LOCATION: TYPE I MEMBRANE PROTEIN. 

CC -!- TISSUE SPECIFICITY: IFN RECEPTORS ARE PRESENT IN ALL TISSUES AND 

CC EVEN ON THE SURFACE OF MOST IFN-RESISTANT CELLS. 

DR EMBLi J03171 J HSIFNRA. 

DR PIR; A32694; A32694. 

DR MIM; 107450; TENTH EDITION. 

KM RECEPTOR; TRANSMEMBRANE; GLYCOPROTEIN; SIGNAL. 



FT SIGNAL        1     27  POTENTIAL.
FT CHAIN        28    557  INTERFERON-ALPHA RECEPTOR.
FT DOMAIN       28    436  EXTRACELLULAR (POTENTIAL).
FT TRANSMEM    437    457  POTENTIAL.
FT DOMAIN      458    557  CYTOPLASMIC (POTENTIAL).
FT DISULFID     79     87  BY SIMILARITY.
FT DISULFID    199    220  BY SIMILARITY.
FT CARBOHYD     50     50  POTENTIAL.
FT CARBOHYD     58     58  POTENTIAL.
FT CARBOHYD     81     81  POTENTIAL.
FT CARBOHYD     88     88  POTENTIAL.
FT CARBOHYD    110    110  POTENTIAL.
FT CARBOHYD    172    172  POTENTIAL.
FT CARBOHYD    254    254  POTENTIAL.
FT CARBOHYD    313    313  POTENTIAL.
FT CARBOHYD    314    314  POTENTIAL.
FT CARBOHYD    376    376  POTENTIAL.
FT CARBOHYD    416    416  POTENTIAL.
FT CARBOHYD    433    433  POTENTIAL.
SQ SEQUENCE 557 AA; 63525 MW; 1717510 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57



0 



X X 
VPPFENIELY 
II Mil 

RIENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQNQNYVLKWDY
180 190 200 210 220 230 240 250 

TYANNTFGV8WLHAFLKRNPGNHLYKWKQIPDCENVKT 
260 270 280 



7. US-08-249-182-7 (1-10) 

VD05_FOWP1 92.6 KD PROTEIN.

ID VD05_FOWP1 STANDARD; PRT; 791 AA.
AC P21969;
DT 01-AUG-1991 (REL. 19, CREATED)
DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE 92.6 KD PROTEIN.
GN FPD5.
OS FOWLPOX VIRUS (STRAIN FP-1).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC AVIPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RM 90324937
RA TARTAGLIA J., WINSLOW J., GOEBEL S., JOHNSON G.P., TAYLOR J.,
RA PAOLETTI E.;
RL J. GEN. VIROL. 71:1517-1524(1990).
CC -!- FUNCTION: INVOLVED IN VIRAL REPLICATION, POSSIBLY IN THE
CC ELONGATION OF DNA.
CC -!- SIMILARITY: TO VACCINIA VIRUS D5 PROTEIN.
DR EMBL; X17202; POFPHIND.
DR PIR; E35216; E35216.
KW DNA REPLICATION; DNA-BINDING; ATP-BINDING.
FT NP_BIND 505 512 ATP (POTENTIAL).
SQ SEQUENCE 791 AA; 92655 MW; 3362422 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
III III 

DIKDSIFY8GNDAKKFVCTVSTGYKYEEGINVDDITTELMSILDDI8PKTKENFENRELYESILSSCLMGTT 
430 440 450 460 470 X 480 X 490 

KQCIFFFYGETATGKSTTKKLLKSVMHNNFLETGQVIL 
500 510 520 530 



8. US-08-249-182-7 (1-10) 

POLG_PYFV1 GENOME POLYPROTEIN (CONTAINS: 22.5 KD PROTEIN; 26

ID POLG_PYFV1 STANDARD; PRT; 3027 AA.
AC Q05057;
DT 01-FEB-1994 (REL. 28, CREATED)
DT 01-FEB-1994 (REL. 28, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE GENOME POLYPROTEIN (CONTAINS: 22.5 KD PROTEIN; 26 KD PROTEIN; 31 KD
DE PROTEIN; PROBABLE RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.48)).
OS PARSNIP YELLOW FLECK VIRUS (ISOLATE P-121) (PYFV).
ViniUflC. NURtNVCLUrtU VIKUSta. VltUUflt.
RN [1]
RP SEQUENCE FROM N.A.
RM 93107855
RA TURNBULL-ROSS A.D., REAVY B., MAYO M.A., MURANT A.F.;
RL J. GEN. VIROL. 73:3203-3211(1992).
CC -!- SIMILARITY: SOME, TO THE CMPV AND TBRV POLYPROTEINS.
DR EMBL; D14066; PYFPOLYP.
KW POLYPROTEIN; ATP-BINDING; COAT PROTEIN; RNA-DIRECTED RNA POLYMERASE.
FT NP_BIND   1467   1474  ATP (POTENTIAL).
FT VARIANT    962    962  T -> I.
FT VARIANT   1373   1373  L -> F.
SQ SEQUENCE 3027 AA; 336242 MW; 22416420 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.57
Residue Identity = 60% Matches = 6 Mismatches = 4
Gaps = 0 Conservative Substitutions = 0



X X 
VPPFENIELY 
III III 

LVHETLNGTTKN8EEPSKDVIAILEELGDGVVEGILEKRKELLS8FGVMDPPPFDAIELEPGKA8ASVCFST 
1660 1670 1680 1690 1700 1710 1720 



DAFGNPLKNPFVELFGKLRDEFERATK8EMPDDILTKF 
1730 1740 1750 1760 



9. US-08-249-182-7 (1-10)
MOTI_CANFA MOTILIN.

ID MOTI_CANFA STANDARD; PRT; 22 AA.
AC P19863;
DT 01-FEB-1991 (REL. 17, CREATED)
DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)
DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE)
DE MOTILIN.
OS CANIS FAMILIARIS (DOG).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; CARNIVORA.
RN [1]
RP SEQUENCE.
RM 83195948
RA POITRAS P., REEVE J.R. JR., HUNKAPILLER M.W., HOOD L.E., WALSH J.H.;
RL REGUL. PEPT. 5:197-208(1983).
CC -!- FUNCTION: PLAYS AN IMPORTANT ROLE IN THE REGULATION OF
CC INTERDIGESTIVE GASTROINTESTINAL MOTILITY AND INDIRECTLY CAUSES
CC RHYTHMIC CONTRACTION OF DUODENAL AND COLONIC SMOOTH MUSCLE.
CC -!- RESIDUE 1 MAY ALSO BE LYS OR SER.
DR PIR; S00189; S00189.
DR PIR; A60313; A60313.
KW HORMONE.
FT UNSURE 1 1
SQ SEQUENCE 22 AA; 2685 MW; 2092 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 

II I II 
FVPIFTHSELQKIREKERNKGQ 



X 10 20 
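With a unitary similarity matrix and no gaps, the score reported for this hit appears to equal the number of identical residues at the best ungapped placement of the query on the database sequence. The sketch below counts identities at every full-overlap offset. It is an exhaustive illustration of that reading, not FastDB's k-tuple search, and the function name is hypothetical.

```python
def best_ungapped_identity(query: str, target: str) -> tuple[int, int]:
    """Return (identities, offset) for the best full-overlap ungapped placement.

    Every offset at which the whole query fits inside the target is tried;
    ties keep the first (lowest) offset.
    """
    if len(query) > len(target):
        raise ValueError("query is longer than target")
    best_score, best_offset = -1, 0
    for offset in range(len(target) - len(query) + 1):
        window = target[offset:offset + len(query)]
        score = sum(q == t for q, t in zip(query, window))
        if score > best_score:
            best_score, best_offset = score, offset
    return best_score, best_offset


if __name__ == "__main__":
    # Query and database sequence taken from the alignment shown above.
    score, offset = best_ungapped_identity("VPPFENIELY", "FVPIFTHSELQKIREKERNKGQ")
    print(score, offset)  # 5 1
```

The zero-based offset of 1 corresponds to the query starting at residue 2 of the 22-residue sequence, which matches the five identities listed for this hit.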



10. US-08-249-182-7 (1-10)

ADHN_AMYME NDMA-DEPENDENT ALCOHOL DEHYDROGENASE (EC 1.1.99.-)

ID ADHN_AMYME STANDARD; PRT; 41 AA.
AC P80175;
DT 01-JUL-1993 (REL. 26, CREATED)
DT 01-JUL-1993 (REL. 26, LAST SEQUENCE UPDATE)
DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE)
DE NDMA-DEPENDENT ALCOHOL DEHYDROGENASE (EC 1.1.99.-) (NDMA-ADH)
DE (FRAGMENT).
OS AMYCOLATOPSIS METHANOLICA.
OC PROKARYOTA; FIRMICUTES; ACTINOMYCETALES; NOCARDIOFORMS.
RN [1]
RP SEQUENCE.
RC STRAIN=NCIB 11946;
RM 93215662
RA VAN OPHEM P.W., VAN BEEUMEN J., DUINE J.A.;
RL EUR. J. BIOCHEM. 212:819-826(1993).
CC -!- FUNCTION: THIS IS A NOVEL ENZYME, CATALYTICALLY DIFFERENT FROM
CC OTHER ALCOHOL DEHYDROGENASES. IT IS EFFECTIVE IN OXIDIZING
CC ETHANOL, OTHER PRIMARY ALCOHOLS AND BENZYLALCOHOL ONLY IN THE
CC PRESENCE OF P-NITROSO-N,N-DIMETHYLANILINE (NDMA) AS AN ELECTRON
CC ACCEPTOR. NADH ACTS AS A COFACTOR HERE INSTEAD AS A COENZYME.
CC -!- CATALYTIC ACTIVITY: ALCOHOL + NDHAH = ALDEHYDE + NDHA(+).
CC -!- COFACTOR: NAD.
CC -!- SUBUNIT: HOMOTRIMER.
DR PIR; S30335; S30335.
KW OXIDOREDUCTASE; NAD.
FT UNSURE 41 41
FT NON_TER 41 41
SQ SEQUENCE 41 AA; 4589 MW; 9102 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X 
VPPFENIELY 

III II 

MKTKAAVLHSAGKPFEIEELELDGPREHEVLIKYTATGLXR
10 X 20 30 40 



11. US-08-249-182-7 (1-10) 

ATP8_OENBE ATP SYNTHASE PROTEIN 8 (EC 3.6.1.34) (ATPASE-ASSOC

ID ATP8_OENBE STANDARD; PRT; 59 AA.
AC P07513;
DT 01-APR-1988 (REL. 07, CREATED)
DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE ATP SYNTHASE PROTEIN 8 (EC 3.6.1.34) (ATPASE-ASSOCIATED PROTEOLIPID
DE COMPONENT) (A6L).
OS OENOTHERA BERTERIANA (BERTERO'S EVENING PRIMROSE).
OG MITOCHONDRION.
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;
OC MYRTALES; ONAGRACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CV. MUNZIA;
RA HIESEL R., BRENNICKE A.;
RL FEBS LETT. 193:164-168(1985).
RN [2]
RP ERRATUM.
RC STRAIN=CV. MUNZIA;
RA HIESEL R., BRENNICKE A.;
hl rcoDLCii. cvi< 11 1-1/ / \nooi .
CC -!- FUNCTION: THIS IS ONE OF THE CHAINS OF THE NONENZYMATIC COMPONENT
CC (CF(O) SUBUNIT) OF THE MITOCHONDRIAL ATPASE COMPLEX.
CC -!- SUBCELLULAR LOCATION: MEMBRANE-BOUND.
DR PIR; A27210; A27210.
KW HYDROGEN ION TRANSPORT; CF(O) SUBUNIT; MITOCHONDRION.
SQ SEQUENCE 59 AA; 7009 MW; 18749 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0



X 10 
VPPFENIELY 



NPFLVGLSPPFLYFELIGHFQVEPSPTPTIKGRKHWRLSLFLIFWGERRVKKETKANDC 
10 X 20 30 40 50 



12. US-08-249-182-7 (1-10)

Y7K5_VACCV HYPOTHETICAL 7.5 KD PROTEIN.

ID Y7K5_VACCV STANDARD; PRT; 64 AA.
AC P18383;
DT 01-NOV-1990 (REL. 16, CREATED)
DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)
DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)
DE HYPOTHETICAL 7.5 KD PROTEIN.
OS VACCINIA VIRUS (STRAIN WR).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC ORTHOPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RM 89067908
RA BOURSNELL M.E.G., FOULDS I.J., CAMPBELL J.I., BINNS M.M.;
RL J. GEN. VIROL. 69:2995-3003(1988).
DR EMBL; D00382; PXVACH3K.
DR PIR; JS0218; HHVZK8.
KW HYPOTHETICAL PROTEIN.
SQ SEQUENCE 64 AA; 7472 MW; 24024 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
II III 

MDFCKIDVVVSFAHSFDNLINFINTIVPYSDIIELH9FLV8SSTTGNIFVKHYNHISPRNIFIY 
10 20 X 30 X 40 50 60 



13. US-08-249-182-7 (1-10)

NXL3_DENPO LONG NEUROTOXIN 3 (TOXIN VN2).

ID NXL3_DENPO STANDARD; PRT; 72 AA.
AC P25667;
DT 01-MAY-1992 (REL. 22, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE LONG NEUROTOXIN 3 (TOXIN VN2).
OS DENDROASPIS POLYLEPIS POLYLEPIS (BLACK MAMBA).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; REPTILIA;
OC LEPIDOSAURIA; SERPENTES.
RN [1]
RP SEQUENCE.
RL S. AFR. J. CHEM. 30:40-48(1977).
CC -!- LD(50) IS 0.4 MG/KG BY SUBCUTANEOUS INJECTION.
DR PROSITE; PS00272; SNAKE_TOXIN.



KW VENOM; NEUROTOXIN; MULTIGENE FAMILY.
FT DISULFID      3     21  BY SIMILARITY.
FT DISULFID     14     42  BY SIMILARITY.
FT DISULFID     27     31  BY SIMILARITY.
FT DISULFID     46     57  BY SIMILARITY.
FT DISULFID     58     63  BY SIMILARITY.
SQ SEQUENCE 72 AA; 7939 MW; 25452 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
II III 

RTCNKTFSD8SKICPPGENICYTKTHCDAHCSRRGKIVELGCAATCPKVKAGVGIKCCSTDNCNLFKFGKPR 
10 X 20 X 30 40 50 60 70 



14. US-08-249-182-7 (1-10) 

NXL2_DENPO LONG NEUROTOXIN 2 (NEUROTOXIN DELTA) (TOXIN TN2).

ID NXL2_DENPO STANDARD; PRT; 72 AA.
AC P01397;
DT 21-JUL-1986 (REL. 01, CREATED)
DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE LONG NEUROTOXIN 2 (NEUROTOXIN DELTA) (TOXIN TN2).
OS DENDROASPIS POLYLEPIS POLYLEPIS (BLACK MAMBA).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; REPTILIA;
OC LEPIDOSAURIA; SERPENTES.
RN [1]
RP SEQUENCE.
RA STRYDOM D.J.;
RL THESIS (1973), UNIVERSITY OF PRETORIA, SOUTH-AFRICA.
DR PIR; A01668; N2EP2D.
DR PROSITE; PS00272; SNAKE_TOXIN.
KW VENOM; NEUROTOXIN; MULTIGENE FAMILY.



FT DISULFID      3     21  BY SIMILARITY.
FT DISULFID     14     42  BY SIMILARITY.
FT DISULFID     27     31  BY SIMILARITY.
FT DISULFID     46     57  BY SIMILARITY.
FT DISULFID     58     63  BY SIMILARITY.
FT CONFLICT      7      7  P -> F (IN PIR DATA BANK).
SQ SEQUENCE 72 AA; 7948 MW; 25602 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
II III 

RTCNKTPSDQSKICPPGENICYTKTWCDAWCSQRGKIVELGCAATCPKVKAGVEIKCCSTDNCNKFKFGKPR
10 X 20 X 30 40 50 60 70 



15. US-08-249-182-7 (1-10) 

NXL1_DENPO LONG NEUROTOXIN 1 (NEUROTOXIN GAMMA) (TOXIN VN1).

ID NXL1_DENPO STANDARD; PRT; 72 AA.
AC P01396;
DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)
DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE)
DE LONG NEUROTOXIN 1 (NEUROTOXIN GAMMA) (TOXIN VN1).
OS DENDROASPIS POLYLEPIS POLYLEPIS (BLACK MAMBA).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; REPTILIA;
OC LEPIDOSAURIA; SERPENTES.
RN [1]
RP SEQUENCE.
RM 72206049
RA STRYDOM D.J.;
RL J. BIOL. CHEM. 247:4029-4042(1972).
CC -!- LD(50) IS 0.12 MG/KG BY SUBCUTANEOUS INJECTION.
DR PIR; A01667; N2EP1D.



DR PROSITE; PS00272; SNAKE_TOXIN.
KW VENOM; NEUROTOXIN; MULTIGENE FAMILY.
FT DISULFID      3     21  BY SIMILARITY.
FT DISULFID     14     42  BY SIMILARITY.
FT DISULFID     27     31  BY SIMILARITY.
FT DISULFID     46     57  BY SIMILARITY.
FT DISULFID     58     63  BY SIMILARITY.
SQ SEQUENCE 72 AA; 8043 MW; 25023 CN;

Initial Score = 5 Optimized Score = 5 Significance = 3.42
Residue Identity = 50% Matches = 5 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10 
VPPFENIELY 
II III 

RTCNKTFSDQSKICPPGENICYTKTHCDAHCSQRGKRVELGCAATCPKVKAGVEIKCCSTDDCDKFQFGKPR 
10 X 20 X 30 40 50 60 70 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_8a.res made by on Thu 22 Sep 94 10:07:42-PDT.

Query sequence being compared: US-08-249-182-8 (1-11)
Number of sequences searched: 42145
Number of scores above cutoff: 4408

Results of the initial comparison of US-08-249-182-8 (1-11) with:
Data bank : A-GeneSeq 15, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic axis, 0 to 100000) plotted against SCORE (0 to 7) and STDEV (0 to 6); the plotted points are not recoverable.]

PARAMETERS

Similarity matrix         Unitary    K-tuple               2
Mismatch penalty          1          Joining penalty      20
Gap penalty               1.00       Window size           5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0
Initial scores to save    40         Alignments to save   15
Optimized scores to save  0          Display context      50

SEARCH STATISTICS

Scores:       Mean        Median      Standard Deviation
              1           3           1.22

Times:        CPU             Total Elapsed
              00:00:25.99     00:00:36.00

Number of residues:              5287517
Number of sequences searched:    42145
Number of scores above cutoff:   4408



Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
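The significance column is consistent with a z-score: the initial score minus the mean score of the search, divided by the standard deviation. The sketch below works under that assumption. The mean (1) and standard deviation (1.22) are the rounded values printed in the search statistics, so the results agree with the printed significances only to within a few hundredths; the function name is hypothetical.

```python
def significance(initial_score: float, mean: float, stdev: float) -> float:
    """Number of standard deviations by which a score exceeds the mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


if __name__ == "__main__":
    # Initial scores that occur in this search: 11, 6 and 5.
    for score in (11, 6, 5):
        print(score, round(significance(score, mean=1.0, stdev=1.22), 2))
```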



A 100% identical sequence to the query sequence was found:

                                                   Init.  Opt.
Sequence Name  Description                  Length Score Score  Sig. Frame

 1. R37450  Autotaxin peptide ATX 100.          11    11    11  8.17     0

The list of other best scores is:

                                                   Init.  Opt.
Sequence Name  Description                  Length Score Score  Sig. Frame

              **** 4 standard deviations above mean ****
 2. P70030  Secretory signal sequence of        26     6     6  4.08     0
 3. R11208  Retroviral B-epitope containi       31     6     6  4.08     0
 4. R36584  Virus neutralising epitope of       33     6     6  4.08     0
 5. R14334  HIV-1 amplifier peptide #18.        33     6     6  4.08     0
 6. P70033  Secretory signal sequence of        39     6     6  4.08     0
 7. P70029  Secretory signal sequence of        50     6     6  4.08     0
 8. R13443  FSH inhibiting protein.            264     6     6  4.08     0
 9. R05596  Somatomedin carrier protein s      291     6     6  4.08     0
10. P92300  Sequence of human insulin-lik      291     6     6  4.08     0
11. R32501  Beta-adrenergic receptor.          400     6     6  4.08     0
12. R13792  E75B exon B1 polypeptide.          425     6     6  4.08     0
13. R38691  Mitochondria ATPase beta subu      551     6     6  4.08     0
              **** 3 standard deviations above mean ****
14. P98464  Sequence of C. trachomatis se       14     5     5  3.27     0
15. P98452  Sequence of C. trachomatis se       14     5     5  3.27     0
16. P98448  Sequence of C. trachomatis se       14     5     5  3.27     0
17. P98444  Sequence of C. trachomatis se       14     5     5  3.27     0
18. P98468  Sequence of C. trachomatis se       14     5     5  3.27     0
19. P98460  Sequence of C. trachomatis se       14     5     5  3.27     0
20. P82416  Peptide #4 with atrial natriu       20     5     5  3.27     0
21. R12374  Human Factor VII (285-305 + G       23     5     5  3.27     0
22. R34149  CDR2 domain of human V beta 4       24     5     5  3.27     0
23. R24868  Sequence of peptide fragment        28     5     5  3.27     0
24. P91359  Amino acids 482-517 of HIV gl       38     5     5  3.27     0
25. P90504  cDNA from murine cells encodi       69     5     5  3.27     0
26. P94662  Protein sequence for the amin       70     5     6  3.27     0
27. R38924  MIP-1alpha.                         74     5     5  3.27     0
28. P91962  Polypeptide encoded by cDNA 5       74     5     5  3.27     0
29. R28371  Papillomavirus E5 protein.          83     5     5  3.27     0
30. R02234  Sequence of the E5 oncogene r       83     5     5  3.27     0
31. P93590  Deduced sequence of MIP-1 alp       92     5     5  3.27     0
32. R12603  SIB 121 intestinal mucin.           95     5     5  3.27     0
33. R13336  HypA protein.                      102     5     6  3.27     0
34. R13334  HypA protein.                      102     5     6  3.27     0
35. R26953  Human T lymphocyte receptor V      106     5     5  3.27     0
36. R41247  Aphrodisine.                       145     5     6  3.27     0
37. R22950  Leech antiplatelet protein.        147     5     5  3.27     0
38. P70494  Sequence of human B-cell grow      159     5     5  3.27     0
39. R42675  Golden hamster Aphrodisin pre      167     5     6  3.27     0
40. R42674  Field hamster Aphrodisin prec      167     5     6  3.27     0



1. US-08-249-182-8 (1-11)

R37450 Autotaxin peptide ATX 100.

ID R37450 standard; peptide; 11 AA.
AC R37450;
DT 22-JUL-1993 (first entry)
DE Autotaxin peptide ATX 100.
KW immunostains; disease outcome prediction; therapy choice;
KW cancer therapy; crosslinked toxins.
OS Synthetic.
PN US7822043-A.
PD 01-JAN-1993.
PF 17-JAN-1992; 822043.
PR 17-JAN-1992; US-822043.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.
PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M.
DR WPI; 93-085861/10.
PT Motility stimulating protein named autotaxin - useful in cancer
PT diagnosis and therapy
PS Example; Page 33; 36pp; English.
CC The sequence is that of autotaxin peptide ATX 100. It may be used to
CC raise anti-autotaxin antibodies which can be used to diagnose cancer
CC metastasis and in immunostains of patient samples to detect the
CC presence of autotaxin. The level of autotaxin in tissue or body
CC fluids can be used to predict disease outcomes and/or choice of
CC therapy which may also include autotaxin inhibitors. Autotaxin
CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer
CC therapy.
SQ Sequence 11 AA;
SQ 1 A; 0 R; 0 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 2 G; 0 H;
SQ 1 I; 1 L; 1 K; 0 M; 0 F; 1 P; 0 S; 2 T; 1 W; 0 Y; 0 V;

Initial Score = 11 Optimized Score = 11 Significance = 8.17
Residue Identity = 100% Matches = 11 Mismatches = 0
Gaps = 0 Conservative Substitutions = 0

X         X
GGQPLWITATK
|||||||||||
GGQPLWITATK
X        10
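The two composition lines of an entry can be recomputed from the sequence itself, which gives a convenient check on a transcription. The sketch below tallies residues in the order this report uses (A R N D B C Q E Z G H on the first line, I L K M F P S T W Y V on the second). The function name and the exact output spacing are illustrative assumptions.

```python
from collections import Counter

# Residue order of the two composition lines, as printed in the report.
ROW1 = "ARNDBCQEZGH"
ROW2 = "ILKMFPSTWYV"


def composition_lines(sequence: str) -> list[str]:
    """Return the two composition lines ("1 A; 0 R; ...") for a sequence."""
    counts = Counter(sequence.upper())  # residues that are absent count as 0
    return [" ".join(f"{counts[aa]} {aa};" for aa in row) for row in (ROW1, ROW2)]


if __name__ == "__main__":
    for line in composition_lines("GGQPLWITATK"):
        print("SQ", line)
```

For the 11-residue query this yields one W and no M, so a database line that reads M at position 6 cannot be a 100% match.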



2. US-08-249-182-8 (1-11)

P70030 Secretory signal sequence of plasmid pSPA12.

ID P70030 standard; protein; 26 AA.
AC P70030;
DT 03-FEB-1991 (first entry)
DE Secretory signal sequence of plasmid pSPA12.
KW Secretory signal sequence; vector; protein secretion.
FH Key Location/Qualifiers
FT Region 17..26
FT /label=multiple cloning site
PN EP-244042-A.
PD 04-NOV-1987.
PF 29-APR-1987; 200813.
PR 02-MAY-1986; EP-200774.
PA (KONN) Gist Brocades NV.
PI Smith HE, Van Ee JH, Peeters BPH, Bron S, Venema G.
DR WPI; 87-308298/44.
PT Plasmid for detecting secretory signal sequence - containing a
PT multiple cloning site and an open reading frame encoding an enzyme.
PS Claim 13; Page 28; 38pp; English.
CC The secretory signal sequence of plasmid pSPA12 comprises a multiple
CC cloning site with restriction sites in reading frame with the
CC structural gene. This sequence allows protein secretion, eg in
CC Bacillus sp., in economically high yields. See also P70028-9 and
CC P70031-P70041 and N70039-N70041.
SQ Sequence 26 AA;
SQ 5 A; 1 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 1 E; 0 Z; 4 G; 0 H;
SQ 5 I; 3 L; 0 K; 1 M; 0 F; 1 P; 1 S; 1 T; 0 W; 0 Y; 2 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK

II II II
MIRGILIAVLGIAIVGGDPLESTAAA
10 X 20 X



3. US-08-249-182-8 (1-11) 

R11208 Retroviral B-epitope containing peptide #18 of hyb

ID R11208 standard; Protein; 31 AA.
AC R11208;
DT 23-MAY-1991 (first entry)
DE Retroviral B-epitope containing peptide #18 of hybrid molecule.
KW retrovirus; env glycoprotein; B-epitope; immunodeficiency virus;
KW HIV; SIV; AIDS.
PN WO9102544-A.
PD 07-MAR-1991.
PF 17-AUG-1990; F00620.
PR 18-AUG-1989; FR-011044.
PA (INSP ) INST PASTEUR.
PA (UYCU-) UNIV CURIE P & M PARIS V.
PI Girard M, Gluckman JC, Bahraoui EM;
DR WPI; 91-087117/12.
PT Vaccine compsns. which neutralise human immune deficiency virus -
PT comprise a B epitope of retro-virus envelope glyco-protein and T
PT epitope of distinct protein
PS Claim 6; Page 36; 47pp; French.
CC The peptide is a specific example of a B-epitope contg. peptide
CC which can form a hybrid immunogenic molecule with a retroviral T-
CC epitope. The B-epitope is chosen to be the major neutralisation
CC epitope of the envelope glycoprotein of a pathogenic retrovirus.
CC The T-epitope can be derived from a different protein of the
CC same retrovirus or from the same protein from a different retrovirus.
CC The hybrid molecule can also contain a minor epitope, especially a
CC B-epitope from a conserved region of the HIV, SIV, HTLV-1 or
CC HTLV-II env glycoprotein. The B-epitope-contg. peptide is joined to
CC the T-epitope using eg tetanus toxin, KLH or HSA.
CC See also R11191-R11207 and R11209-R11210.
SQ Sequence 31 AA;
SQ 2 A; 3 R; 1 N; 0 D; 0 B; 0 C; 3 Q; 0 E; 0 Z; 3 G; 1 H;
SQ 2 I; 2 L; 2 K; 0 M; 0 F; 2 P; 2 S; 6 T; 0 W; 2 Y; 0 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK

III I II
TRPYKNTRQSTPIGLGQALYTTRTKSIGQAH
10 X 20 X 30 



4. US-08-249-182-8 (1-11) 

R36584 Virus neutralising epitope of envelope glycoprotei

ID R36584 standard; peptide; 33 AA.
AC R36584;
DT 06-SEP-1993 (first entry)
DE Virus neutralising epitope of envelope glycoprotein of HIV.
nw nundri mnunggeT 1 1 lenty viru^r gp*cui gpiour cian vntr iRFturux,y.
OS Synthetic.
PN WO9303836-A.
PD 13-MAY-1993.
PF 28-OCT-1992; E02459.
PR 28-OCT-1991; US-782154.
PR 28-OCT-1991; US-782241.
PR 28-OCT-1991; US-782252.
PA (INSP ) INST PASTEUR.
PI Girard M;
DR WPI; 93-167398/20.
PT Enhancing immunogenicity of viral envelope glycoprotein - by
PT co-administration of viral envelope glycoprotein itself, and an
PT oligopeptide derive.
PS Disclosure; Page 82; 107pp; English.
CC A novel method of enhancing the immunogenicity of an envelope
CC glycoprotein (EGP) of a virus (esp. HIV gp120 or gp160) in a host
CC comprises admin. to the host at least one EGP of the virus in an amt.
CC sufficient for priming vaccination and at least one peptide derived
CC from an amino acid sequence of the EGP (e.g. the sequence shown),
CC where the peptide comprises at least one virus-neutralisation
CC epitope (VNE). The complex is able to enhance the induction of
CC neutralising antibodies to the virus and to confer long lasting
CC immunity, longer than 6 months.
CC See also R36567-87.
SQ Sequence 33 AA;
SQ 2 A; 3 R; 1 N; 0 D; 0 B; 2 C; 3 Q; 0 E; 0 Z; 3 G; 1 H;
SQ 2 I; 2 L; 2 K; 0 M; 0 F; 2 P; 2 S; 6 T; 0 W; 2 Y; 0 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK

II I 1 II
CTRPYKNTRQSTPIGLGQALYTTRTKSIGQAHC
10 X 20 X 30 



5. US-08-249-182-8 (1-11) 

R14334 HIV-1 amplifier peptide #18.

ID R14334 standard; Protein; 33 AA.
AC R14334;
DT 03-JAN-1992 (first entry)
DE HIV-1 amplifier peptide #18.
KW human immunodeficiency virus; vaccine; human retrovirus; AIDS;
KW acquired immunodeficiency syndrome; envelope glycoprotein.
OS Synthetic.
PN WO9114449-A.
PD 03-OCT-1991.
PF 16-MAR-1991; E00509.
PR 19-MAR-1990; US-494749.
PA (INSP ) INST PASTEUR.
PI Girard M;
DR WPI; 91-310366/42.
PT Enhancing immunogenicity of envelope glycoprotein - for use as
PT vaccine or immunotherapeutic drug especially against HIV, HTLV-I
PT and HTLV-II
PS Claim 12; Page 50; 71pp; English.
CC This peptide is one example of an HIV-1 amplifier peptide for use in
CC a composition for enhancing the immunogenicity of an envelope
CC glycoprotein of a virus. The sequence corresponds to a
CC neutralisation epitope and enhances the induction of persistent
CC neutralising antibodies in the host. The amplifier peptide is used
CC in addition to an envelope glycoprotein for priming the induction of
CC neutralising antibodies. The compositions are particularly
CC useful for vaccinating against HIV, SIV, HTLV-I and HTLV-II.
SQ Sequence 33 AA;
SQ 2 A; 3 R; 1 N; 0 D; 0 B; 2 C; 3 Q; 0 E; 0 Z; 3 G; 1 H;
SQ 2 I; 2 L; 2 K; 0 M; 0 F; 2 P; 2 S; 6 T; 0 W; 2 Y; 0 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK

11 I I II
CTRPYKNTRQSTPIGLGQALYTTRTKSIGQAHC
10 X 20 X 30 



6. US-08-249-182-8 (1-11) 

P70033 Secretory signal sequence of plasmid pSPA31.

ID P70033 standard; protein; 39 AA.
AC P70033;
DT 03-FEB-1991 (first entry)
DE Secretory signal sequence of plasmid pSPA31.
KW Secretory signal sequence; vector; protein secretion.
FH Key Location/Qualifiers
FT Region 30..39
FT /label=multiple cloning site
PN EP-244042-A.
PD 04-NOV-1987.
PF 29-APR-1987; 200813.
PR 02-MAY-1986; EP-200774.
PA (KONN) Gist Brocades NV.
PI Smith HE, Van Ee JH, Peeters BPH, Bron S, Venema G.
DR WPI; 87-308298/44.
PT Plasmid for detecting secretory signal sequence - containing a
PT multiple cloning site and an open reading frame encoding an enzyme.
PS Claim 13; Page 28; 38pp; English.
CC The secretory signal sequence of plasmid pSPA31 comprises a multiple
CC cloning site with restriction sites in reading frame with the
CC structural gene. This sequence allows protein secretion, eg in
CC Bacillus sp., in economically high yields. See also P70028-P70032,
CC P70034-P70041 and N70039-N70041.
SQ Sequence 39 AA;
SQ 5 A; 0 R; 0 N; 2 D; 0 B; 0 C; 0 Q; 8 E; 0 Z; 3 G; 1 H;
SQ 2 I; 5 L; 0 K; 2 M; 4 F; 2 P; 1 S; 1 T; 0 W; 0 Y; 3 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
II II II

MDEVHEEEEFEEAPGLFILLFLFVMAVIGGDPLESTAAA
10 20 30 X 



7. US-08-249-182-8 (1-11)

P70029 Secretory signal sequence of plasmid pSPA3.

ID P70029 standard; protein; 50 AA.
AC P70029;
DT 03-FEB-1991 (first entry)
DE Secretory signal sequence of plasmid pSPA3.
FH Key Location/Qualifiers
FT Region 41..50
FT /label=multiple cloning site
PN EP-244042-A.
PD 04-NOV-1987.
PF 29-APR-1987; 200813.
PR 02-MAY-1986; EP-200774.
PA (KONN) Gist Brocades NV.
PI Smith HE
PI Van Ee JH
PI Peeters BPH
PI Bron S
PI Venema G
DR WPI; 87-308298/44.
PT Plasmid for detecting secretory signal sequence - containing a
PT multiple cloning site and an open reading frame encoding an enzyme.
PS Claim 13; Page 27; 38pp; English.
CC The secretory signal sequence of plasmid pSPA3 comprises a multiple
CC cloning site with restriction sites in reading frame with the
CC structural gene. This sequence allows protein secretion, eg in
CC Bacillus sp., in economically high yields. (See also P70028,
CC P70030, P70031, P70032, P70033, P70034, P70035, P70036, P70037,
CC P70038, P70039, P70040, P70041, N70039, N70040, N70041)
SQ Sequence 50 AA;
SQ 6 A; 0 R; 2 N; 2 D; 0 B; 1 C; 0 Q; 4 E; 0 Z; 5 G; 0 H;
SQ 0 I; 9 L; 4 K; 2 M; 1 F; 2 P; 4 S; 4 T; 0 W; 0 Y; 4 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
I! II II

MKKMLVVLLFSALLLNGCGSGESKANTAETPEVLDVKLTGGDPLESTAAA
10 20 30 40 50 



8. US-08-249-182-8 (1-11)

R13443 FSH inhibiting protein.

ID R13443 standard; protein; 264 AA.
AC R13443;
DT 29-OCT-1991 (first entry)
DE FSH inhibiting protein.
KW FSH-IP; endometriosis; contraceptive; porcine follicular fluid;
KW follicle stimulating hormone.
OS Sus scrofa domestica.
PN US5037805-A.
PD 06-AUG-1991.
PF 20-MAR-1989; 326151.
PR 20-MAR-1989; US-326151.
PA (SALK ) SALK INST FOR BIOL STUD.
PI Ling NC;
DR WPI; 91-252089/34.
PT Use of a FSH inhibiting protein - for treating endometriosis and
PT for use as a male or female contraceptive.
PS Claim 1; Page 7; 11pp; English.
CC The protein was isolated and purified from porcine follicular
CC fluid and may also be prepd. by recombinant DNA techniques. It
CC can be used for regulating ovulation or fertility in female
CC mammals, for regulating spermatogenesis in males, and for treating
CC conditions such as endometriosis which result from an over abun-
CC dance of FSH or oestrogen. Dosage is pref. 10 ug-1 mg/kg/day.
SQ Sequence 264 AA;
OB 10 H. 17 Hr 0 Ni IC U, U Di 10 IV Hi W C» U it £4 bi O Hr
SQ 6 I; 19 L; 20 K; 2 M; 5 F; 23 P; 27 S; 9 T; 1 W; 9 Y; 14 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
llll II

EDTLNKLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCWCVDKYGQPLPGYTTKGKEDVHCYSHQ
200 210 220 230 240 250 260

SK



9. US-08-249-182-8 (1-11)

R05596 Somatomedin carrier protein subunit.

ID R05596 standard; protein; 291 AA.
AC R05596;
DT 31-OCT-1990 (first entry)
DE Somatomedin carrier protein subunit.
KW Human somatomedin carrier protein; LCP2.3; acromegaly;
KW diabetic retinopathy; osteoporosis; ds.
OS Homo sapiens.
PN EP-375438-A.
PD 27-JUN-1990.
PF 21-DEC-1989; 313463.
PR 22-DEC-1988; US-290250.
PA (BIOG-) Biogrowth Inc.
PI Spencer EM, Talkington C;
DR WPI; 90-195533/26.
DR N-PSDB; Q04796.
PT Recombinant DNA molecule -
PT has gene which codes for carrier protein-like polypeptide.
PS Disclosure; p; English.
CC Carrier proteins bind to somatomedin polypeptides, also known as
CC insulin-like polypeptides; they have therapeutic, diagnostic and
CC other applications, such as the inhibition of SM-C in acromegaly,
CC treatment of diabetic retinopathy and wound healing.
SQ Sequence 291 AA;
SQ 25 A; 22 R; 6 N; 12 D; 0 B; 18 C; 11 Q; 17 E; 0 Z; 24 G; 7 H;
SQ 6 I; 25 L; 19 K; 3 M; 5 F; 26 P; 27 S; 11 T; 2 W; 9 Y; 16 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
llll II

EDTLNHLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCWCVDKYGQPLPGYTTKGKEDVHCYSHQ
220 230 240 250 260 270 280 



SK 
290 



10. US-08-249-182-8 (1-11) 

P92300 Sequence of human insulin-like growth factor bindi

ID P92300 standard; protein; 291 AA.
AC P92300;
DT 05-MAR-1990 (first entry)
DE Sequence of human insulin-like growth factor binding protein BP53, from
DE cDNA clone ibp.118.
KW Insulin-like growth factor binding protein; BP53; clone ibp.118.
OS Homo sapiens.
PN WO8909268-A.
PD 05-OCT-1989.
PF 10-MAR-1989; U00983.
PR 22-MAR-1988; US-171623.
PA (ROYA-) Royal Prince Alfred Hospital (GETH) Genentech Inc.
PI Baxter RC, Wood WI;
DR WPI; 89-309533/42.
DR N-PSDB; N91467.
PT DNA encoding insulin-like growth factor binding protein - used to
PT increase insulin-like growth factor circulatory half-life, and as
PT metabolic regulator.
PS Disclosure; Fig. 3(1)-3(2); 72pp; English.
CC BP53 is an acid-stable component of a 125-150 kD glycoprotein complex
CC contained in human plasma. It carries most of the endogenous IGF's, and
CC is regulated by growth hormone. To allow more efficient delivery of IGF
CC to target cells, an AA constituting the transmembrane or membrane-
CC binding domain of normal IGF-I or -II receptor, or a phospholipid anchor
CC domain can be introduced at the C-terminal. BP53 can be used to prolong
CC circulatory half-life of IGF. High levels of BP in adult serum correlate
CC with high levels of growth hormone.
SQ Sequence 291 AA;
SQ 25 A; 22 R; 6 N; 12 D; 0 B; 18 C; 11 Q; 17 E; 0 Z; 25 G; 7 H;
SQ 6 I; 25 L; 19 K; 3 M; 5 F; 25 P; 27 S; 11 T; 2 W; 9 Y; 16 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5
Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
I II I II

EDTLNHLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCHCVDKYGQPLPGYTTKGKEDVHCYSHS
220 230 240 250 260 270 280 



SK 
290 



11. US-08-249-182-8 (1-11)

R32501 Beta-adrenergic receptor.

ID R32501 standard; protein; 400 AA.

AC R32501;

DT 09-JUN-1993 (first entry)

DE Beta-adrenergic receptor.

KW Fat cell specific; BAR; lipolysis; obesity; diagnosis;

KW thermogenesis; metabolism.

OS Rattus rattus.

PN US7783602-A.

PD 15-DEC-1992.

PF 11-NOV-1991; 783602.

PR 01-NOV-1991; US-783602.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Venter CJ;

DR WPI; 93-067426/08.

PT Fat cell specific beta-adrenergic receptor polypeptide - used

PT for diagnosis of obesity due to inactive lipolysis

PS Disclosure; Page 16; 20pp; English.

CC A rat interscapular brown adipose tissue cDNA library was cloned and

CC probed with DNA probes encoding human beta-1 and rat beta-2

CC adrenergic receptors under low stringency conditions. Positive

CC clones were found to be different from the rat and human sequences

CC and contained a single open reading frame of 1200 bp encoding the

CC protein shown, of 400 amino acids and mol. wt. 43 kD. The protein is

CC the fat specific beta-adrenergic receptor and may be used in work on

CC the thermogenesis process. Isolation of the gene for BAR allows the

CC diagnosis and treatment of obesity and the testing of medications

CC for their effectiveness in stimulating the thermogenesis metabolic

CC response in obesity patients.

SQ Sequence 400 AA;

SQ 48 A; 32 R; 13 N; 10 D; 0 B; 9 C; 5 Q; 12 E; 0 Z; 28 G; 4 H;

SQ 10 I; 50 L; 4 K; 6 M; 18 F; 36 P; 33 S; 26 T; 10 W; 9 Y; 37 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
||  | |||

MAPHPHKMGSLAFWSDAPTLDPSAANTSGLPGVPWAAALAGALLALATVGGNLLVITAIARTPRLQTITNVF 
10 20 30 40 50 60 70 

VTSLATADLVVGLLVHPPGATLALTGHWPLGATGCELW 
80 90 100 110 
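
The SQ composition lines in these records list one count per residue letter, and the counts should add up to the stated sequence length. The sketch below is an illustration only, not part of the search software: the helper names are hypothetical, and the R32501 counts are typed in from the record above, reading the two garbled entries as 5 Q and 6 M.

```python
from collections import Counter


def composition(sequence):
    """Count each residue letter in a protein sequence."""
    return Counter(sequence.upper())


def composition_total(counts):
    """Sum of all residue counts; this should equal the sequence length."""
    return sum(counts.values())


# Composition of the 11-residue query peptide used throughout the report.
query_counts = composition("GGQPLWITATK")
print("; ".join(f"{n} {aa}" for aa, n in sorted(query_counts.items())))

# Counts transcribed from the two SQ lines of record R32501
# (assumption: the garbled entries are read as 5 Q and 6 M).
r32501 = {
    "A": 48, "R": 32, "N": 13, "D": 10, "B": 0, "C": 9, "Q": 5, "E": 12,
    "Z": 0, "G": 28, "H": 4, "I": 10, "L": 50, "K": 4, "M": 6, "F": 18,
    "P": 36, "S": 33, "T": 26, "W": 10, "Y": 9, "V": 37,
}
print("R32501 total:", composition_total(r32501))
```

A total that differs from the length on the "SQ Sequence ... AA" line is a quick sign that a count was misread.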



12. US-08-249-182-8 (1-11) 

R13792 E75B exon B1 polypeptide.

ID R13792 standard; Protein; 425 AA.

AC R13792;

DT 29-NOV-1991 (first entry)

DE E75B exon B1 polypeptide.

KW Insect steroid receptor; hormone.

OS Drosophila melanogaster.

PN WO9113167-A.

PD 05-SEP-1991.

PF 15-FEB-1991; U01189.

PR 26-FEB-1990; US-485749.

PA (STRD ) LELAND STANFORD JR UNIV.

PI Hogness DS, Koelle MR, Segraves WA;

DR WPI; 91-281480/38.

DR N-PSDB; Q13573.

PT DNA encoding insect steroid receptors - and ligands, for use as

PT benign inducing factors

PS Disclosure; Page 100; 126pp; English.

CC The amino acid sequence codes for the protein produced by Exon B1

CC which is specific to the E75B transcription unit. Exons 2-5

CC shown in E75A (R13791) are also found in E75B but amino acid

CC residues must be increased by 15 to apply to the E75B protein. The

CC E75 proteins show considerable similarity to members of the steroid

CC receptor family. Since the putative hormone binding E domain of the

CC E75 proteins does not show high sequence homology to the known

CC ecdysone receptor (R13793) it is likely that the E75 proteins bind

CC either a terpenoid juvenile hormone or a novel Drosophila hormone.

CC See also R13792-R13794.

SQ Sequence 425 AA;

SQ 36 A; 10 R; 25 N; 20 D; 0 B; 13 C; 98 Q; 20 E; 0 Z; 13 G; 22 H;

SQ 12 I; 30 L; 18 K; 5 M; 2 F; 17 P; 35 S; 28 T; 1 W; 5 Y; 15 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
|| | | ||



10 20 30 40 50 60 70 



HQHQHQQQAKSQQLKQQHSALVKLLESAPIKQQQQT
80 90 100 



13. US-08-249-182-8 (1-11) 

R38691 Mitochondria ATPase beta subunit. 

ID R38691 standard; Protein; 551 AA. 

AC R38691;

DT 11-NOV-1993 (first entry)

DE Mitochondria ATPase beta subunit.

KW Rice; mitochondria; ATPase; beta subunit; male sterility.

OS Oryza sativa.

FH Key Location/Qualifiers

FT Peptide 1..85

FT /note= "Transit peptide"

FT Protein 86..551

FT /note= "ATPase beta subunit"

PN J05137581-A.

PD 01-JUN-1993.

PF 19-NOV-1991; 303251.

PR 19-NOV-1991; JP-303251.

PA (MITK ) MITSUI TOATSU CHEM INC.

DR WPI; 93-211307/26.

DR N-PSDB; Q42748.

PT cDNA of rice mitochondria adenosine triphosphatase beta sub-unit

PT - for introduction of prod. of foreign gene into mitochondrial

PT used for recovery of male sterility of rice plant

PS Claim 1; Page 5-8; 8pp; Japanese.

CC This sequence is encoded by the rice mitochondria ATPase beta subunit

CC gene. The gene sequence may be used in the construction of an

CC artificial mitochondria ATPase beta subunit for introduction into

CC mitochondria. These may be used for the recovery of male sterile

CC rice plants. This sequence is given as it is represented in the

CC specification.

SQ Sequence 551 AA;

SQ 57 A; 37 R; 16 N; 28 D; 0 B; 2 C; 23 Q; 34 E; 0 Z; 56 G; 11 H;

SQ 34 I; 49 L; 21 K; 10 M; 18 F; 30 P; 28 S; 36 T; 0 W; 13 Y; 48 V;

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
||    || ||

AVHFRDAEGQDVLLFIDNIFRFTSANSEVSALLGRIPSAVGYSPTLATDLGGLQERITTTKKGSITSVSAIY
320 330 340 350 360 X 370 X 380

VPADDLTDPAPATTFAHLDATTVLSRQISELGIYPAVDP
390 400 410 420 



14. US-08-249-182-8 (1-11) 

P98464 Sequence of C. trachomatis serovar K major outer m 

ID P98464 standard; Protein; 14 AA. 

AC P98464; 

DT 06-MAR-1992 (first entry)

DE Sequence of C. trachomatis serovar K major outer membrane protein (MOMP)

DE variable domain (VD) K-VDIII encoded by base pairs 742-783

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;

KW diagnosis; serotyping; non-immunologic assay.

OS Chlamydia trachomatis.

PN US7324664-A.

PD 29-AUG-1989.

PF 17-MAR-1989; 324664.

PR 17-MAR-1989; US-324664.

PA (USSH ) US DEPT HEALTH & HUMAN.

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;

DR WPI; 89-339697/46.

DR N-PSDB; N97099.

PT Chlamydia trachomatis genes - used for determn. of nucleotide and

PT amino sequences of the variable domains of the major outer

PT membrane proteins

PS Disclosure; Fig 18; 49pp; English.

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis

CC serovars and the amino acid sequences were deduced. The MOMP VDs

CC with the greatest total hydrophilicity and charge values were found

CC to be the location of antigenic determinants recognised by MOMP

CC specific monoclonal antibodies. The nucleotide, amino acid

CC sequences and hydrophilicity/charge value analyses will assist in

CC the selection of appropriate MOMP antigenic determinants to be used

CC in the construction of synthetic peptides, subunits or recombinant

CC chlamydial vaccines. This will allow the prodn. of reagents and

CC methodologies applicable in the development of new diagnostic tests

CC for serotyping.

SQ Sequence 14 AA;

SQ 3 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;

SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 1 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.27
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
   || |||
VEFPLDITAGTEAA 
X 10 



15. US-08-249-182-8 (1-11) 

P98452 Sequence of C. trachomatis serovar H major outer m 

ID P98452 standard; Protein; 14 AA.

AC P98452;

DT 06-MAR-1992 (first entry)

DE Sequence of C. trachomatis serovar H major outer membrane protein (MOMP)

DE variable domain (VD) H-VDIII encoded by base pairs 742-783

KW Chlamydia trachomatis; antigen; monoclonal antibody; vaccine;

KW diagnosis; serotyping; non-immunologic assay; ss.

OS Chlamydia trachomatis.

PN US7324664-A.

PD 29-AUG-1989.

PF 17-MAR-1989; 324664.

PR 17-MAR-1989; US-324664.

PA (USSH ) US DEPT HEALTH & HUMAN.

PI Caldwell HD, Ying Y, Zhang YX, Watkins NG;

DR WPI; 89-339697/46.

DR N-PSDB; N97087.

PT Chlamydia trachomatis genes - used for determn. of nucleotide and

PT amino sequences of the variable domains of the major outer

PT membrane proteins

PS Disclosure; Fig 15; 49pp; English.

CC The inventors sequenced the 4 MOMP VDs of ten C. trachomatis

CC serovars and the amino acid sequences were deduced. The MOMP VDs

CC with the greatest total hydrophilicity and charge values were found

CC to be the location of antigenic determinants recognised by MOMP

CC specific monoclonal antibodies. The nucleotide, amino acid

CC sequences and hydrophilicity/charge value analyses will assist in

CC the selection of appropriate MOMP antigenic determinants to be used

CC in the construction of synthetic peptides, subunits or recombinant

CC chlamydial vaccines. This will allow the prodn. of reagents and

CC methodologies applicable in the development of new diagnostic tests

CC for serotyping.

SQ Sequence 14 AA;

SQ 4 A; 0 R; 0 N; 1 D; 0 B; 0 C; 0 Q; 2 E; 0 Z; 1 G; 0 H;

SQ 1 I; 1 L; 0 K; 0 M; 1 F; 1 P; 0 S; 2 T; 0 W; 0 Y; 0 V;

Initial Score = 5 Optimized Score = 5 Significance = 3.27
Residue Identity = 45% Matches = 5 Mismatches = 6

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
   || |||
AEFPLDITAGTEAA

X 10 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_8p.res made by on Thu 22 Sep 94 10:16:43-PDT.

Query sequence being compared: US-08-249-182-8 (1-11)
Number of sequences searched: 70848
Number of scores above cutoff: 4261

Results of the initial comparison of US-08-249-182-8 (1-11) with:
Data bank : PIR 41, all entries 

[Histogram of initial scores: number of sequences (logarithmic axis, 5 to 100000) plotted against score (0 to 11) and standard deviations from the mean (-1 to 8).]



PARAMETERS

Similarity matrix      Unitary     K-tuple              2
Mismatch penalty             1     Joining penalty     20
Gap penalty               1.00     Window size          5
Gap size penalty          0.05
Cutoff score                 0
Randomization group          0

Initial scores to save      40     Alignments to save  15
Optimized scores to save     0     Display context     50

SEARCH STATISTICS

Scores:     Mean    Median    Standard Deviation
               2         3                  0.98

Times:      CPU            Total Elapsed
            00:01:26.02    00:01:36.00



Number of residues: 20816057 
Number of sequences searched: 70848 
Number of scores above cutoff: 4261 

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
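
The significance values in this report are consistent with a z-score: the initial score minus the mean score, divided by the standard deviation. The sketch below assumes that reading and uses the rounded figures printed under SEARCH STATISTICS (mean 2, standard deviation 0.98). The function name is illustrative and is not taken from FastDB.

```python
def significance(initial_score, mean, stdev):
    """Return how many standard deviations a score lies above the mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (initial_score - mean) / stdev


# Rounded search statistics as printed in the report (assumed inputs).
MEAN, STDEV = 2.0, 0.98

for score in (7, 6):
    print(score, round(significance(score, MEAN, STDEV), 2))
```

With these inputs a score of 6 gives about 4.08 and a score of 7 about 5.10, as listed in the report. A score of 11 gives about 9.18 against the printed 9.17, presumably because the printed mean and standard deviation are rounded.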



A 100% similar sequence to the query sequence was found:

                                                           Init.  Opt.
     Sequence Name Description                     Length Score Score  Sig. Frame

  1. A42329        autotaxin - human (fragments)      114    11    11  9.17     0

The list of other best scores is:

                                                           Init.  Opt.
     Sequence Name Description                     Length Score Score  Sig. Frame

               **** 5 standard deviations above mean ****

  2. WZBE2         gene 2 protein - human herpes      238     7     7  5.10     0
  3. XYCHFA        fatty-acid synthase (EC 2.3.1     2446     7     7  5.10     0

               **** 4 standard deviations above mean ****

  4. S20178        hypothetical protein (IFH1 3'      125     6     6  4.08     0
  5. YQECC1        CFA1 fimbrial protein precurs      170     6     6  4.08     0
  6. A38487        helix-destabilizing protein -      174     6     6  4.08     0
  7. DDECIB        helix-destabilizing protein -      175     6     6  4.08     0
  8. B34078        prolactin-related protein III      213     6     6  4.08     0
  9. S33428        spdA protein - Streptomyces a      224     6     6  4.08     0
 10. 14oojc        transfer protein spdA - Strep      224     6     6  4.08     0
 11. A44914        phosphate-dependent exoribonu      245     6     6  4.08     0
 12. S03888        photosystem II oxygen-evolvin      248     6     6  4.08     0
 13. S22763        photosystem II oxygen-evolvin      258     6     6  4.08     0
 14. F2T0X2        photosystem II oxygen-evolvin      258     6     6  4.08     0
 15. JS0771        photosystem II oxygen-evolvin      25?     6     6  4.08     0
 16. S07467        photosystem II oxygen-evolvin      25?     6     6  4.08     0
 17. S10016        photosystem II oxygen-evolvin      260     6     6  4.08     0
 18. S15005        photosystem II oxygen-evolvin      265     6     6  4.08     0
 19. IOHU3         insulin-like growth factor-bi      291     6     6  4.08     0
 20. A36748        insulin-like growth factor-bi      292     6     6  4.08     0
 21. PIFNZRP       matrix protein - rinderpest v      335     6     6  4.08     0
 22. S11738        hemagglutinin precursor - inf      378     6     7  4.08     0
 23. A39314        gastricsin (EC 3.4.23.3) prec      384     6     6  4.08     0
 24. S13094        glycerol-3-phosphate dehydrog      385     6     6  4.08     0
 25. A31811        gastricsin (EC 3.4.23.3) prec      388     6     6  4.08     0
 26. A29937        gastricsin (EC 3.4.23.3) prec      388     6     6  4.08     0
 27. A38621        aspartate transaminase (EC 2.      392     6     6  4.08     0
 28. A33510        gastricsin (EC 3.4.23.3) - ra      392     6     6  4.08     0
 29. A24608        pepsin A (EC 3.4.23.1) precur      392     6     6  4.08     0
 30. S29808        beta-3-adrenergic receptor -       400     6     6  4.08     0
 31. A53281        beta 3-adrenergic receptor -       400     6     6  4.08     0
 32. JS0349        hypothetical 45K protein (sbc      400     6     7  4.08     0
 33. A41679        beta-3-adrenergic receptor -       400     6     6  4.08     0
 34. A4/U41        transposase homolog (insertio      408     6     6  4.08     0
 35. B39096        alkaline phosphatase (EC 3.1.      462     6     6  4.08     0
 36. S16359        adenylate cyclase (EC 4.6.1.1      469     6     6  4.08     0
 37. S41616        atpB protein - Euglena gracil      480     6     6  4.08     0
 38. S34547        H+-transporting ATP synthase       480     6     6  4.08     0
 39. A26951        nifB protein - Rhizobium meli      490     6     6  4.08     0
 40. C24829        H+-transporting ATP synthase       491     6     6  4.08     0



1. US-08-249-182-8 (1-11)

A42329 autotaxin - human (fragments)

ENTRY A42329 #type fragments

TITLE autotaxin - human (fragments)

ORGANISM #formal_name Homo sapiens #common_name man

DATE 04-Mar-1993; #sequence_revision 01-Jan-1993; #text_change

08-May-1993
ACCESSIONS A42329
REFERENCE A42329

#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.

#journal J. Biol. Chem. (1992) 267:2524-2529

#title Identification, purification, and partial sequence analysis

of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329

##status preliminary
##molecule_type protein
##residues 1-114 ##label STR

##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518;
NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510;
NCBIP:78509; NCBIP:78508; NCBIP:78503
##note sequence extracted from NCBI backbone

SUMMARY #length 114 #checksum 7335

SEQUENCE

Initial Score = 11 Optimized Score = 11 Significance = 9.17
Residue Identity = 100% Matches = 11 Mismatches = 0

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
|||||||||||

EFLSNYLTNVDDITLVPGTLGRDIEHLTSLDFFRVNSMQTVFVGYGPTFKGGQPLWITATKSPPFENINLYY
10 20 30 40 50 X 60 X 70

DVPWNETIPEEVTXPNYLQAEVSYPAFKPXLDVYKWHVA
80 90 100 110



2. US-08-249-182-8 (1-11)

WZBE2 gene 2 protein - human herpesvirus 3

ENTRY WZBE2 #type complete

TITLE gene 2 protein - human herpesvirus 3

ORGANISM #formal_name human herpesvirus 3, varicella-zoster virus

DATE 30-Sep-1988 #sequence_revision 30-Sep-1988 #text_change

08-Apr-1994
ACCESSIONS B27212
REFERENCE A27345

#authors Davison, A.J.; Scott, J.E.

#journal J. Gen. Virol. (1986) 67:1759-1816

#title The complete DNA sequence of varicella-zoster virus.

#cross-references MUID:86306657

#accession B27212
##molecule_type DNA
##residues 1-238 ##label DAV
COMMENT The DNA sequence was obtained from EMBL, release 13.
GENETICS

#gene 2

CLASSIFICATION #superfamily varicella-zoster virus gene 2 protein

SUMMARY #length 238 #molecular-weight 25984 #checksum 1948
SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 5.10
Residue Identity = 63% Matches = 7 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
||  | || ||

SETLAYGHVPAFIMGSTLVRPSLNATAEENPASETRCLLRVLAGRTVDLPGGGTLHITCTKTYVIIGKYSKP 
10 20 30 40 50 X 60 X 70 



GERLSLARLIGRAMTPGGARTFIILAMKEKRSTTLGYEC 
80 90 100 110 
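
The Matches, Mismatches and Residue Identity figures reported for each hit can be reproduced from a gap-free alignment by comparing the query with the aligned segment position by position. The sketch below is an assumption about how those figures relate, not FastDB's own code: it truncates the identity percentage to a whole number, which agrees with the values printed in this report (7 of 11 is reported as 63%). The function name is hypothetical.

```python
def ungapped_identity(query, segment):
    """Compare two equal-length peptides position by position.

    Returns (matches, mismatches, identity_percent, match_line), where
    identity_percent is truncated to a whole number.
    """
    if len(query) != len(segment):
        raise ValueError("ungapped comparison needs equal-length strings")
    flags = [a == b for a, b in zip(query, segment)]
    matches = sum(flags)
    mismatches = len(query) - matches
    identity_percent = 100 * matches // len(query)
    match_line = "".join("|" if same else " " for same in flags)
    return matches, mismatches, identity_percent, match_line


# Query peptide against the aligned segment of WZBE2 (hit 2 above).
matches, mismatches, pct, line = ungapped_identity("GGQPLWITATK", "GGGTLHITCTK")
print("GGQPLWITATK")
print(line)
print("GGGTLHITCTK")
print(f"Matches = {matches}  Mismatches = {mismatches}  Identity = {pct}%")
```

In the gap-free alignments shown in this report, the initial score equals the number of matching positions, which is what a unitary similarity matrix would be expected to give.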



3. US-08-249-182-8 (1-11) 

XYCHFA fatty-acid synthase (EC 2.3.1.85) - chicken 



ENTRY XYCHFA #type complete

TITLE fatty-acid synthase (EC 2.3.1.85) - chicken

CONTAINS 3-hydroxypalmitoyl-[acyl-carrier-protein] dehydratase (EC

4.2.1.61); 3-oxoacyl-[acyl-carrier-protein] reductase (EC
1.1.1.100); 3-oxoacyl-[acyl-carrier-protein] synthase (EC
2.3.1.41); enoyl-[acyl-carrier-protein] reductase (NADPH,
B-specific) (EC 1.3.1.10); oleoyl-[acyl-carrier-protein]
hydrolase (EC 3.1.2.14); [acyl-carrier-protein]
S-acetyltransferase (EC 2.3.1.38); [acyl-carrier-protein]
S-malonyltransferase (EC 2.3.1.39)
ORGANISM #formal_name Gallus gallus #common_name chicken

DATE 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change

31-Dec-1993

ACCESSIONS A33918; A30445; A31236; B31236; A30297; A31184; A31185;

A30446; S03856
REFERENCE A33918

#authors Holzer, K.P.; Liu, W.; Hammes, G.G.

#journal Proc. Natl. Acad. Sci. U.S.A. (1989) 86:4387-4391

#title Molecular cloning and sequencing of chicken liver fatty acid

synthase cDNA.
#cross-references MUID:89282777
#accession A33918
##molecule_type mRNA
##residues 1-1701 ##label HOL
#accession A30445

##molecule_type protein
##residues 33-39;1012-1017 ##label HOL2
REFERENCE A31236

#authors Yuan, Z.; Liu, W.; Hammes, G.G.

#journal Proc. Natl. Acad. Sci. U.S.A. (1988) 85:6328-6331

#title Molecular cloning and sequencing of DNA complementary to

chicken liver fatty acid synthase mRNA.
#cross-references MUID:88320436
#accession A31236
##molecule_type mRNA
##residues 1678-2446 ##label YUA
#accession B31236
##molecule_type mRNA

##residues 1678-2275,2284-2446 ##label YUA2
REFERENCE A30297

#authors Chirala, S.S.; Kasturi, R.; Pazirandeh, M.; Stolow, D.T.;

Huang, W.Y.; Wakil, S.J.
#journal J. Biol. Chem. (1989) 264:3750-3757

#title A novel cDNA extension procedure. Isolation of chicken fatty

acid synthase cDNA clones.
#cross-references MUID:89139426
#accession A30297
##molecule_type mRNA

##residues 1494-1503,'W',1505-1658, ,1660-1671,'S',1673-2275,

2284-2446 ##label CHI
##cross-references EMBL:J04485
REFERENCE A31184

#authors Yang, C.Y.; Huang, W.Y.; Chirala, S.; Wakil, S.J.
#journal Biochemistry (1988) 27:7773-7777

#title Complete amino acid sequence of the thioesterase domain of

chicken liver fatty acid synthase.
#cross-references MUID:89088151
#accession A31184

##molecule_type protein

##residues 2135-2275,2284-2442 ##label YAN
REFERENCE A31185

#authors Kasturi, R.; Chirala, S.; Pazirandeh, M.; Wakil, S.J.
#journal Biochemistry (1988) 27:7778-7785

#title Characterization of a genomic and cDNA clone coding for the

thioesterase domain and 3' noncoding region of the chicken
liver fatty acid synthase gene.

#cross-references MUID:89088152

#accession A31185
##molecule_type DNA

##residues 2128-2275,2284-2446 ##label KAS
##cross-references EMBL:J02839
#accession A30446
##molecule_type mRNA

##residues 2128-2275,2284-2446 ##label KA2
REFERENCE S03856

#authors Huang, W.Y.; Stoops, J.K.; Wakil, S.J.

#journal Arch. Biochem. Biophys. (1989) 270:92-98

#title Complete amino acid sequence of chicken liver acyl carrier

protein derived from the fatty acid synthase.
#cross-references MUID:89192401
#accession S03856

##molecule_type protein
##residues 2047-2135 ##label HUA
CLASSIFICATION #superfamily rat fatty-acid synthase

KEYWORDS acyltransferase; carbon-oxygen lyase; carrier protein; fatty

acid biosynthesis; homodimer; hydro-lyase; mammary gland;
oxidoreductase; phosphopantetheine; thiolester hydrolase

FEATURE

1-2446 #protein fatty-acid synthase (isoform 1) #status

predicted #label MAT\
1-2275,2284-2446 #protein fatty-acid synthase (isoform 2) #status

predicted #label MA2\
1-338 #domain 3-oxoacyl-[acyl-carrier-protein] synthase

#status predicted #label ENZ1\
1248-1266 #region active-site region (of

3-hydroxypalmitoyl-[acyl-carrier-protein] dehydratase)

#status predicted\

2047-2143 #domain acyl carrier protein #status predicted #label

ACP\

2144-2446 #domain oleoyl-[acyl-carrier-protein] hydrolase #status

predicted #label ENZ7\
87 #active_site Cys (of 3-oxoacyl-[acyl-carrier-protein]

synthase) #status predicted\
506 #active_site Ser (of [acyl-carrier-protein]

acetyl/malonyltransferase) #status predicted\
1631 #active_site Ser (of enoyl-[acyl-carrier-protein]

reductase) #status predicted\
1634 #active_site Lys (of enoyl-[acyl-carrier-protein]

reductase) #status predicted\
1856 #active_site Lys (of 3-oxoacyl-[acyl-carrier-protein]

reductase) #status predicted\
2084 #binding_site phosphopantetheine (Ser) (covalent) (in

acyl carrier protein) #status predicted\
2235 #active_site Ser (of oleoyl-[acyl-carrier-protein]

hydrolase) #status predicted
SUMMARY #length 2446 #molecular-weight 267247 #checksum 8579

SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 5.10
Residue Identity = 63% Matches = 7 Mismatches = 4

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
  |||| |||

LNLVAMKRSFFGSVIFLCRRQSPAKAPILLPVDDTHYKWVDSLKEILADSSEQPLWLTATNCGNSGILGMVN
1320 1330 1340 1350 1360 1370 1380 

CLRLEAEGHRIRCVFVSNLSPSSTVPATSLSSLEMQKII 
1390 1400 1410 1420 



4. US-08-249-182-8 (1-11) 



S20178 hypothetical protein (IFH1 3' region) - yeast (Sac

ENTRY S20178 #type fragment

TITLE hypothetical protein (IFH1 3' region) - yeast (Saccharomyces

cerevisiae) (fragment)
ORGANISM #formal_name Saccharomyces cerevisiae

DATE 23-Apr-1993 #sequence_revision 23-Apr-1993 #text_change

30-Sep-1993
ACCESSIONS S20178; S17024

REFERENCE S20177

#authors Vambutas, A.; Ackerman, S.H.; Tzagoloff, A.
#journal Eur. J. Biochem. (1991) 201:643-652

#title Mitochondrial translational-initiation and elongation factors

in Saccharomyces cerevisiae.
#cross-references MUID:92037620
#accession S20178
##molecule_type DNA
##residues 1-125 ##label VAM
##cross-references EMBL:X58379
SUMMARY #length 125 #checksum 772

SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
|| ||    ||

PRTEKLSKFLDDDTFQKFQEVVGYNPLQVLRYDLGGKPLLYAETKVDILSTVPRPGYNPSSQRIFEMQLMPK
10 20 30 X 40 X 50 60 70 



MIFDLEEVVSVDNGMEHGTILVF 
80 90 



5. US-08-249-182-8 (1-11) 

YQECC1 CFA1 fimbrial protein precursor - Escherichia coli

ENTRY YQECC1 #type complete

TITLE CFA1 fimbrial protein precursor - Escherichia coli

ALTERNATE_NAMES CFA1 pilin; colonization factor antigen I (CFA1)
ORGANISM #formal_name Escherichia coli

DATE 14-Nov-1983 #sequence_revision 30-Jun-1991 #text_change

31-Dec-1993
ACCESSIONS A30589; A03495; A43831

REFERENCE A30589

#authors Karjalainen, T.K.; Evans, D.G.; So, M.; Lee, C.H.

#journal Infect. Immun. (1989) 57:1126-1130

#title Molecular cloning and nucleotide sequence of the colonization

factor antigen I gene of Escherichia coli.
#cross-references MUID:89173309
#accession A30589
##molecule_type DNA
##residues 1-170 ##label KAR
REFERENCE A03495
#authors Klemm, P.

#journal Eur. J. Biochem. (1982) 124:339-348

#title Primary structure of the CFA1 fimbrial protein from human

enterotoxigenic Escherichia coli strains.
#cross-references MUID:82235736
#contents Strain H10407
#accession A03495
##molecule_type DNA

##residues 24-75,'N',77-96,'A',98-170 ##label KLE
REFERENCE A43831

#authors Cassels, F.J.; Deal, C.D.; Reid, R.H.; Jarboe, D.L.; Nauss,

J.L.; Carter, J.M.; Boedeker, E.C.
#journal Infect. Immun. (1992) 60:2174-2181

#title Analysis of Escherichia coli colonization factor antigen I

linear B-cell epitopes, as determined by primate responses,
following protein sequence verification.
#cross-references MUID:92267624
#contents Strain H10407
#accession A43831

##molecule_type protein
##residues 24-170 ##label CAS
##cross-references NCBIP:104220
##note sequence extracted from NCBI backbone

COMMENT The CFA1 fimbriae are rather rigid, thread-like filaments of 0.5-1
micrometer, with an apparent axial hole, and a diameter of 7
nanometers. A single CFA1 fimbria consists of about 100 identical
protein subunits.
CLASSIFICATION #superfamily CFA1 fimbrial protein
KEYWORDS fimbria; fimbrial antigen

FEATURE

1-23 #domain signal sequence #label SIG\

24-170 #protein CFA1 fimbrial protein #label MAT

SUMMARY #length 170 #molecular-weight 17461 #checksum 1166

SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
||| |  ||

PASKTFESYRVHTQVHTNDATKKVIVKLADTPQLTDVLNSTVQMPISVSWGGQVLSTTAKEFEAAALGYSAS
60 70 80 90 100 110 120

GVNGVSSSQELVISAAPKTAGTAPTAGNYSGVVSLVMTL
130 140 150 160 



6. US-08-249-182-8 (1-11) 

A38487 helix-destabilizing protein - Escherichia coli pla 



ENTRY A38487 #type fragment 

TITLE helix-destabilizing protein - Escherichia coli plasmid R64 

(fragment) 

ORGANISM #formal_name Escherichia coli 

DATE 23-Aug-1991 #sequence_revision 23-Aug-1991 #text_change 

18-Jun-1993 
ACCESSIONS A38487 
REFERENCE A38487 

#authors Ruvolo, P.P.; Keating, K.M.; Williams, K.R.; Chase, J.W.

#journal Proteins (1991) 9:120-134

#title Single-stranded DNA binding proteins (SSBs) from prokaryotic

transmissible plasmids.
#cross-references MUID:91180109
#accession A38487

##status preliminary

##molecule_type DNA

##residues 1-174 ##label RUV

##note sequence not compared to nucleotide translation

GENETICS

#genome plasmid
CLASSIFICATION #superfamily bacterial helix-destabilizing protein
SUMMARY #length 174 #checksum 691

SEQUENCE 



ORGANISM #formal_name Bos primigenius taurus #common_name cattle

DATE 30-Mar-1990 #sequence_revision 30-Mar-1990 #text_change

30-Sep-1993
ACCESSIONS B34078
REFERENCE A34078

#authors Kessler, M.A.; Milosavljevic, M.; Zieler, C.G.; Schuler, L.A.

#journal Biochemistry (1989) 28:5154-5161

#title A subfamily of bovine prolactin-related transcripts distinct

from placental lactogen in the fetal placenta.
#cross-references MUID:89352599
#accession B34078

##status preliminary
##molecule_type mRNA
##residues 1-213 ##label KES
##cross-references GB:M27240
SUMMARY #length 213 #molecular-weight 25016 #checksum 9382

SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
  ||  | |||

aVNSCPSCCPDVFDIPLESLTHLFLNASRLSHDIVNHTTIMFHEFDEKYAQNQPYTINATKSCHTNSLHTPQ
10 20 30 40 50 60 70 80 



EREKALRNNNEDLSKWILMLLYSUHRPLYLLVKDLSSNK 
90 100 110 120 



9. US-08-249-182-8 (1-11) 

S33428 spdA protein - Streptomyces ambofaciens 



ENTRY S33428 #type complete

TITLE spdA protein - Streptomyces ambofaciens

ORGANISM #formal_name Streptomyces ambofaciens

DATE 06-Mar-1994; #sequence_revision 06-Mar-1994; #text_change

06-Mar-1994
ACCESSIONS S33428
REFERENCE S33420

#authors Hagege, J.M.; Pernodet, J.L.; Gerbaud, C.; Sezonov, G.;

Friedmann, A.; Guerineau, M.
#submission submitted to the EMBL Data Library, January 1993
#accession S33428

##status preliminary
##residues 1-224 ##label HAG
##cross-references EMBL:Z19593
SUMMARY #length 224 #molecular-weight 23575 #checksum 9089

SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
||  |  |||

GPSRLAWSWFVIALVASLGANVATAGLLDLNDVPAWLRILVAAWPALAFMGGTLLAHTATHHEPEAPAPTQP
70 80 90 100 110 X 120 X 130



APEPPAFTDEHDLVRVDDTEEPPELPAPGLQQAPAPPAV 
140 150 160 170 



Initial Score = 6 Optimized Score = 6 Significance = 4.08

Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
 |||    |||

YIEGQLRTRSWDDNGITRYITEILVKTTGTMQMLGSAPQQNAQAQPKPQQNGQPQSADATKKGGAKTKGRER
80 90 100 110 120 130 140

KAAQPEPQPQTPEGEDYGFSDDIPF
150 160 170 



7. US-08-249-182-8 (1-11) 

DDECIB helix-destabilizing protein - Escherichia coli pla

ENTRY DDECIB #type complete

TITLE helix-destabilizing protein - Escherichia coli plasmid

ColIb-P9
ALTERNATE_NAMES single-stranded DNA-binding protein
ORGANISM #formal_name Escherichia coli

DATE 31-Dec-1990 #sequence_revision 31-Dec-1990 #text_change

30-Jun-1993
ACCESSIONS A32304
REFERENCE A32304

#authors Howland, C.J.; Rees, C.E.D.; Barth, P.T.; Wilkins, B.M.
#journal J. Bacteriol. (1989) 171:2466-2473
#title The ssb gene of plasmid ColIb-P9.
#cross-references MUID:89213928
#accession A32304
##molecule_type DNA
##residues 1-175 ##label HOW
COMMENT The plasmid-encoded single-stranded DNA-binding proteins may be
involved in DNA metabolism during bacterial conjugation; their
functions are also related to the plasmid-mediated SOS initiation
process.

GENETICS

#gene ssb

#genome plasmid
CLASSIFICATION #superfamily bacterial helix-destabilizing protein
KEYWORDS DNA repair; DNA replication; single-stranded DNA binding
FEATURE

2-174 #protein helix-destabilizing protein #label MAT
SUMMARY #length 175 #molecular-weight 19240 #checksum 1668
SEQUENCE

Initial Score = 6 Optimized Score = 6 Significance = 4.08
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
 |||    |||

YIEGQLRTRSMDDNGITRYITEILVKTTGTMQMLGSAPQQNAQAQPKPQQNGQPQSADATKKGGAKTKGRGR
80 90 100 110 120 130 140 150

KAAQPEPQPQTPEGEDYGFSDDIPF
160 170 



8. US-08-249-182-8 (1-11) 

B34078 prolactin-related protein III - bovine

ENTRY B34078 #type complete

TITLE prolactin-related protein III - bovine 



                                                           Init.  Opt.
     Sequence Name Description                     Length Score Score  Sig. Frame

               **** 7 standard deviations above mean ****

  2. A39216        plasma cell membrane protein       925    10    11  7.48     0
  3. S21706        nucleotide pyrophosphatase         925    10    11  7.48     0

               **** 6 standard deviations above mean ****

  4. Ac/410        plasma cell membrane protein       905     9     9  6.55     0

               **** 5 standard deviations above mean ****

  5. RKAALC        ribulose-bisphosphate carboxy      474     8     8  5.61     0

               **** 4 standard deviations above mean ****

  6. DSNVAC        superoxide dismutase (EC 1.15      151     7     7  4.68     0
  7. A35216        FPD4 protein - fowlpox virus       218     7     8  4.68     0
  8. A24637        T-cell surface glycoprotein C      236     7     7  4.68     0
  9. A45442        Sec13p=endoplasmic reticulum       297     7     7  4.68     0
 10. S30803        SEC13 protein - yeast (Saccha      297     7     7  4.68     0
 11. S06938        glutamate dehydrogenase (NADP      446     7     7  4.68     0
 12. A33504        glutamate dehydrogenase (EC 1      447     7     7  4.68     0
 13. A22413        glutamate dehydrogenase (NADP      447     7     7  4.68     0
 14. DEECEN        glutamate dehydrogenase (NADP      447     7     7  4.68     0
 15. APBOL         leucyl aminopeptidase (EC 3.4      478     7     7  4.68     0
 16. A61191        nuclear hormone receptor ST-5      564     7     7  4.68     0
 17. HNNZB3        hemagglutinin-neuraminidase (      572     7     7  4.68     0
 18. S28142        SEC23 protein homolog, 64.7K       573     7     7  4.68     0
 19. A37251        early response protein NAK1 -      598     7     7  4.68     0
 20. JQ1894        gene p74 protein - pigeonpox       630     7     7  4.68     0
 21. A30347        exotoxin A precursor - Pseudo      638     7     7  4.68     0
 22. [illegible]   ferrienterochelin receptor pr      745     7     7  4.68     0
 23. H36790        hypothetical protein ORF43 -       891     7     7  4.68     0
 24. S07421        E2 glycoprotein precursor - a     1162     7     7  4.68     0
 25. VGIHAK        E2 glycoprotein precursor - a     1162     7     7  4.68     0
 26. S12127        protein-tyrosine kinase (EC 2     1187     7     7  4.68     0
 27. JQ1258        RNA-directed RNA polymerase (     1335     7     7  4.68     0
 28. S32437        polyprotein - Volvox carteri      1462     7     9  4.68     0


29. 


A30788 


mannose 6-phosphate receptor 


2499 


7 


7 


4.68 


0 






3 standard deviations above mean ****

30. S27205  interleukin-2 precursor - mou     63     6     6   3.74     0
31. S00972  kcrB2 protein - Escherichia c     66     6     6   3.74     0
32. PN0088  matrix protein M3 - influenza     68     6     7   3.74     0
33. PN0085  matrix protein M3 - influenza     68     6     7   3.74     0
34. E42516  D-ORF-B protein - vaccinia vi     80     6           3.74     0
35. QQVZ4   hypothetical protein B-80 - v     80     6           3.74     0
36. S28564  hypothetical protein 2 - phag     92     6     6   3.74     0
37. b4Ul /U gene fdxB protein - Plectonem     96     6     7   3.74     0
38. MU/U3   plastocyanin - parsley            97     6           3.74     0
39. S10702  plastocyanin - parsley            97     6           3.74     0
40. S06105  plastocyanin - rice               97     6           3.74     0



1. US-08-249-182-9 (1-16) 



A42329 autotaxin - human (fragments) 

ENTRY A42329 #type fragments 

TITLE autotaxin - human (fragments) 

ORGANISM #formal_name Homo sapiens #common_name man
DATE 04-Mar-1993 #sequence_revision 01-Jan-1993 #text_change 08-May-1993
ACCESSIONS A42329
REFERENCE A42329
#authors Stracke, M.L.; Krutzsch, H.C.; Unsworth, E.J.; Arestad, A.;
Cioce, V.; Schiffmann, E.; Liotta, L.A.
#journal J. Biol. Chem. (1992) 267:2524-2529
#title Identification, purification, and partial sequence analysis
of autotaxin, a novel motility-stimulating protein.
#cross-references MUID:92129337
#accession A42329



[Histogram, lower part: x-axis SCORE from 0 to 16 with a matching STDEV scale from -1 to 9; plotted points not recoverable]



PARAMETERS 



Similarity matrix          Unitary
Mismatch penalty           1
Gap penalty                1.00
Gap size penalty           0.05
Cutoff score               0
Randomization group        0

K-tuple                    2
Joining penalty            20
Window size                5

Initial scores to save     40
Optimized scores to save   0

Alignments to save         15
Display context            50



SEARCH STATISTICS 



Scores:

Mean   Median   Standard Deviation
2      3        1.07

Times:

CPU             00:01:27.02
Total Elapsed   00:01:35.00

Number of residues: 20816057
Number of sequences searched: 70848
Number of scores above cutoff: 3804

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 
Cut-off raised to 6. 



The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
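
The significance values in this report appear to be standard scores: the number of standard deviations by which an initial score exceeds the mean score of the search. The Python sketch below works under that assumption and uses the rounded mean and standard deviation printed under SEARCH STATISTICS. The function name `significance` is illustrative only and is not part of FastDB.

```python
def significance(score: float, mean: float, std_dev: float) -> float:
    """Return how many standard deviations `score` lies above `mean`."""
    if std_dev <= 0:
        raise ValueError("std_dev must be positive")
    return (score - mean) / std_dev


# Rounded values as printed in the SEARCH STATISTICS block of this report.
MEAN = 2
STD_DEV = 1.07

for initial_score in (16, 7):
    z = significance(initial_score, MEAN, STD_DEV)
    print(f"initial score {initial_score}: about {z:.2f} standard deviations above the mean")
```

With these rounded inputs the sketch gives roughly 13.08 and 4.67 for initial scores of 16 and 7. The report prints 13.09 and 4.66, presumably because it works from unrounded statistics.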



A 100% similar sequence to the query sequence was found:

                                                 Init.  Opt.
Sequence Name Description                 Length Score Score   Sig. Frame

 1. A42329  autotaxin - human (fragments)    114    16    16  13.09     0

The list of other best scores is: 



SQ Sequence 420 AA;
SQ 57 A; 32 R; 9 N; 24 D; 0 B; 8 C; 21 Q; 32 E; 0 Z; 41 G; 11 H;
SQ 13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 22 S; 19 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300
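
The "Residue Identity" figures in these alignment summaries are consistent with the number of matches divided by the number of aligned positions, with the percentage truncated rather than rounded. That reading is an inference from the printed numbers, not a documented FastDB formula, and the helper name below is hypothetical. A minimal Python sketch for gap-free alignments:

```python
def residue_identity_percent(matches: int, mismatches: int) -> int:
    """Percent identity over aligned positions, truncated to a whole number.

    Assumes a gap-free alignment, so aligned positions = matches + mismatches.
    """
    aligned = matches + mismatches
    if aligned <= 0:
        raise ValueError("alignment must contain at least one position")
    return (100 * matches) // aligned


# The two (matches, mismatches) pairs that occur in this report.
for matches, mismatches in ((7, 9), (6, 5)):
    pct = residue_identity_percent(matches, mismatches)
    print(f"Matches = {matches}, Mismatches = {mismatches}: {pct}%")
```

For the 16-residue query, 7 matches and 9 mismatches give 43%; for the 11-residue query, 6 matches and 5 mismatches give 54%. Both agree with the values printed in the report.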

> 0 < 

0| |0 IntelliGenetics

> 0 <

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_9p.res made by on Thu 22 Sep 94 10:23:45-PDT.

Query sequence being compared: US-08-249-182-9 (1-16)
Number of sequences searched: 70848
Number of scores above cutoff: 3804

Results of the initial comparison of US-08-249-182-9 (1-16) with:
Data bank : PIR 41, all entries 

[Histogram, upper part: y-axis NUMBER OF SEQUENCES on a logarithmic scale with ticks at 100000, 50000, 10000, 5000, 1000, 500 and 100; plotted points not recoverable]



DT 18-JAN-1991 (first entry) 

DE PE40ab protein comprising a portion of the Pseudomonas exotoxin A 

DE lacking cysteine residues at 265, 287, 372 and 379.

KW TGF-alpha-PE40; PE40ab; tumour; epidermal growth factor; EGF;

KW transforming growth factor-alpha; TGF-alpha.

OS Pseudomonas sp. 

PN EP-389043-A. 

PD 26-SEP-1990. 

PF 15-MAR-1990; 200613. 

PR 22-MAR-1989; US-327214. 

PA (MERI ) MERCK & CO INC.

PI Riemen MW, Stirdivant SM;

DR WPI; 90-291988/39. 

PT Modified PE40 by substitution with other amino acids for cysteine - 

PT improving specificity of targetting agent for tumour cells. 

PS Disclosure; Table 6; 21pp; English. 

CC By replacing cysteine residues at positions 265 and 287 and/or 372 

CC and 379, chemical ambiguities may be eliminated, and targeting

CC specificity for targeted agents of tumour cells eg. EGF or TGF-alpha

CC may be improved.

CC See also Q06127. 

SQ Sequence 420 AA; 

SQ 59 A; 32 R; 9 N; 24 D; 0 B; 6 C; 21 Q; 31 E; 0 Z; 42 G; 11 H;
SQ 13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

AAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 
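
Each SQ composition listing can be checked against the sequence length given on the ID line, because the per-residue counts should add up to that length. This check also helps when repairing scanned text, since a misread digit changes the total. The Python sketch below is illustrative only: the function name and the regular expression are assumptions, and the sample text is the composition of the 420-residue record shown above.

```python
import re


def composition_total(composition: str) -> int:
    """Sum the residue counts in a listing such as '59 A; 32 R; 9 N;'."""
    pairs = re.findall(r"(\d+)\s+([A-Z]);", composition)
    return sum(int(count) for count, _residue in pairs)


COMPOSITION = (
    "59 A; 32 R; 9 N; 24 D; 0 B; 6 C; 21 Q; 31 E; 0 Z; 42 G; 11 H; "
    "13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;"
)

total = composition_total(COMPOSITION)
print(f"composition total: {total}")
```

The counts in this listing sum to 420, which matches the "420 AA" stated for the record.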



15. US-08-249-182-9 (1-16) 

R06992 PE40aB protein comprising a portion of the Pseudom

ID R06992 standard; protein; 420 AA. 

AC R06992; 

DT 18-JAN-1991 (first entry) 

DE PE40aB protein comprising a portion of the Pseudomonas exotoxin A 

DE lacking cysteine residues at 265 and 287. 

KW TGF-alpha-PE40; PE40ab; tumour; epidermal growth factor; EGF; 

KW transforming growth factor-alpha; TGF-alpha.

OS Pseudomonas sp. 

PN EP-389043-A. 

PD 26-SEP-1990. 

PF 15-MAR-1990; 200613. 

PR 22-MAR-1989; US-327214. 

PA (MERI ) MERCK & CO INC.

PI Riemen MW, Stirdivant SM;

DR WPI; 90-291988/39. 

PT Modified PE40 by substitution with other amino acids for cysteine - 

PT improving specificity of targetting agent for tumour cells. 

PS Disclosure; Table 4; 21pp; English. 

CC By replacing cysteine residues at positions 265 and 287 and/or 372 

CC and 379, chemical ambiguities may be eliminated, and targeting

CC specificity for targeted agents of tumour cells eg. EGF or TGF-alpha 

CC may be improved. 

CC See also Q06127. 



190 200 210 220 230 X 240 250 



VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300



13. US-08-249-182-9 (1-16) 

R06447 TGF-57-Pseudomonas exotoxin 40 fusion protein.

ID R06447 standard; protein; 420 AA.
AC R06447;
DT 04-JAN-1991 (first entry)
DE TGF-57-Pseudomonas exotoxin 40 fusion protein.
KW Pseudomonas exotoxin-40 (PE40); protein targeting agent;
KW psoriasis treatment; anti-tumour agent;
FH Key Location/Qualifiers
FT Region 1..54
FT /label=residues -4 to +50 of TGF-alpha
FT Region 58..420
FT /label=residues +252 to +613 of PE
PN EP-383599-A.
PD 22-AUG-1990.
PF 15-FEB-1990; 301639.
PR 17-FEB-1989; US-312540.
PR 03-AUG-1989; US-389092.
PR 21-DEC-1989; US-449187.
PA (MERI ) MERCK & CO INC.
PI Oliff A, Jones DD, Edwards GM;
DR WPI; 90-255832/34.
DR N-PSDB; Q05666.
PT Modified pseudomonas exotoxin hybrid proteins - has at least 2
PT cysteine residues replaced or deleted to improve binding to
PT receptors.
PS Example ; Table 2; 20pp; English.
CC Modified pseudomonas exotoxin (PE40) linked to
CC 5' portion of transforming growth factor (TGF)-alpha as a targeting
CC agent. The corresponding nucleotide sequence was constructed from
CC a synthetic oligonucleotide encoding the 5' portion of TGF-alpha
CC and linked to PE40 and a linker cassette called "cassette 57". The
CC recombinant plasmid was used to transform E.coli JM109 cells.
CC The hybrid protein can bind and kill tumour cells or keratinocytes
CC possessing TGF receptors for treatment of psoriasis or warts.
CC See also R06448-R06450
SQ Sequence 420 AA;
SQ 55 A; 32 R; 9 N; 24 D; 0 B; 10 C; 21 Q; 31 E; 0 Z; 42 G; 11 H;
SQ 13 I; 42 L; 4 K; 2 M; 13 F; 28 P; 22 S; 19 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 



14. US-08-249-182-9 (1-16) 

R06994 PE40ab protein comprising a portion of the Pseudom 



ID R06994 standard; protein; 420 AA.



SQ Sequence 420 AA;
SQ 58 A; 32 R; 9 N; 24 D; 0 B; 8 C; 21 Q; 30 E; 0 Z; 43 G; 11 H;
SQ 12 I; 42 L; 4 K; 2 M; 13 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

AAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 



12. US-08-249-182-9 (1-16)

R06448 TGF-alpha-PE40-aB modified pseudomonas exotoxin hy

ID R06448 standard; protein; 420 AA.
AC R06448;
DT 04-JAN-1991 (first entry)
DE TGF-alpha-PE40-aB modified pseudomonas exotoxin hybrid protein.
KW Pseudomonas exotoxin-40 (PE40); protein targeting agent;
KW psoriasis treatment; anti-tumour agent; TGF-alpha-PE40-aB;
FH Key Location/Qualifiers
FT Region 1..54
FT /label=residues -4 to +50 of TGF-alpha
FT Region 58..420
FT /label=residues +252 to +613 of PE
FT /note= "residues 265, 272 and 287 are modified"
PN EP-383599-A.
PD 22-AUG-1990.
PF 15-FEB-1990; 301639.
PR 17-FEB-1989; US-312540.
PR 03-AUG-1989; US-389092.
PR 21-DEC-1989; US-449187.
PA (MERI ) MERCK & CO INC.
PI Oliff A, Jones DD, Edwards GM;
DR WPI; 90-255832/34.
PT Modified pseudomonas exotoxin hybrid proteins - has at least 2
PT cysteine residues replaced or deleted to improve binding to
PT receptors.
PS Example ; Table 4; 20pp; English.
CC Modified pseudomonas exotoxin (PE40) linked to
CC 5' portion of transforming growth factor (TGF)-alpha as a targeting
CC agent. Three site-specific mutations have been introduced c.f wild-
CC type PE40. The Cys residues at positions 265 and 287 of the
CC exotoxin have been replaced by Ala residues. Phe at position 272
CC has been replaced with Leu. These changes improve receptor binding.
CC The hybrid protein can bind and kill tumour cells or keratinocytes
CC possessing TGF receptors for treatment of psoriasis or warts.
CC See also R06447 and R06449-R06450
SQ Sequence 420 AA;
SQ 57 A; 32 R; 9 N; 24 D; 0 B; 8 C; 21 Q; 32 E; 0 Z; 41 G; 11 H;
SQ 13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 22 S; 19 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||



CC 5' portion of transforming growth factor (TGF)-alpha as a targeting
CC agent. Three site-specific mutations have been introduced c.f wild-
CC type PE40 in each of the two loci A and B. The Cys residues at
CC positions 265, 287, 372 and 379 of the exotoxin have been replaced
CC by Ala residues. Phe at position 272 has been replaced with Leu;
CC Ser at position 369 of PE is replaced by Thr. These changes improve
CC receptor binding.
CC The hybrid protein can bind and kill tumour cells or keratinocytes
CC possessing TGF receptors for treatment of psoriasis or warts.
CC See also R06447-R06449.
SQ Sequence 420 AA;
SQ 59 A; 32 R; 9 N; 24 D; 0 B; 6 C; 21 Q; 31 E; 0 Z; 42 G; 11 H;
SQ 13 I; 42 L; 4 K; 2 M; 12 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;
SQ 1 Others;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

AAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPAXAYGYAQDQEPDARGRIR
260 270 280 290 300



11. US-08-249-182-9 (1-16) 

R06449 TGF-alpha-PE40-Ab modified pseudomonas exotoxin hy

ID R06449 standard; protein; 420 AA.
AC R06449;
DT 04-JAN-1991 (first entry)
DE TGF-alpha-PE40-Ab modified pseudomonas exotoxin hybrid protein.
KW Pseudomonas exotoxin-40 (PE40); protein targeting agent;
KW psoriasis treatment; anti-tumour agent; TGF-alpha-PE40-Ab;
FH Key Location/Qualifiers
FT Region 1..54
FT /label=residues -4 to +50 of TGF-alpha
FT Region 58..420
FT /label=residues +252 to +613 of PE
FT /note="residues 369, 372 and 379 are modified"
PN EP-383599-A.
PD 22-AUG-1990.
PF 15-FEB-1990; 301639.
PR 17-FEB-1989; US-312540.
PR 03-AUG-1989; US-389092.
PR 21-DEC-1989; US-449187.
PA (MERI ) MERCK & CO INC.
PI Oliff A, Jones DD, Edwards GM;
DR WPI; 90-255832/34.
PT Modified pseudomonas exotoxin hybrid proteins - has at least 2
PT cysteine residues replaced or deleted to improve binding to
PT receptors.
PS Example ; Table 5; 20pp; English.
CC Modified pseudomonas exotoxin (PE40) linked to
CC 5' portion of transforming growth factor (TGF)-alpha as a targeting
CC agent. Three site-specific mutations have been introduced c.f wild-
CC type PE40. The Cys residues at positions 372 and 379 of the
CC exotoxin have been replaced by Ala residues. Ser at position 369
CC has been replaced with Thr. These changes improve receptor binding.
CC The hybrid protein can bind and kill tumour cells or keratinocytes

CC possessing TGF receptors for treatment of psoriasis or warts. 

CC See also R06447-R06448 and R06450 



PI Ahern J, Heimbrook DC, Oliff AI, Stirdivant SM;
DR WPI; 92-026359/04.
PT Treatment of bladder cancer using hybrid protein - comprising
PT cell targetting agent e.g. EGF that binds to EGF receptor on
PT tumour cells and PE40 cell toxin
PS Disclosure; Page 18-19; 34pp; English.
CC The modified PE40 domains of the hybrid proteins have two or four of
CC the Cys residues (designated Cys265, Cys287, Cys372 and Cys372)
CC substituted with neutral amino acids, e.g. Gly, Ala, or Phe.
CC TGF-alpha-PE40aB (R20199) has Cys265 and Cys287 replaced;
CC TGF-alpha-PE40Ab (R20200) has Cys372 and Cys379 replaced; and
CC TGF-alpha-PE40ab (R20201) has all four replaced.
CC The modified hybrid proteins were produced in E.coli transformed
CC with TAC expression vectors. Site specific mutations were introduced
CC to the unmodified TGF-alpha-PE40 gene cloned in pTACTGF57-PE40.
CC The mol. efficiently targets receptors on human bladder tumour cells
CC (the modified PE40 domain has improved receptor binding) and is
CC used for selectively killing bladder tumour cells.
SQ Sequence 420 AA;
SQ 57 A; 32 R; 9 N; 24 D; 0 B; 8 C; 22 Q; 30 E; 0 Z; 42 G; 11 H;
SQ 13 I; 42 L; 4 K; 2 M; 13 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

AAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 



10. US-08-249-182-9 (1-16) 

R06450 TGF-alpha-PE40-ab modified pseudomonas exotoxin hy 

ID R06450 standard; protein; 420 AA. 

AC R06450; 

DT 04-JAN-1991 (first entry) 

DE TGF-alpha-PE40-ab modified pseudomonas exotoxin hybrid protein. 

KW Pseudomonas exotoxin-40 (PE40); protein targeting agent;
KW psoriasis treatment; anti-tumour agent; TGF-alpha-PE40-ab;
FH Key Location/Qualifiers
FT Region 1..54
FT /label=residues -4 to +50 of TGF-alpha
FT Region 58..420
FT /label=residues +252 to +613 of PE
FT /note= "residues 265, 272, 287, 369, 372 and 379
FT are modified"

PN EP-383599-A. 

PD 22-AUG-1990. 

PF 15-FEB-1990; 301639. 

PR 17-FEB-1989; US-312540. 

PR 03-AUG-1989; US-389092. 

PR 21-DEC-1989; US-449187. 

PA (MERI ) MERCK & CO INC. 

PI Oliff A, Jones DD, Edwards GM;

DR WPI; 90-255832/34. 

PT Modified pseudomonas exotoxin hybrid proteins - has at least 2 

PT cysteine residues replaced or deleted to improve binding to 

PT receptors. 

PS Example ; Table 6; 20pp; English. 

CC Modified pseudomonas exotoxin (PE40) linked to 



ID R07054 standard; protein; 419 AA. 

AC R07054;
DT 18-JAN-1991 (first entry)
DE PE40AB protein comprising a portion of the Pseudomonas exotoxin A.
KW TGF-alpha-PE40; PE40ab; tumour; epidermal growth factor; EGF;
KW transforming growth factor-alpha; TGF-alpha.
OS Pseudomonas sp.
PN EP-389043-A.
PD 26-SEP-1990.
PF 15-MAR-1990; 200613.
PR 22-MAR-1989; US-327214.
PA (MERI ) MERCK & CO INC.
PI Riemen MW, Stirdivant SM;
DR WPI; 90-291988/39.
DR N-PSDB; Q06127.
PT Modified PE40 by substitution with other amino acids for cysteine -
PT improving specificity of targetting agent for tumour cells.
PS Disclosure; Table 3; 21pp; English.
CC By replacing cysteine residues at positions 265 and 287 and/or 372
CC and 379, chemical ambiguities may be eliminated, and targeting
CC specificity for targeted agents of tumour cells eg. EGF or TGF-alpha
CC may be improved.
SQ Sequence 419 AA;
SQ 55 A; 31 R; 9 N; 24 D; 0 B; 10 C; 20 Q; 32 E; 0 Z; 42 G; 11 H;
SQ 13 I; 42 L; 4 K; 2 M; 13 F; 28 P; 22 S; 19 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 



9. US-08-249-182-9 (1-16) 

R20200 TGF-alpha-PE40Ab. 

ID R20200 standard; Protein; 420 AA.
AC R20200;
DT 16-APR-1992 (first entry)
DE TGF-alpha-PE40Ab.
KW Pseudomonas exotoxin; bladder; mutant; target; receptor binding.
FH Key Location/Qualifiers
FT Region 5..54
FT /label= TGFalpha1-50
FT Region 59..420
FT /label= PE252-613
FT Misc_difference 176
FT /note= "Ser -> Thr"
FT Misc_difference 179
FT /note= "Cys -> Ala"
FT Misc_difference 186
FT /note= "Cys -> Ala"
PN EP-467536-A.
PD 22-JAN-1992.
PF 20-JUN-1991; 305582.
PR 21-JUN-1990; US-542281.
PR 14-MAR-1991; US-669269.

PA (MERI ) MERCK & CO INC. 



260 270 280 290 300



7. US-08-249-182-9 (1-16) 

R20199 TGF-alpha-PE40aB. 

ID R20199 standard; Protein; 419 AA. 

AC R20199;
DT 16-APR-1992 (first entry)
DE TGF-alpha-PE40aB.
KW Pseudomonas exotoxin; bladder; mutant; target; receptor binding.
FH Key Location/Qualifiers
FT Region 5..54
FT /label= TGFalpha1-50
FT Region 58..419
FT /label= PE252-613
FT Misc_difference 71
FT /note= "Cys -> Ala"
FT Misc_difference 78
FT /note= "Phe -> Leu"
FT Misc_difference 93
FT /note= "Cys -> Ala"
PN EP-467536-A.
PD 22-JAN-1992.
PF 20-JUN-1991; 305582.
PR 21-JUN-1990; US-542281.
PR 14-MAR-1991; US-669269.
PA (MERI ) MERCK & CO INC.
PI Ahern J, Heimbrook DC, Oliff AI, Stirdivant SM;
DR WPI; 92-026359/04.
PT Treatment of bladder cancer using hybrid protein - comprising
PT cell targetting agent e.g. EGF that binds to EGF receptor on
PT tumour cells and PE40 cell toxin
PS Disclosure; Page 16-17; 34pp; English.
CC The modified PE40 domains of the hybrid proteins have two or four of
CC the Cys residues (designated Cys265, Cys287, Cys372 and Cys372)
CC substituted with neutral amino acids, e.g. Gly, Ala, or Phe.
CC TGF-alpha-PE40aB (R20199) has Cys265 and Cys287 replaced;
CC TGF-alpha-PE40Ab (R20200) has Cys372 and Cys379 replaced; and
CC TGF-alpha-PE40ab (R20201) has all four replaced.
CC The modified hybrid proteins were produced in E.coli transformed
CC with TAC expression vectors. Site specific mutations were introduced
CC to the unmodified TGF-alpha-PE40 gene cloned in pTACTGF57-PE40.
CC The mol. efficiently targets receptors on human bladder tumour cells
CC (the modified PE40 domain has improved receptor binding) and is
CC used for selectively killing bladder tumour cells.
SQ Sequence 419 AA;
SQ 56 A; 32 R; 9 N; 24 D; 0 B; 8 C; 21 Q; 32 E; 0 Z; 41 G; 11 H;
SQ 13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 22 S; 19 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
260 270 280 290 300 



8. US-08-249-182-9 (1-16) 

R07054 PE40AB protein comprising a portion of the Pseudom 



6. US-08-249-182-9 (1-16) 

R20201 TGF-alpha-PE40ab. 



ID R20201 standard; Protein; 419 AA.

AC R20201; 

DT 16-APR-1992 (first entry) 

DE TGF-alpha-PE40ab. 

KW Pseudomonas exotoxin; bladder; mutant; target; receptor binding. 

FH Key Location/Qualifiers 

FT Region 5..54
FT /label= TGFalpha1-50
FT Region 58..419
FT /label= PE252-613
FT Misc_difference 71
FT /note= "Cys -> Ala"
FT Misc_difference 78
FT /note= "Phe -> Leu"
FT Misc_difference 93
FT /note= "Cys -> Ala"
FT Misc_difference 175
FT /note= "Ser -> Thr"
FT Misc_difference 178
FT /note= "Cys -> Ala"
FT Misc_difference 185
FT /note= "Cys -> Ala"

PN EP-467536-A. 

PD 22-JAN-1992. 

PF 20-JUN-1991; 305582. 

PR 21-JUN-1990; US-542281.
PR 14-MAR-1991; US-669269.
PA (MERI ) MERCK & CO INC.
PI Ahern J, Heimbrook DC, Oliff AI, Stirdivant SM;
DR WPI; 92-026359/04.
PT Treatment of bladder cancer using hybrid protein - comprising
PT cell targetting agent e.g. EGF that binds to EGF receptor on
PT tumour cells and PE40 cell toxin
PS Disclosure; Page 20-21; 34pp; English.
CC The modified PE40 domains of the hybrid proteins have two or four of
CC the Cys residues (designated Cys265, Cys287, Cys372 and Cys372)
CC substituted with neutral amino acids, e.g. Gly, Ala, or Phe.

CC TGF-alpha-PE40aB (R20199) has Cys265 and Cys287 replaced; 

CC TGF-alpha-PE40Ab (R20200) has Cys372 and Cys379 replaced; and 

CC TGF-alpha-PE40ab (R20201) has all four replaced. 

CC The modified hybrid proteins were produced in E.coli transformed 

CC with TAC expression vectors. Site specific mutations were introduced 

CC to the unmodified TGF-alpha-PE40 gene cloned in pTACTGF57-PE40. 

CC The mol. efficiently targets receptors on human bladder tumour cells 

CC (the modified PE40 domain has improved receptor binding) and is 

CC used for selectively killing bladder tumour cells. 

SQ Sequence 419 AA; 

SQ 58 A; 32 R; 9 N; 24 D; 0 B; 6 C; 21 Q; 31 E; 0 Z; 42 G; 11 H;
SQ 13 I; 43 L; 4 K; 2 M; 12 F; 28 P; 21 S; 20 T; 5 W; 12 Y; 25 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

AAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
190 200 210 220 230 X 240 250 

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR 



CC of cells for therapeutic, cytotoxic, diagnostic or other purposes.
CC See also R21436.7.
SQ Sequence 361 AA;
SQ 44 A; 30 R; 9 N; 19 D; 0 B; 4 C; 20 Q; 27 E; 0 Z; 38 G; 6 H;
SQ 13 I; 38 L; 5 K; 0 M; 11 F; 26 P; 18 S; 19 T; 6 W; 11 Y; 17 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
130 140 150 160 170 180 190 X

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
200 210 220 230 240 



5. US-08-249-182-9 (1-16) 

R04572 ORF4 product from the mos gene.

ID R04572 standard; protein; 407 AA.
AC R04572;
DT 14-SEP-1990 (first entry)
DE ORF4 product from the mos gene.
KW Rhizopine; mos gene; moc gene; nitrogen fixation; Medicago sativa.
OS Rhizobium meliloti strain L5-30.
PN AU8941262-A.
PD 15-MAR-1990.
PF 08-SEP-19S8; A41262.
PR 08-SEP-1988; AU-000328.
PA (LUMI-) Luminis PTY Ltd.
PI Tenp J, Kondorosi A, Putnoky P, Murphy PJ, Schell JS, De Bruijn FJ.
DR WPI; 90-139827/19.
DR N-PSDB; Q04303.
PT Bacteria contg. genes for rhizopine synthesis and catabolism - esp.
PT Rhizobium strains for increasing nitrogen fixation and growth in
PT leguminous plants.
PS Disclosure; p; English.
CC The mos ORF 4 product is a protein of a predicted size of 35.8kD.
CC Rhizobium strains, eg RM1021, containing the full mos gene and a moc gene
CC to catabolise rhizopine compounds are used to increase symbiotic nitrogen
CC fixation in Leguminaceae, esp. alfalfa. Where a moc gene is
CC present in separate bacteria both N-fixation and plant growth can be
CC promoted. Alternatively, mos genes are expressed in the plant and only
CC moc in the bacteria; this will cause desirable soil bacteria (eg being
CC used for biological control of a pathogen) to be held in the rhizosphere.
CC See also R04569-72.
SQ Sequence 407 AA;
SQ 46 A; 20 R; 11 N; 10 D; 0 B; 8 C; 6 Q; 11 E; 0 Z; 42 G; 11 H;
SQ 27 I; 49 L; 10 K; 17 M; 27 F; 16 P; 35 S; 21 T; 3 W; 6 Y; 31 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
|| || | ||

GCRALIAISGSLASAGMTTALFQPSFAGKLAGFALLGLGMANLVPIIFSEAASMNTVSKTVGLTFVSVCGYS
290 300 310 320 330 340 350 X 



GFLVGPPIIGASRRPLGSGELCSSSFAVGVIVACASVFFDRHRS 
360 370 380 390 400 



CC residues 365-378 of PE-40, in the N-terminus of domain Ib, which
CC has ADP-ribosylation activity and is involved in protein synthesis.
CC A chimeric gene, encoding the chimeric toxin, constructed in plasmid
CC pWD154 was found to have similar ADP-ribosylation activity to the
CC wild-type PE-40. Thus foreign amino-acid sequences can be
CC introduced into the toxin while keeping intact the membrane
CC translocation and cytotoxic activities. The implications are that
CC chimeric proteins can be used to introduce foreign proteins such as
CC enzymes, growth factors, lymphokines or antibodies into the cytosol
CC of cells for therapeutic, cytotoxic, diagnostic or other purposes.
CC See also R21435.7.
SQ Sequence 361 AA;
SQ 43 A; 30 R; 9 N; 19 D; 0 B; 3 C; 19 Q; 27 E; 0 Z; 38 G; 6 H;
SQ 13 I; 38 L; 5 K; 0 M; 11 F; 26 P; 21 S; 19 T; 6 W; 11 Y; 17 V;

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X
VNSMQTVFVGYGPTFK
||||| ||

CAGPADSGDALLERNYPTGAEFLGDGGDVSFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
130 140 150 160 170 180 190 X

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR
200 210 220 230 240 



4. US-08-249-182-9 (1-16) 

R21435 PE-40 somatostatin substituted protein. 

ID R21435 standard; protein; 361 AA. 

AC R21435; 

DT 16-JUN-1992 (first entry) 

DE PE-40 somatostatin substituted protein. 

KW Pseudomonas exotoxin-40; chimera; translocation; therapeutic;
KW cytotoxic; diagnostic.
OS Pseudomonas.
FH Key Location/Qualifiers
FT Modified_site 113..126
FT /note= "modified somatostatin substitution,
FT see comments"
PN US7663455-A.
PD 21-JAN-1992.
PF 04-MAR-1991; 663455.
PR 04-MAR-1991; US-663455.
PA (USSH ) US DEPT HEALTH & HUMAN.
PI Pastan IH;
DR WPI; 92-079704/10.
PT Recombinant chimeric proteins for diagnosis - contg. trans
PT locating segment delivering foreign protein into cytosol of
PT target cells
PS Disclosure; Page 5; 45pp; English.
CC The sequence shows residues 252 to 613 of Pseudomonas exotoxin 40,
CC ( retrieved from patent EP-383599-A ) substituted at residue 113 by
CC a synthetic somatostatin 14 molecule. This substitution is between
CC residues 365-378 of PE-40, in the N-terminus of domain Ib, which
CC has ADP-ribosylation activity and is involved in protein synthesis.
CC A chimeric gene, encoding the chimeric toxin, constructed in plasmid
CC pWD150 was found to have similar ADP-ribosylation activity to the
CC wild-type PE-40. Thus foreign amino-acid sequences can be
CC introduced into the toxin while keeping intact the membrane
CC translocation and cytotoxic activities. The implications are that
CC chimeric proteins can be used to introduce foreign proteins such as
CC enzymes, growth factors, lymphokines or antibodies into the cytosol



ri rastan ifii 

DR HPI; 92-079704/10. 

PT Recombinant chineric proteins for diagnosis - contg. trans 

PT locating segment delivering foreign protein into cytosol of 

PT target cells 

PS Disclosure; Page 5? 45pp; English. 

CC The sequence shews residues 252 to 613 of Pseudomonas exotoxin 40r 

CC ( retrieved from patent EP-383599-A ) substituted at residue 113 by 

CC a synthetic methionine rich peptide. This substitution is between 

CC residues 365-380 of PE-40, in the N-terminus of domain Ib, which

CC has ADP-ribosylation activity and is involved in protein synthesis.

CC A chimeric gene, encoding the chimeric toxin, constructed in plasmid

CC pWD163 was found to have similar ADP-ribosylation activity to the

CC wild-type PE-40. Thus foreign amino-acid sequences can be

CC introduced into the toxin while keeping intact the membrane

CC translocation and cytotoxic activities. The implications are that

CC chimeric proteins can be used to introduce foreign proteins such as

CC enzymes, growth factors, lymphokines or antibodies into the cytosol

CC of cells for therapeutic, cytotoxic, diagnostic or other purposes.

CC See also R21435,7.

SQ Sequence 361 AA;

SQ 42 A; 30 R; 8 N; 20 D; 0 B; 4 C; 19 Q; 27 E; 0 Z; 38 G; 6 H;

SQ 13 I; 38 L; 3 K; 9 M; 8 F; 27 P; 18 S; 18 T; 5 W; 11 Y; 17 V;
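The composition lines can be sanity-checked against the declared sequence length. The sketch below is illustrative only: it assumes the 22 counts are listed in the order A R N D B C Q E Z G H I L K M F P S T W Y V, and that the two counts that did not scan legibly (N and F) are unknown. It shows only that the legible counts leave 16 residues to be shared between N and F, which is consistent with 8 N and 8 F but does not prove it.

```python
# Residue order assumed for the two composition lines of the 361-AA entry.
AA_ORDER = "ARNDBCQEZGHILKMFPSTWYV"

# Counts as read from the printout; None marks the two illegible values.
COUNTS = [42, 30, None, 20, 0, 4, 19, 27, 0, 38, 6,
          13, 38, 3, 9, None, 27, 18, 18, 5, 11, 17]

DECLARED_LENGTH = 361  # from the "Sequence 361 AA;" line


def missing_residue_total(counts, declared_length):
    """Return how many residues the illegible counts must add up to."""
    legible = sum(c for c in counts if c is not None)
    return declared_length - legible


def fill_missing(counts, guesses):
    """Fill the None slots, in order, with the supplied guesses."""
    guesses = iter(guesses)
    return [c if c is not None else next(guesses) for c in counts]


missing = missing_residue_total(COUNTS, DECLARED_LENGTH)
filled = dict(zip(AA_ORDER, fill_missing(COUNTS, [8, 8])))
print(missing, sum(filled.values()))
```

Any other split of the 16 residues between N and F would pass the same check, so the length test narrows the reading without settling it.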

Initial Score = 7 Optimized Score = 7 Significance = 4.66
Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
||||| ||

CMGPADSGDALLERNYPTEAEFLGDGGDVSFSTRGTQNHTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSI
130 140 150 160 170 180 190 X 

VFGGVRARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIR 
200 210 220 230 240 
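The match, mismatch and identity figures printed with each alignment can be reproduced for an ungapped alignment by comparing the query with the equally long window of the database sequence that it lines up against. The following is a minimal sketch, not FastDB's own code. The 16-residue PE-40 window is read off the alignment above and the function name is hypothetical. Truncating the percentage (rather than rounding) is an assumption; it agrees with the 43% and 93% printed in this listing.

```python
def ungapped_identity(query, window):
    """Count matches and mismatches between two equal-length strings.

    Returns (matches, mismatches, percent_identity), with the percentage
    truncated to a whole number.
    """
    if len(query) != len(window):
        raise ValueError("query and window must have the same length")
    matches = sum(q == t for q, t in zip(query, window))
    mismatches = len(query) - matches
    percent = (100 * matches) // len(query)
    return matches, mismatches, percent


QUERY = "VNSMQTVFVGYGPTFK"       # US-08-249-182-9 (1-16)
PE40_WINDOW = "LEERGYVFVGYHGTFL"  # PE-40 residues under the query (assumed)
ATX101 = "VNSHQTVFVGYGPTFK"       # autotaxin peptide ATX 101 as printed

print(ungapped_identity(QUERY, PE40_WINDOW))
print(ungapped_identity(QUERY, ATX101))
```

For the PE-40 window this gives 7 matches and 9 mismatches; for ATX 101 it gives 15 matches and 1 mismatch.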



3. US-08-249-182-9 (1-16) 

R21436 PE-40 somatostatin substituted protein. 

ID R21436 standard; Protein; 361 AA.

AC R21436;

DT 16-JUN-1992 (first entry)

DE PE-40 somatostatin substituted protein.

KW Pseudomonas exotoxin-40; chimera; translocation; therapeutic;

KW cytotoxic; diagnostic.

OS Pseudomonas.

FH Key Location/Qualifiers

FT Modified_site 113..126

FT /note= "modified somatostatin substitution,

FT see comments"

PN US7663455-A.

PD 21-JAN-1992.

PF 04-MAR-1991; 663455.

PR 04-MAR-1991; US-663455.

PA (USSH ) US DEPT HEALTH & HUMAN.

PI Pastan IH;

DR WPI; 92-079704/10.

PT Recombinant chimeric proteins for diagnosis - contg. translocating

PT segment delivering foreign protein into cytosol of

PT target cells

PS Disclosure; Page 5; 45pp; English.

CC The sequence shows residues 252 to 613 of Pseudomonas exotoxin 40,

CC ( retrieved from patent EP-383599-A ) substituted at residue 113 by 

CC a synthetic somatostatin 14 molecule. This substitution is between 






1. US-08-249-182-9 (1-16) 

R37451 Autotaxin peptide ATX 101. 

ID R37451 standard; peptide; 16 AA.

AC R37451;

DT 22-JUL-1993 (first entry)

DE Autotaxin peptide ATX 101.

KW Cell motility stimulating; cancer metastasis; antibody; detection;

KW immunostains; disease outcome prediction; therapy choice;

KW cancer therapy; crosslinked toxins.

OS Synthetic.

PN US7822043-A.

PD 01-JAN-1993.

PF 17-JAN-1992; 822043.

PR 17-JAN-1992; US-822043.

PA (USSH ) US DEPT HEALTH & HUMAN SERVICE.

PI Krutzsch H, Liotta LA, Schiffmann E, Stracke M;

DR WPI; 93-085861/10.

PT Motility stimulating protein named autotaxin - useful in cancer

PT diagnosis and therapy

PS Example; Page 33; 36pp; English.

CC The sequence is that of autotaxin peptide ATX 101. It may be used to

CC raise anti-autotaxin antibodies which can be used to diagnose cancer

CC metastasis and in immunostains of patient samples to detect the

CC presence of autotaxin. The level of autotaxin in tissue or body

CC fluids can be used to predict disease outcomes and/or choice of

CC therapy which may also include autotaxin inhibitors. Autotaxin

CC antibodies can be crosslinked to toxins (e.g. ricin A) for cancer 

CC therapy. 

SQ Sequence 16 AA;

SQ 0 A; 0 R; 1 N; 0 D; 0 B; 0 C; 1 Q; 0 E; 0 Z; 2 G; 1 H;

SQ 0 I; 0 L; 1 K; 0 M; 2 F; 1 P; 1 S; 2 T; 0 W; 1 Y; 3 V;

Initial Score = 15 Optimized Score = 15 Significance = 10.88
Residue Identity = 93% Matches = 15 Mismatches = 1
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK

||| ||||||||||||

VNSHQTVFVGYGPTFK
X 10 X



2. US-08-249-182-9 (1-16)

R21437 PE-40 protein contg. a methionine rich peptide.

ID R21437 standard; Protein; 361 AA.
AC R21437;

DT 16-JUN-1992 (first entry)

DE PE-40 protein contg. a methionine rich peptide.

KW Pseudomonas exotoxin-40; chimera; translocation; therapeutic;

KW cytotoxic; diagnostic.

OS Pseudomonas.

FH Key Location/Qualifiers

FT Modified_site 113..128

FT /note= "methionine rich peptide substitution,

FT see comments"

PN US7663455-A.

PD 21-JAN-1992.

PF 04-MAR-1991; 663455.

PR 04-MAR-1991; US-663455.

PA (USSH ) US DEPT HEALTH & HUMAN.



Times: CPU Total Elapsed

00:00:25.98 00:00:26.00

Number of residues: 5287517
Number of sequences searched: 42145
Number of scores above cutoff: 3752

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 
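The printout does not spell out how significance is derived. If it is assumed to be the number of standard deviations by which an initial score exceeds the mean score of the search, the mean and standard deviation can be back-calculated from any two (score, significance) pairs in the listing. The sketch below does this for the pairs (15, 10.88) and (7, 4.66) reported for this query. The formula is an assumption, and the function names are hypothetical.

```python
def fit_mean_sd(pair_a, pair_b):
    """Solve significance = (score - mean) / sd from two observations."""
    (score_a, sig_a), (score_b, sig_b) = pair_a, pair_b
    sd = (score_a - score_b) / (sig_a - sig_b)
    mean = score_a - sig_a * sd
    return mean, sd


def significance(score, mean, sd):
    """Standard deviations above the mean, under the assumed formula."""
    return (score - mean) / sd


mean, sd = fit_mean_sd((15, 10.88), (7, 4.66))
print(round(mean, 3), round(sd, 3))
print(round(significance(7, mean, sd), 2))
```

Under that assumption the search would have had a mean initial score near 1.0 and a standard deviation near 1.29. These are inferred values, not figures taken from the search statistics.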

A 100% identical sequence to the query sequence was not found.



The list of best scores is:

                                                 Init.  Opt.
Sequence Name  Description                Length Score Score   Sig. Frame

            **** 10 standard deviations above mean ****
 1. R37451  Autotaxin peptide ATX 101.      16    15    15  10.88     0
            **** 4 standard deviations above mean ****
 2. R21437  PE-40 protein contg. a methio  361     7     7   4.66     0
 3. R21436  PE-40 somatostatin substitute  361     7     7   4.66     0
 4. R21435  PE-40 somatostatin substitute  361     7     7   4.66     0
 5. R04572  ORF4 product from the nos gen  407     7     7   4.66     0
 6. R20201  TGF-alpha-PE40ab.              419     7     7   4.66     0
 7. R20199  TGF-alpha-PE40aB.              419     7     7   4.66     0
 8. R07054  PE40AB protein comprising a p  419     7     7   4.66     0
 9. R20200  TGF-alpha-PE40Ab.              420     7     7   4.66     0
10. R06450  TGF-alpha-PE40-ab modified ps  420     7     7   4.66     0
11. R06449  TGF-alpha-PE40-Ab modified ps  420     7     7   4.66     0
12. R06448  TGF-alpha-PE40-aB modified ps  420     7     7   4.66     0
13. R06447  TGF-57-Pseudomonas exotoxin 4  420     7     7   4.66     0
14. R06994  PE40ab protein comprising a p  420     7     7   4.66     0
15. R06992  PE40aB protein comprising a p  420     7     7   4.66     0
16. R06993  PE40Ab protein comprising a p  420     7     7   4.66     0
17. R04934  Immunotoxin hybrid of human i  496     7     7   4.66     0
18. P70141  Sequence of a region of the S  537     7     7   4.66     0
19. R04920  Immunoprotein PEX46.           549     7     7   4.66     0
20. R04923  Immunoprotein TANG11.          557     7     7   4.66     0
21. P50501  Sequence of bovine parainflue  558     7     7   4.66     0
22. R31957  Sequence encoded by parainflu  572     7     7   4.66     0
23. R06023  Viral haemagglutinin neuramin  572     7     7   4.66     0
24. P94800  Parainfluenzae-3 gene product  572     7     7   4.66     0
25. P94799  Perdue strain of transmissabl  572     7     7   4.66     0
26. R04919  Immunoprotein PEX45.           574     7     7   4.66     0
27. R04924  Immunoprotein TANG12.          577     7     7   4.66     0
28. R40113  Pseudomonas exotoxin (S245C).  613     7     7   4.66     0
29. R40112  Pseudomonas exotoxin (K223C).  613     7     7   4.66     0
30. R40111  Pseudomonas exotoxin (S192C).  613     7     7   4.66     0
31. R40110  Pseudomonas exotoxin (S188C).  613     7     7   4.66     0
32. R40109  Pseudomonas exotoxin (R182C).  613     7     7   4.66     0
33. R40108  Pseudomonas exotoxin (S158C).  613     7     7   4.66     0
34. R40107  Pseudomonas exotoxin (S96C).   613     7     7   4.66     0
35. R40106  Pseudomonas exotoxin (S88C).   613     7     7   4.66     0
36. R40105  Pseudomonas exotoxin (S25C).   613     7     7   4.66     0
37. R40104  Pseudomonas exotoxin (K20C).   613     7     7   4.66     0
38. R40102  Pseudomonas exotoxin for site  613     7     7   4.66     0
39. R26983  (FRP51)-ETA fusion protein.    637     7     7   4.66     0



[Histogram of initial scores. Vertical axis: number of sequences, logarithmic, with ticks at 5, 10, 50, 100, 500, 1000, 5000 and 10000. Horizontal axes: SCORE 0 to 15 and STDEV 0 to 9. The plotted points are not recoverable.]



PARAMETERS

Similarity matrix        Unitary     K-tuple                  2
Mismatch penalty               1     Joining penalty         20
Gap penalty                 1.00     Window size              5
Gap size penalty            0.05
Cutoff score                   0
Randomization group            0

Initial scores to save        40     Alignments to save      15
Optimized scores to save       0     Display context         50
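A K-tuple setting of 2 suggests that the initial comparison looks for two-residue words shared by the query and each database sequence. The printout does not describe the program's internals, so the following is only a generic sketch of k-tuple matching in the style of word-based search tools. It indexes the words of the target, then counts for each diagonal (target position minus query position) how many query words land on it. The function name and the target fragment, which is taken from the PE-40 alignment, are illustrative assumptions.

```python
from collections import Counter, defaultdict


def best_diagonal(query, target, k=2):
    """Return (diagonal, hits) for the diagonal with the most shared k-tuples.

    A diagonal is target_position - query_position (0-based). Returns None
    when the two sequences share no k-tuple.
    """
    # Index every k-tuple of the target by the positions where it occurs.
    index = defaultdict(list)
    for t_pos in range(len(target) - k + 1):
        index[target[t_pos:t_pos + k]].append(t_pos)

    # Each shared k-tuple votes for the diagonal it lies on.
    votes = Counter()
    for q_pos in range(len(query) - k + 1):
        for t_pos in index.get(query[q_pos:q_pos + k], ()):
            votes[t_pos - q_pos] += 1

    return votes.most_common(1)[0] if votes else None


QUERY = "VNSMQTVFVGYGPTFK"
TARGET_FRAGMENT = "RQLEERGYVFVGYHGTFLEAAQSI"  # part of the PE-40 hit (assumed)

print(best_diagonal(QUERY, TARGET_FRAGMENT, k=2))
```

In this example the shared words VF, FV, VG, GY and TF all fall on the same diagonal, which is the offset at which the printed alignment places the query.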



SEARCH STATISTICS 



Scores: 



Mean Median Standard Deviation



nn aninunnnfl n.r sunnucutn n.r aninnanni s.r Linb n.r 

RL BIOCHEM. BIOPHYS. RES. COMMUN. 165:189-195(1989).

CC -!- FUNCTION: IGF-BINDING PROTEINS PROLONG THE HALF-LIFE OF THE IGFS 

CC AND HAVE BEEN SHOWN TO EITHER INHIBIT OR STIMULATE THE GROWTH 

CC PROMOTING EFFECTS OF THE IGFS ON CELLS CULTURE. THEY ALTER THE 

CC INTERACTION OF IGFS WITH THEIR CELL SURFACE RECEPTORS. 

CC -!- SUBUNIT: IGFBP-3 CAN CIRCULATE IN SERUM BOUND TO BOTH AN IGF 

CC PEPTIDE AND AN 80 KD GLYCOPROTEIN AS PART OF A 150 KD COMPLEX. 

CC -!- BINDS IGF-II MORE THAN IGF-I. 

CC -!- SIMILARITY: TO OTHER INSULIN-LIKE GROWTH FACTOR BINDING PROTEINS. 

DR EMBL; M31837; RNIGFBP3. 

DR EMBL; M33300; RNIGFB01. 

DR PIR; A30820; A30820.

DR PIR; B30820; B30820.

DR PIR; A26832; A26832.

DR PIR; A34651; A34651.

DR PIR; A33570; A33570.

DR PIR; A36748; A36748.

DR PROSITE; PS00222; IGF_BINDING.

DR PROSITE; PS00484; THYROGLOBULIN_1.

KW GROWTH FACTOR BINDING; SIGNAL; GLYCOPROTEIN.

FT SIGNAL 1 27 

FT CHAIN 28 292 INSULIN-LIKE GROWTH FACTOR BINDING 

FT PROTEIN 3. 



FT CARBOHYD 118 118 POTENTIAL.

FT CARBOHYD 124 124 POTENTIAL.

FT CARBOHYD 137 137 POTENTIAL.

FT CARBOHYD 200 200 POTENTIAL.

FT DOMAIN 237 286 THYROGLOBULIN TYPE I.

FT CONFLICT 8 8 MISSING (IN REF. 2).

SQ SEQUENCE 292 AA; 31680 MW; 437334 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK

|||| ||

EDTLNHLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCWCVDKYGQPLPGYDTKGKDDVHCLSVQ
220 230 240 250 260 270 280 290 

SQ 

> 0 < 

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_9a.res made by on Thu 22 Sep 94 10:27:33-PDT.



Query sequence being compared: US-08-249-182-9 (1-16)
Number of sequences searched: 42145
Number of scores above cutoff: 3752

Results of the initial comparison of US-08-249-182-9 (1-16) with:
Data bank : A-GeneSeq 15, all entries

[Histogram residue: upper part of the number-of-sequences axis, with ticks at 100000 and 50000.]



DR PROSITE; PS00484; THYROGLOBULIN_1.

KW GROWTH FACTOR BINDING; SIGNAL; GLYCOPROTEIN.

FT SIGNAL 1 27

FT CHAIN 28 291 INSULIN-LIKE GROWTH FACTOR BINDING

FT PROTEIN 3.

FT DOMAIN 28 134 IGF-BINDING (POTENTIAL).

FT DOMAIN 138 161 SER/THR-RICH.

FT DOMAIN 192 208 SER/THR-RICH.

FT CARBOHYD 116 116 POTENTIAL.

FT CARBOHYD 136 136 POTENTIAL.

FT CARBOHYD 199 199 POTENTIAL.

FT DOMAIN 236 285 THYROGLOBULIN TYPE I.

SQ SEQUENCE 291 AA; 31660 MW; 441418 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK

|||| ||

EDTLNHLKFLNVLSPRGVHIPNCDKKGFYKKKQCRPSKGRKRGFCWCVDKYGQPLPGYTTKGKEDVHCYSHQ
220 230 240 250 260 270 280 



SK 
290 



15. US-08-249-182-8 (1-11)

IBP3_RAT INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECU

ID IBP3_RAT STANDARD; PRT; 292 AA.

AC P15473;

DT 01-APR-1990 (REL. 14, CREATED)

DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR (IGFBP-3)

DE (IBP-3) (IGF-BINDING PROTEIN 3).

OS RATTUS NORVEGICUS (RAT).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; RODENTIA.

RN [1]

RP SEQUENCE FROM N.A.

RM 90088511

RA SHIMASAKI S., KOBA A., MERCADO M., SHIMONAKA M., LING N.;

RL BIOCHEM. BIOPHYS. RES. COMMUN. 165:907-912(1989).

RN [2]

RP SEQUENCE FROM N.A.

RC TISSUE=LIVER;

RM 90147804

RA ALBISTON A.L., HERINGTON A.C.;

RL BIOCHEM. BIOPHYS. RES. COMMUN. 166:892-897(1990).

RN [3]

RP SEQUENCE OF 28-68.

RM 89050156

RA ZAPF J., BORN W., CHANG J.Y., JAMES P., FROESCH E.R., FISCHER J.A.;

RL BIOCHEM. BIOPHYS. RES. COMMUN. 156:1187-1194(1988).

RN [4]

RP SEQUENCE OF 28-42.

RM 87326380

RA BAXTER R.C., MARTIN J.L.;

RL BIOCHEM. BIOPHYS. RES. COMMUN. 147:408-415(1987).

RN [5]

RP SEQUENCE OF 28-49.

RC TISSUE=SERUM;

RM 90073708



SQ SEQUENCE 265 AA; 28561 MW; 365422 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
|| ||||

QAYSGKTDSEGGFESDAVAIANVLETSSAEVGGKPYYYLSVLTRTADGNEGGKHQLITATVNDGKLYICKAQ
170 180 190 200 210 220 230 240

AGDKRHFKGAKKFVENTATSFSLA
250 260 



14. US-08-249-182-8 (1-11) 

IBP3_HUMAN INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECU

ID IBP3_HUMAN STANDARD; PRT; 291 AA.

AC P17936;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-NOV-1990 (REL. 16, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR (IGFBP-3)

DE (IBP-3) (IGF-BINDING PROTEIN 3).

GN IBP3.

OS HOMO SAPIENS (HUMAN).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; PRIMATES.

RN [1]

RP SEQUENCE FROM N.A.

RM 90324259

RA CUBBAGE M.L., SUWANICHKUL A., POWELL D.R.;

RL J. BIOL. CHEM. 265:12642-12649(1990).

RN [2]

RP SEQUENCE FROM N.A.

RM 89112197

RA WOOD W.I., CACHIANES G., HENZEL W.J., WINSLOW G.A., SPENCER S.A.,

RA HELLMISS R., MARTIN J.L., BAXTER R.C.;

RL MOL. ENDOCRINOL. 2:1176-1185(1988).

RN [3]

RP SEQUENCE OF 28-65.

RM 90368661

RA ZAPF J., KIEFER M., MERRYWEATHER J., MASIARZ F., BAUER D., BORN W.,

RA FISCHER J.A., FROESCH E.R.;

RL J. BIOL. CHEM. 265:14892-14898(1990).

CC -!- FUNCTION: IGF-BINDING PROTEINS PROLONG THE HALF-LIFE OF THE IGFS 

CC AND HAVE BEEN SHOWN TO EITHER INHIBIT OR STIMULATE THE GROWTH 

CC PROMOTING EFFECTS OF THE IGFS ON CELLS CULTURE. THEY ALTER THE 

CC INTERACTION OF IGFS WITH THEIR CELL SURFACE RECEPTORS. 

CC -!- SUBUNIT: IGFBP-3 CAN CIRCULATE IN SERUM BOUND TO BOTH AN IGF 

CC PEPTIDE AND AN 80 KD GLYCOPROTEIN AS PART OF A 150 KD COMPLEX. 

CC -!- TISSUE SPECIFICITY: EXPRESSED BY MOST TISSUES. 

CC -!- DEVELOPMENTAL STAGE: IGFBP-3 LEVELS ARE HIGHER DURING EXTRAUTERINE 

CC LIFE AND PEAK DURING PUBERTY. 

CC -!- INDUCTION: IGFBP-3 LEVELS INCREASE IN THE PRESENCE OF IGF-I, 
CC INSULIN AND OTHER GROWTH-STIMULATING FACTORS SUCH AS GROWTH 

CC HORMONE, EPIDERMAL GROWTH FACTOR, AND PHORBOL ESTERS. 

CC -!- BINDS IGF-II MORE THAN IGF-I.

CC -!- SIMILARITY: TO OTHER INSULIN-LIKE GROWTH FACTOR BINDING PROTEINS.

DR EMBL; M35878; HSIBP3.

DR PIR; A36578; IOHU3.

DR MIM; 146732; TENTH EDITION.

DR PROSITE; PS00222; IGF_BINDING.



CC WITH THE PHOTOSYSTEM II COMPLEX.

CC -!- SIMILARITY: TO OTHER OEE2 SUBUNIT AND TO 16 KD SUBUNIT PROTEIN
CC OF PHOTOSYSTEM II OF SPINACH.

DR EMBL; X17213; SA23KDP2.

DR EMBL; Y07498; SA0ES23.

DR PIR; S03888; S03888.

DR PIR; S10016; S10016.

KW PHOTOSYNTHESIS; PHOTOSYSTEM II; CHLOROPLAST; TRANSIT PEPTIDE;

KW THYLAKOID MEMBRANE.

FT TRANSIT 1 74 CHLOROPLAST.

FT CHAIN 75 260 OXYGEN-EVOLVING ENHANCER PROTEIN 2.

SQ SEQUENCE 260 AA; 27925 MW; 347514 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X 10
GGQPLWITATK
|| ||||

QAYFGETASEGGFDNNAVATANILETNIQDVGGKPYYYLSVLTRTADGDEGGKHQLITATVNGGKLYICKAQ
170 180 190 200 210 X 220 X 230



AGDKRWFKGANKFVEKAATSFSVA
240 250 260 



13. US-08-249-182-8 (1-11)

PSBP_TOBAC OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2

ID PSBP_TOBAC STANDARD; PRT; 265 AA.

AC P18212;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)

DE OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2) (23 KD SUBUNIT OF

DE OXYGEN-EVOLVING SYSTEM OF PHOTOSYSTEM II).

GN PSBP.

OS NICOTIANA TABACUM (COMMON TOBACCO).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;

OC SOLANALES; SOLANACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=SR1; TISSUE=LEAF;

RM 91329711

RA HUA S., DUBE S.K., BARNETT N.M., KUNG S.;

RL PLANT MOL. BIOL. 16:749-750(1991).

RN [2]

RP SEQUENCE OF 80-91.

RC STRAIN=CV. KY57;

RM 91329702

RA TAKAHASHI H., EHARA Y., HIRANO H.;

RL PLANT MOL. BIOL. 16:689-698(1991).

CC -!- FUNCTION: ASSOCIATED WITH THE OXYGEN-EVOLVING COMPLEX OF

CC PHOTOSYSTEM II.

CC -!- INDUCTION: BY LIGHT.

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST THYLAKOID MEMBRANE; ASSOCIATED
CC WITH THE PHOTOSYSTEM II COMPLEX.

CC -!- SIMILARITY: TO OTHER OEE2 SUBUNIT AND TO 16 KD SUBUNIT PROTEIN
CC OF PHOTOSYSTEM II OF SPINACH.

DR EMBL; X55354; NTPHII.

DR PIR; S14994; S14994.

DR PIR; S15005; S15005.

KW PHOTOSYNTHESIS; PHOTOSYSTEM II; CHLOROPLAST; TRANSIT PEPTIDE;

KW THYLAKOID MEMBRANE.

FT TRANSIT 1 79 CHLOROPLAST.



nn nUnfllfl N.i hAJlUKA H.» hUJiriUKA Y.i nlYAU nUKAl A l.i WA1 ANABt A.r 

RA SHINOZAKI K.;

RL PROG. PHOTOSYNTH. RES. 1:701-704(1987).

CC -!- FUNCTION: ASSOCIATED WITH THE OXYGEN-EVOLVING COMPLEX OF 

CC PHOTOSYSTEM II. 

CC -!- INDUCTION: BY LIGHT. 

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST THYLAKOID MEMBRANE; ASSOCIATED
CC WITH THE PHOTOSYSTEM II COMPLEX. 

CC -!- SIMILARITY: TO OTHER 0EE2 SUBUNIT AND TO 16 KD SUBUNIT PROTEIN 

CC OF PHOTOSYSTEM II OF SPINACH. 

DR EMBL; X15552; PS23KDAP.

DR EMBL; D13296; PSPSBP.

DR PIR; JS0771; JS0771.

DR PIR; S03271; S03271.

DR PIR; S07467; S07467.

KW PHOTOSYNTHESIS; PHOTOSYSTEM II; CHLOROPLAST; TRANSIT PEPTIDE;

KW THYLAKOID MEMBRANE.

FT TRANSIT 1 73 CHLOROPLAST.

FT CHAIN 74 259 OXYGEN-EVOLVING ENHANCER PROTEIN 2.

FT CONFLICT 107 107 W -> K (IN REF. 2).

SQ SEQUENCE 259 AA; 28047 MW; 348398 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
|| ||||

QAFFGQTDSEGGFDTNAVAVANILESSAPVIGGKQYYNISVLTRTADGDEGGKHQLITATVKDGKLYICKAQ
170 180 190 200 210 X 220 X 230

AGDKRWFKGARKFVEDTASSFSVA
240 250 



12. US-08-249-182-8 (1-11) 

PSBP_SINAL OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2

ID PSBP_SINAL STANDARD; PRT; 260 AA.

AC P11594;

DT 01-OCT-1989 (REL. 12, CREATED)

DT 01-AUG-1990 (REL. 15, LAST SEQUENCE UPDATE)

DT 01-AUG-1991 (REL. 19, LAST ANNOTATION UPDATE)

DE OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2) (23 KD SUBUNIT OF

DE OXYGEN EVOLVING SYSTEM OF PHOTOSYSTEM II).

GN PSBP.

OS SINAPIS ALBA (WHITE MUSTARD).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;

OC CAPPARALES; CRUCIFERAE.

RN [1]

RP SEQUENCE FROM N.A.

RC TISSUE=SEEDLING;

RM 91346684

RA MERKLE T., KRENZ M., WENNG A., SCHAEFER E.;

RL PLANT MOL. BIOL. 14:889-890(1990).

RN [2]

RP SEQUENCE OF 13-260 FROM N.A.

RC TISSUE=SEEDLING;

RM 89211386

RA WENNG A., EHMANN B., SCHAEFER E.;

RL FEBS LETT. 246:140-144(1989).

CC -!- FUNCTION: ASSOCIATED WITH THE OXYGEN-EVOLVING COMPLEX OF 

CC PHOTOSYSTEM II. 

CC -!- INDUCTION: BY LIGHT. 

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST THYLAKOID MEMBRANE, ASSOCIATED 



DE OXYGEN EVOLVING SYSTEM OF PHOTOSYSTEM II).

GN PSBP OR PSBX.

OS LYCOPERSICON ESCULENTUM (TOMATO).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE;

OC SOLANALES; SOLANACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 92256823

RA BETTS S.D., PICHERSKY E.;

RL PLANT MOL. BIOL. 18:995-996(1992).

CC -!- FUNCTION: ASSOCIATED WITH THE OXYGEN-EVOLVING COMPLEX OF

CC PHOTOSYSTEM II.

CC -!- INDUCTION: BY LIGHT.

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST THYLAKOID MEMBRANE; ASSOCIATED
CC WITH THE PHOTOSYSTEM II COMPLEX.

CC -!- SIMILARITY: TO OTHER OEE2 SUBUNIT AND TO 16 KD SUBUNIT PROTEIN
CC OF PHOTOSYSTEM II OF SPINACH.

DR EMBL; X63007; LEPSBXHR.

DR PIR; S20872; F2TOX2.

KW PHOTOSYNTHESIS; PHOTOSYSTEM II; CHLOROPLAST; TRANSIT PEPTIDE;

KW THYLAKOID MEMBRANE.

FT TRANSIT 1 73 CHLOROPLAST (BY SIMILARITY).

FT CHAIN 74 258 OXYGEN-EVOLVING ENHANCER PROTEIN 2.

SQ SEQUENCE 258 AA; 27792 MW; 343591 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
|| ||||

QAYFGKTDSEGGFESGAVATRNLLEASSATVGGKEYYYLSVLTRTADGDEGGKHQLITATVNDGKLYICKAQ
170 180 190 200 210 X 220 X 230

AGDKRMFKGAKKFVENAATSFSIA
240 250 



11. US-08-249-182-8 (1-11)

PSBP_PEA OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2

ID PSBP_PEA STANDARD; PRT; 259 AA.

AC P16059;

DT 01-APR-1990 (REL. 14, CREATED)

DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)

DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)

DE OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (OEE2) (23 KD SUBUNIT OF

DE OXYGEN EVOLVING SYSTEM OF PHOTOSYSTEM II).

GN PSBP.

OS PISUM SATIVUM (GARDEN PEA).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; FABALES;

OC FABACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=CV. LITTLE MARVEL;

RM 91370839

RA WALES R., NEWMAN B.J., ROSE S.A., PAPPIN D., GRAY J.C.;

RL PLANT MOL. BIOL. 13:573-582(1989).

RN [2]

RP SEQUENCE FROM N.A.

RC STRAIN=CV. ALASKA;

RA KONISHI T., MARUTA Y., MURASE M., SHINOHARA K., WATANABE A.;

RL SUBMITTED (OCT-1992) TO EMBL/GENBANK/DDBJ DATA BANKS.

RN [3]

RP SEQUENCE OF 74-99. 



|| ||||

HDGRQHDELRPITFDLDFISHPEGSVLITAGNTKVICNASVEDRVPPFLRGGGKGWITAEYSMLPRATNQRT
10 20 30 40 50 X 60 X 70

IRESSKGKISGRTMEIQRLIGRALRAVVDLEKLGERTIW
80 90 100 110 



9. US-08-249-182-8 (1-11) 

PSBP_WHEAT OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (23 K



ID PSBP_WHEAT STANDARD; PRT; 258 AA.

AC Q00434;

DT 01-DEC-1992 (REL. 24, CREATED)

DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE)

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)

DE OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (23 KD SUBUNIT OF

DE OXYGEN EVOLVING SYSTEM OF PHOTOSYSTEM II).

GN PSBP.

OS TRITICUM AESTIVUM (WHEAT).

OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; MONOCOTYLEDONEAE;

OC CYPERALES; GRAMINEAE.

RN [1]

RP SEQUENCE FROM N.A.

RC STRAIN=AVALON; TISSUE=LEAF;

RM 91329731

RA JAMES H.E., ROBINSON C.;

RL PLANT MOL. BIOL. 17:179-182(1991).

CC -!- FUNCTION: ASSOCIATED WITH THE OXYGEN-EVOLVING COMPLEX OF

CC PHOTOSYSTEM II.

CC -!- INDUCTION: BY LIGHT.

CC -!- SUBCELLULAR LOCATION: CHLOROPLAST THYLAKOID MEMBRANE; ASSOCIATED
CC WITH THE PHOTOSYSTEM II COMPLEX.

CC -!- SIMILARITY: TO OTHER OEE2 SUBUNIT AND TO 16 KD SUBUNIT PROTEIN
CC OF PHOTOSYSTEM II OF SPINACH.

DR EMBL; X57407; TAPPSBP.

DR PIR; S22763; S22763.

KW PHOTOSYNTHESIS; PHOTOSYSTEM II; CHLOROPLAST; TRANSIT PEPTIDE;

KW THYLAKOID MEMBRANE.

FT TRANSIT 1 73 CHLOROPLAST (BY SIMILARITY).

FT CHAIN 74 258 OXYGEN-EVOLVING ENHANCER PROTEIN 2.

SQ SEQUENCE 258 AA; 27269 MW; 341934 CN;
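The feature-table (FT) lines of a record give residue ranges that can be cross-checked against the length on the SQ line. The sketch below is a hedged illustration rather than a full flat-file parser. It assumes whitespace-separated fields in the order key, start, end and description, uses a trimmed copy of the wheat record above as input, and uses hypothetical function names. Here the transit peptide (1 to 73) and the mature chain (74 to 258) together account for all 258 residues.

```python
import re

# Trimmed copy of the PSBP_WHEAT record, used only as sample input.
RECORD = """\
ID PSBP_WHEAT STANDARD; PRT; 258 AA.
FT TRANSIT 1 73 CHLOROPLAST (BY SIMILARITY).
FT CHAIN 74 258 OXYGEN-EVOLVING ENHANCER PROTEIN 2.
SQ SEQUENCE 258 AA; 27269 MW; 341934 CN;
"""

FT_LINE = re.compile(r"^FT\s+(\S+)\s+(\d+)\s+(\d+)\s*(.*)$")
SQ_LINE = re.compile(r"^SQ\s+SEQUENCE\s+(\d+)\s+AA;", re.MULTILINE)


def parse_features(text):
    """Return one dict per FT line that carries a numeric start and end."""
    features = []
    for line in text.splitlines():
        match = FT_LINE.match(line)
        if match:
            key, start, end, note = match.groups()
            start, end = int(start), int(end)
            features.append({
                "key": key,
                "start": start,
                "end": end,
                "length": end - start + 1,  # ranges are inclusive
                "note": note.strip(),
            })
    return features


def declared_length(text):
    """Return the residue count from the SQ line, or None if absent."""
    match = SQ_LINE.search(text)
    return int(match.group(1)) if match else None


features = parse_features(RECORD)
for feature in features:
    print(feature["key"], feature["start"], feature["end"], feature["length"])
print(declared_length(RECORD))
```

Continuation lines and features with unknown bounds (shown as `?` in some records) are skipped by this pattern and would need separate handling.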



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK

|| ||||

QSYGGKTDSEGGFESDAVATANVLESSAPVVDGKQYYSITVLTRTADGDEGGKHQLITATVADGKLYVCKAQ
170 180 190 200 210 X 220 X 230



RDKRWFKGAKKFVENAAGSFSVA
240 250 



10. US-08-249-182-8 (1-11) 

PSBP_LYCES OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (23 K 



ID PSBP_LYCES STANDARD; PRT; 258 AA.

AC P29795;

DT 01-APR-1993 (REL. 25, CREATED) 

DT 01-APR-1993 (REL. 25, LAST SEQUENCE UPDATE) 

DT 01-APR-1993 (REL. 25, LAST ANNOTATION UPDATE) 

DE OXYGEN-EVOLVING ENHANCER PROTEIN 2 PRECURSOR (23 KD SUBUNIT OF 



CC -!- SIMILARITY: BELONGS TO THE SOMATOTROPIN/PROLACTIN FAMILY.

CC -!- CAUTION: BE CAREFUL OF POSSIBLE CONFUSION BETWEEN PRC-III AND

CC PLP-III, THE TWO PROTEINS, WHILE DIFFERENT, ARE CALLED PLACENTAL

CC PROLACTIN-RELATED PROTEIN III.

DR EMBL; M27240; M27240.

DR PIR; B34078; B34078.

DR PROSITE; PS00266; SOMATOTROPIN_1.

DR PROSITE; PS00338; SOMATOTROPIN_2.

KW HORMONE; PLACENTA; SIGNAL.

FT SIGNAL 1 13 POTENTIAL.

FT CHAIN 14 213 PLACENTAL PROLACTIN-RELATED PROTEIN III.

FT DISULFID 72 190 BY SIMILARITY.

FT DISULFID 207 213 BY SIMILARITY.

SQ SEQUENCE 213 AA; 25016 MW; 236060 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X X
GGQPLWITATK

|| | |||

QVNSCPSCCPDVFDIPLESLTHLFLNASRLSHDIVNHTTIMFHEFDEKYAQNQPYTINATKSCHTNSLHTPS
10 20 30 40 50 60 70 80 



EREKALRMNNEDLSKWILMLLYSMHRPLYLLVKDLGSMK 
90 100 110 120 



8. US-08-249-182-8 (1-11) 

RNPH_BACSU RIBONUCLEASE PH (EC 2.7.7.56) (RNASE PH) (TRNA

ID RNPH_BACSU STANDARD; PRT; 245 AA.

AC P28619;

DT 01-DEC-1992 (REL. 24, CREATED)

DT 01-DEC-1992 (REL. 24, LAST SEQUENCE UPDATE)

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)

DE RIBONUCLEASE PH (EC 2.7.7.56) (RNASE PH) (TRNA

DE NUCLEOTIDYLTRANSFERASE).

GN RPH.

OS BACILLUS SUBTILIS.

OC PROKARYOTA; FIRMICUTES; ENDOSPORE-FORMING RODS AND COCCI; BACILLACEAE.

RN [1] 

RP SEQUENCE FROM N.A. 

RM 92325065 

RA CRAVEN M.G., HENNER D.J., ALESSI D., SCHAUER A.T., OST K.A., 

RA DEUTSCHER M.P., FRIEDMAN D.I.; 

RL J. BACTERIOL. 174:4727-4735(1992). 

CC -!- FUNCTION: RNASE PH IS A PHOSPHOROLYTIC EXORIBONUCLEASE THAT 

CC REMOVES NUCLEOTIDE RESIDUES FOLLOWING THE -CCA TERMINUS OF TRNA 

CC AND ADDS NUCLEOTIDES TO THE ENDS OF RNA MOLECULES BY USING 

CC NUCLEOSIDE DIPHOSPHATES AS SUBSTRATES. 

CC -!- CATALYTIC ACTIVITY: TRNA(N+1) + ORTHOPHOSPHATE = TRNA(N) + 

CC A NUCLEOSIDE DIPHOSPHATE. 

CC -!- SIMILARITY: TO OTHER SPECIES RNASES PH. 

DR EMBL; M85163; BSRPHA.

DR PIR; A44914; A44914.

KW TRANSFERASE; NUCLEOTIDYLTRANSFERASE; TRNA PROCESSING.

SQ SEQUENCE 245 AA; 26681 MW; 287533 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X 10
GGQPLWITATK



ID SSBP_ECOLI STANDARD; PRT; 174 AA.

AC P18022;

DT 01-NOV-1990 (REL. 16, CREATED)

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABILIZING PROTEIN).

GN SSB.

OS ESCHERICHIA COLI.

OG PLASMID COLIB-P9.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;

OC ENTEROBACTERIACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 89213928

RA HOWLAND C.J., REES C.E.D., BARTH P.T., WILKINS B.M.;

RL J. BACTERIOL. 171:2466-2473(1989).

CC -!- FUNCTION: MAY CONTRIBUTE TO THE CONJUGATIVE PROCESSING OF DNA. IT

CC HAS A FUNCTIONAL RELATIONSHIP WITH PSI (PLASMID-MEDIATED SOS

CC INHIBITION) PROTEINS.

CC -!- SUBUNIT: HOMOTETRAMER (BY SIMILARITY).

CC -!- SIMILARITY: TO OTHERS PROKARYOTIC AND MITOCHONDRIAL SS-DNA

CC BINDING PROTEINS.

DR EMBL; M25505; M25505.

DR PIR; A32304; DDECIB.

DR PROSITE; PS00735; SSB_1.

DR PROSITE; PS00736; SSB_2.

KW PLASMID; DNA-BINDING; DNA REPLICATION.

FT INIT_MET 0 0 BY SIMILARITY.

FT DNA_BIND 54 60 BY SIMILARITY.

SQ SEQUENCE 174 AA; 19109 MW; 144724 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X X
GGQPLWITATK
||| |||

YIEGQLRTRSMDDNGITRYITEILVKTTGTMQMLGSAPQQNAQAQPKPGQNGQPGSADATKKGGAKTKGRGR
80 90 100 110 120 130 140 



KAAQPEPQPQTPEGEDYGFSDDIPF 
150 160 170 



7. US-08-249-182-8 (1-11)

PRR3_BOVIN PLACENTAL PROLACTIN-RELATED PROTEIN III PRECURSOR

ID PRR3_BOVIN STANDARD; PRT; 213 AA.

AC P12402;

DT 01-OCT-1989 (REL. 12, CREATED)

DT 01-OCT-1989 (REL. 12, LAST SEQUENCE UPDATE)

DT 01-FEB-1991 (REL. 17, LAST ANNOTATION UPDATE)

DE PLACENTAL PROLACTIN-RELATED PROTEIN III PRECURSOR (PRC-III).

OS BOS TAURUS (BOVINE).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; ARTIODACTYLA.

RN [1]

RP SEQUENCE FROM N.A.

RM 89352599

RA KESSLER M.A., MILOSAVLJEVIC M., ZIELER C.G., SCHULER L.A.;

RL BIOCHEMISTRY 28:5154-5161(1989).

CC -!- FUNCTION: PLACENTAL PROLACTIN-RELATED PROTEINS MAY PLAY A
CC SPECIFIC ROLE DURING GESTATION.

CC -!- SUBCELLULAR LOCATION: SECRETED.



SQ SEQUENCE 170 AA; 17461 MW; 168534 CN;

Initial Score = 6 Optimized Score = 6 Significance = 4.52

Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X X
GGQPLWITATK
||| | ||

PASKTFESYRVHTfiVHTNDATKKVIVKLADTPQLTDVLNSTVQHPISVSWGGQVLSTTAKEFEAAALGYSAS
60 70 80 90 100 110 120 

GVNGVSSSQELVISAAPKTAGTAPTAGNYSGVVSLVHTL 
130 140 150 160 



5. US-08-249-182-8 (1-11) 

SSBR_ECOLI SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABI



ID SSBR_ECOLI STANDARD; PRT; 174 AA.

AC P28045;

DT 01-AUG-1992 (REL. 23, CREATED)

DT 01-AUG-1992 (REL. 23, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABILIZING PROTEIN).

GN SSB.

OS ESCHERICHIA COLI.

OG PLASMID R64.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;

OC ENTEROBACTERIACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 91180109

RA RUVOLO P.P., KEATING K.M., WILLIAMS K.R., CHASE J.W.;

RL PROTEINS 9:120-134(1991).

CC -!- FUNCTION: MAY CONTRIBUTE TO THE CONJUGATIVE PROCESSING OF DNA. IT

CC HAS A FUNCTIONAL RELATIONSHIP WITH PSI (PLASMID-MEDIATED SOS

CC INHIBITION) PROTEINS.

CC -!- SUBUNIT: HOMOTETRAMER (BY SIMILARITY).

CC -!- SIMILARITY: TO OTHERS PROKARYOTIC AND MITOCHONDRIAL SS-DNA

CC BINDING PROTEINS.

DR PIR; A38487; A38487.

DR PROSITE; PS00735; SSB_1.

DR PROSITE; PS00736; SSB_2.

KW DNA-BINDING; DNA REPAIR; DNA REPLICATION.

FT INIT_MET 0 0 BY SIMILARITY.

FT DNA_BIND 54 60 BY SIMILARITY.

SQ SEQUENCE 174 AA; 19181 MW; 144576 CN;



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0



X X
GGQPLWITATK
||| |||

YIEGQLRTRSWDDNGITRYITEILVKTTGTHQMLGSAPSSNAQAQPKPQQNGQPGSADATKKGGAKTKGRER
80 90 100 110 120 130 140



KAAQPEPQPQTPEGEDYGFSDDIPF
150 160 170 



6. US-08-249-182-8 (1-11) 

SSBP_ECOLI SINGLE-STRAND BINDING PROTEIN (SSB) (HELIX-DESTABI



Initial Score = 6 Optimized Score = 6 Significance = 4.52
Residue Identity = 54% Matches = 6 Mismatches = 5

Gaps = 0 Conservative Substitutions = 0

X 10
GGQPLWITATK
|| || ||

PRTEKLSKFLDDDTFQKFGEVVGYNPLQVLRYDLGGKPLLYAETKVDILSTVPRPGYNPSSQRIFEHQLHPK
10 20 30 X 40 X 50 60 70 

MIFDLEEVVSVDNGMEHGTILVF 
80 90 



4. US-08-249-182-8 (1-11)

FMC1_ECOLI CFA/I FIMBRIAL SUBUNIT B PRECURSOR (COLONISATION F

ID FMC1_ECOLI STANDARD; PRT; 170 AA.

AC P02971;

DT 21-JUL-1986 (REL. 01, CREATED)

DT 01-FEB-1991 (REL. 17, LAST SEQUENCE UPDATE)

DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)

DE CFA/I FIMBRIAL SUBUNIT B PRECURSOR (COLONISATION FACTOR ANTIGEN I

DE SUBUNIT B) (CFA/I PILIN) (CFA/I ANTIGEN).

GN CFAB.

OS ESCHERICHIA COLI.

OG PLASMID NTP513.

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;

OC ENTEROBACTERIACEAE.

RN [1]

RP SEQUENCE FROM N.A.

RM 89173309

RA KARJALAINEN T.K., EVANS D.G., SO M., LEE C.H.;

RL INFECT. IMMUN. 57:1126-1130(1989).

RN [2]

RP SEQUENCE FROM N.A.

RM 89330163

RA HAMERS A.M., PEL H.J., WILLSHAW G.A., KUSTERS J.G.,

RA VAN DER ZEIJST B.A.M., GAASTRA W.;

RL MICROB. PATHOG. 6:297-309(1989).

RN [3]

RP SEQUENCE OF 24-170 FROM N.A.

RC STRAIN=H-10407;

RM 82235736

RA KLEMM P.;

RL EUR. J. BIOCHEM. 124:339-348(1982).

CC -!- FUNCTION: FIMBRIAE (ALSO CALLED PILI), POLAR FILAMENTS RADIATING

CC FROM THE SURFACE OF THE BACTERIUM TO A LENGTH OF 0.5-1.5

CC MICROMETERS AND NUMBERING 100-300 PER CELL, ENABLE BACTERIA TO

CC COLONIZE THE EPITHELIUM OF SPECIFIC HOST ORGANS.

CC -!- SUBUNIT: CFA/I FIMBRIAE ARE RATHER RIGID, THREAD-LIKE FILAMENTS OF

CC 0.5-1 MICROMETER, WITH AN APPARENT AXIAL HOLE, AND A DIAMETER OF

CC 7 NANOMETERS. A SINGLE CFA/I FIMBRIA CONSISTS OF ABOUT 100

CC IDENTICAL PROTEIN SUBUNITS.

CC -!- INDUCTION: CFA/I FIMBRIAE ARE ONLY EXPRESSED IN THE PRESENCE OF

CC THE POSITIVE REGULATOR CFAD.

CC -!- SIMILARITY: TO THE CS1 FIMBRIAL SUBUNIT A (CSOA).

DR EMBL; M55661; ECCFAIA.

DR PIR; A30589; YQECC1.

KW FIMBRIA; ANTIGEN; PLASMID; SIGNAL.

FT SIGNAL 1 23

FT CHAIN 24 170 CFA/I FIMBRIAL PROTEIN B.

FT CONFLICT 37 37 V -> A (IN REF. 2).

FT CONFLICT 76 76 D -> N (IN REF. 3).



CC -!- CATALYTIC ACTIVITY: ACYL-[ACYL-CARRIER PROTEIN] + NADP(+) = 2,3- 
CC DEHYDROACYL-[ACYL-CARRIER PROTEIN] + NADPH. 

CC -!- CATALYTIC ACTIVITY: OLEOYL-[ACYL-CARRIER PROTEIN] + H(2)O = 
CC ACYL-CARRIER PROTEIN + OLEATE. 

CC -!- SUBUNIT: HOMODIMER, WHICH ARE ARRANGED IN A HEAD TO TAIL FASHION. 

DR EMBL; J03860; GGFAS. 

DR EMBL; J04485; GGFASB. 

DR EMBL; J 023 3V; GGFASA. 

DR PIR; A33918; XYCHFA. 

DR PIR; A32015; A32015. 

DR PROSITE; PS00012; PHOSPHOPANTETHEINE. 

DR PROSITE; PS00606; B_KETOACYL_SYNTHASE. 

KW FATTY ACID BIOSYNTHESIS; MULTIFUNCTIONAL ENZYME; PHOSPHOPANTETHEINE; 

KW TRANSFERASE; HYDROLASE; OXIDOREDUCTASE; LIGASE; NADP. 





FT DOMAIN       1    338  BETA-KETOACYL SYNTHASE. 
FT DOMAIN       ?      ?  ACYL AND MALONYL TRANSFERASES. 
FT DOMAIN       ?   1793  ENOYL REDUCTASE. 
FT DOMAIN    1794   2046  BETA-KETOACYL REDUCTASE. 
FT DOMAIN    2047   2135  ACYL CARRIER. 
FT DOMAIN    2136   2446  THIOESTERASE. 
FT ACT_SITE    87     87  BETA-KETOACYL SYNTHASE (BY SIMILARITY). 
FT ACT_SITE   506    506  MALONYLTRANSFERASE (BY SIMILARITY). 
FT BINDING   2084   2084  PHOSPHOPANTETHEINE (BY SIMILARITY). 
FT ACT_SITE  2235   2235  THIOESTERASE. 
FT VARSPLIC  2276   2233  MISSING (IN ISOFORM 2). 
FT CONFLICT  1504   1504  R -> H (IN REF. 2). 
FT CONFLICT  1659   1659  E -> 9 (IN REF. 2). 
FT CONFLICT  1672   1672  N -> S (IN REF. 2). 
SQ SEQUENCE 2446 AA; 267246 MW; 21121641 CN; 



Initial Score = 7 Optimized Score = 7 Significance = 5.65 
Residue Identity = 63% Matches = 7 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

X X 
GGQPLWITATK 
Mil III 

LNLVAMKRSFFGSVI FLCRR8SPAKAPI LLPVDDTHYKW VDSLKE I LADSSEQPLHLTATNCGNSG I LGMVN 
1320 1330 1340 1350 1360 1370 1380 

CLRLEAEGHRIRCVFVSNLSPSSTVPATSLSSLENQKII 
1390 1400 1410 1420 



3. US-08-249-182-8 (1-11) 

YIFM_YEAST HYPOTHETICAL PROTEIN IN IFM1 3'REGION (FRAGMENT). 

ID YIFM_YEAST STANDARD; PRT; 125 AA. 

AC P25040; 

DT 01-MAY-1992 (REL. 22, CREATED) 

DT 01-MAY-1992 (REL. 22, LAST SEQUENCE UPDATE) 

DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE) 

DE HYPOTHETICAL PROTEIN IN IFM1 3'REGION (FRAGMENT). 

OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST). 

OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES. 

RN [1] 

RP SEQUENCE FROM N.A. 

RM 92037620 

RA VAMBUTAS A., ACKERMAN S.H., TZAGOLOFF A.; 

RL EUR. J. BIOCHEM. 201:643-652(1991). 

DR EMBL; X58379; SCIFM1. 

DR PIR; S20178; S20178. 

DR PIR; S17024; S17024. 

KW HYPOTHETICAL PROTEIN. 

FT NON_TER 1 1 



ID FAS_CHICK STANDARD; PRT; 2446 AA. 

AC P12276; 

DT 01-OCT-1989 (REL. 12, CREATED) 

DT 01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE) 

DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE) 

DE FATTY ACID SYNTHASE (EC 2.3.1.85) (CONTAINS: EC 2.3.1.38, EC 2.3.1.39, 

DE EC 2.3.1.41, EC 1.1.1.100, EC 4.2.1.61, EC 1.3.1.10, AND EC 3.1.2.14). 

GN FAS. 

OS GALLUS GALLUS (CHICKEN). 

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; AVES; NEOGNATHAE; 

OC GALLIFORMES. 

RN [1] 

RP SEQUENCE OF 1-1701 FROM N.A. 

RC TISSUE=LIVER; 

RM 89282777 

RA HOLZER K.P., LIU W., HAMMES G.G.; 

RL PROC. NATL. ACAD. SCI. U.S.A. 86:4387-4391(1989). 

RN [2] 

RP SEQUENCE OF 1494-2438 FROM N.A. 

RM 89139426 

RA CHIRALA S.S., KASTURI R., PAZIRANDEH M., STOLOW D.T., HUANG W.-Y., 

RA WAKIL S.J.; 

RL J. BIOL. CHEM. 264:3750-3757(1989). 

RN [3] 

RP SEQUENCE OF 1678-2438 FROM N.A. 

RM 88320436 

RA YUAN Z., LIU W., HAMMES G.G.; 

RL PROC. NATL. ACAD. SCI. U.S.A. 85:6328-6331(1988). 

RN [4] 

RP SEQUENCE OF 2128-2438 FROM N.A. 

RM 89088152 

RA KASTURI R., CHIRALA S., PAZIRANDEH M., WAKIL S.J.; 

RL BIOCHEMISTRY 27:7778-7785(1988). 

RN [5] 

RP SEQUENCE OF 2047-2135. 

RM 89192401 

RA HUANG W.-Y., STOOPS J.K., WAKIL S.J.; 

RL ARCH. BIOCHEM. BIOPHYS. 270:92-98(1989). 

RN [6] 

RP SEQUENCE OF 2135-2434. 

RC STRAIN=WHITE LEGHORN; 

RM 89088151 

RA YANG C.-Y., HUANG W.-Y., CHIRALA S., WAKIL S.J.; 

RL BIOCHEMISTRY 27:7773-7777(1988). 

RN [7] 

RP SEQUENCE OF 593-600 AND 1624-1635. 

RM 89323081 

RA CHANG S.I., HAMMES G.G.; 

RL BIOCHEMISTRY 28:3781-3788(1989). 

CC -!- FUNCTION: FATTY ACID SYNTHETASE CATALYZES THE FORMATION OF 
CC LONG-CHAIN FATTY ACIDS FROM ACETYL-COA, MALONYL-COA AND NADPH. 

CC THIS MULTIFUNCTIONAL PROTEIN HAS 7 CATALYTIC ACTIVITIES AND AN 

CC ACYL CARRIER PROTEIN. 

CC -!- CATALYTIC ACTIVITY: ACETYL-COA + N MALONYL-COA + 2N NADPH = 
CC LONG-CHAIN FATTY ACID + (N+1) COA + N CO(2) + 2N NADP(+). 

CC -!- CATALYTIC ACTIVITY: ACETYL-COA + [ACYL-CARRIER PROTEIN] = COA 
CC + ACETYL-[ACYL-CARRIER PROTEIN]. 

CC -!- CATALYTIC ACTIVITY: MALONYL-COA + [ACYL-CARRIER PROTEIN] = COA 
CC + MALONYL-[ACYL-CARRIER PROTEIN]. 

CC -!- CATALYTIC ACTIVITY: ACYL-[ACYL-CARRIER PROTEIN] + MALONYL-[ACYL- 
CC CARRIER PROTEIN] = 3-OXOACYL-[ACYL-CARRIER PROTEIN] + CO(2) + 

CC [ACYL-CARRIER PROTEIN]. 

CC -!- CATALYTIC ACTIVITY: (3R)-3-HYDROXYACYL-[ACYL-CARRIER PROTEIN] + 

CC NADP(+) = 3-OXOACYL-[ACYL-CARRIER PROTEIN] + NADPH. 

CC -!- CATALYTIC ACTIVITY: (3R)-3-HYDROXYPALMITOYL-[ACYL-CARRIER PROTEIN] 



107 


Vn«l_KlNUI\ 






0 


0 




u 


17. GPDAJCHPO   GLYCEROL-3-PHOSPHATE DEHYDROG    384   6   6   4.52   0 
18. PEPC_HUMAN  PROGASTRICSIN PRECURSOR (EC 3    388   6   6   4.52   0 
19. PEPC_RAT    PROGASTRICSIN PRECURSOR (EC 3    392   6   6   4.52   0 
20. AAT_BACSP   ASPARTATE AMINOTRANSFERASE (E    392   6   6   4.52   0 
21. SBCD_ECOLI  EXONUCLEASE SBCD.                400   6   7   4.52   0 
22. B3AR_RAT    BETA-3-ADRENERGIC RECEPTOR.      400   6   6   4.52   0 
23. PPB3_BACSU  ALKALINE PHOSPHATASE III PREC    462   6   6   4.52   0 
24. CYAA_TRYEG  ADENYLATE CYCLASE (EC 4.6.1.1    469   6   6   4.52   0 
25. ATPB_EUGGR  ATP SYNTHASE BETA CHAIN (EC 3    480   6   6   4.52   0 
26. ATPB_CHLEL  ATP SYNTHASE BETA CHAIN (EC 3    481   6   6   4.52   0 
27. NIFB_RHIME  NIFB PROTEIN.                    490   6   6   4.52   0 
28. ATPB_CHLRE  ATP SYNTHASE BETA CHAIN (EC 3    491   6   6   4.52   0 
29. OM6E_CHLTR  60 KD OUTER MEMBRANE PROTEIN     547   6   6   4.52   0 
30. ATP2_ORYSA  ATP SYNTHASE BETA CHAIN, MITO    551   6   6   4.52   0 
31. ATP2_MAIZE  ATP SYNTHASE BETA CHAIN, MITO    553   6   6   4.52   0 
32. ATP2_NICPL  ATP SYNTHASE BETA CHAIN, MITO    560   6   6   4.52   0 
33. LCFA_ECOLI  LONG-CHAIN-FATTY-ACID-COA LI     561   6   6   4.52   0 
34. ATP2_HEVBR  ATP SYNTHASE BETA CHAIN, MITO    562   6   6   4.52   0 
35. MASY_CUCSA  MALATE SYNTHASE, GLYOXYSOMAL     568   6   6   4.52   0 
36. YLP5_CAEEL  HYPOTHETICAL 66.5 KD PROTEIN     608   6   6   4.52   0 
37. HEX3_YEAST  HEXOSE METABOLISM-RELATED PRO    619   6   6   4.52   0 
38. PC1_MOUSE   PLASMA-CELL MEMBRANE GLYCOPRO    871   6   6   4.52   0 
39. SCD5_YEAST  SCD5 PROTEIN.                    872   6   7   4.52   0 
40. CARB_BACSU  CARBAMOYL-PHOSPHATE SYNTHASE,   1071   6   6   4.52   0 



1. US-08-249-182-8 (1-11) 

VG02_VZVD HYPOTHETICAL GENE 2 PROTEIN. 

ID VG02_VZVD STANDARD; PRT; 238 AA. 

AC P09267; 

DT 01-MAR-1989 (REL. 10, CREATED) 

DT 01-MAR-1989 (REL. 10, LAST SEQUENCE UPDATE) 

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE) 

DE HYPOTHETICAL GENE 2 PROTEIN. 

GN 2. 

OS VARICELLA-ZOSTER VIRUS (STRAIN DUMAS) (VZV). 

OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; HERPESVIRIDAE; ALPHAHERPESVIRINAE. 

RN [1] 

RP SEQUENCE FROM N.A. 

RM 86306657 

RA DAVISON A.J., SCOTT J.E.; 

RL J. GEN. VIROL. 67:1759-1816(1986). 

CC -!- SIMILARITY: TO EHV-1 3. 

DR EMBL; X04370; HEVZVXX. 

DR PIR; B27212; UZBE2. 

KW HYPOTHETICAL PROTEIN. 

SQ SEQUENCE 238 AA; 25984 MW; 292124 CN; 

Initial Score = 7 Optimized Score = 7 Significance = 5.65 
Residue Identity = 63% Matches = 7 Mismatches = 4 

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
II I II II 

SETLAYGHVPAFIMGSTLVRPSLNATAEENPASETRCLLRVLAGRTVDLPGGGTLHITCTKTYVIIGKYSKP 
10 20 30 40 50 X 60 X 70 

GERLSLARLIGRAMTPGGARTFIILAMKEKRSTTLGYEC 
80 90 100 110 
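
The database records in this report use a flat-file layout in which each line starts with a two-letter code (ID, AC, DT, DE, OS, DR, and so on). As a rough illustration only, the sketch below groups the lines of one entry by that code. The function name `parse_flat_entry` is hypothetical and is not part of FastDB or Swiss-Prot tooling; the sample lines are copied from the VG02_VZVD entry above.

```python
from collections import defaultdict


def parse_flat_entry(lines):
    """Group the lines of one flat-file entry by their two-letter line code.

    Hypothetical helper for illustration; blank lines are skipped because
    the scanned report separates most records with empty lines.
    """
    fields = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        code, _, value = line.partition(" ")
        # Only accept two uppercase letters as a line code (ID, AC, DR, ...).
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(value.strip())
    return dict(fields)


entry = parse_flat_entry([
    "ID VG02_VZVD STANDARD; PRT; 238 AA.",
    "",
    "AC P09267;",
    "",
    "DE HYPOTHETICAL GENE 2 PROTEIN.",
    "",
    "DR EMBL; X04370; HEVZVXX.",
    "",
    "DR PIR; B27212; UZBE2.",
])
accession = entry["AC"][0].rstrip(";")
```

Repeated codes such as DR are kept as a list, so every cross-reference stays available in the order it appeared.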



2. US-08-249-182-8 (1-11) 

FAS_CHICK FATTY ACID SYNTHASE (EC 2.3.1.85) (CONTAINS: EC 2. 



[Histogram x-axis residue: SCORE 0 to 7; STDEV -1 to 5] 



PARAMETERS 

Similarity matrix         Unitary    K-tuple                  2 
Mismatch penalty          1          Joining penalty          20 
Gap penalty               1.00       Window size              5 
Gap size penalty          0.05 
Cutoff score              0 
Randomization group       0 

Initial scores to save    40         Alignments to save       15 
Optimized scores to save  0          Display context          50 
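
One way to read the Unitary similarity matrix setting is that each identical residue pair contributes 1 to an ungapped score. The sketch below counts identities between the 11-residue query and the aligned window of VG02_VZVD as it appears in this report. The function name `count_identities` is hypothetical, and the scoring interpretation is an assumption rather than FastDB's documented algorithm.

```python
def count_identities(query, window):
    """Count positions where two equal-length sequences carry the same residue."""
    if len(query) != len(window):
        raise ValueError("sequences must have the same length")
    return sum(1 for a, b in zip(query, window) if a == b)


# Query US-08-249-182-8 (1-11) and the VG02_VZVD window it is aligned to.
query = "GGQPLWITATK"
window = "GGGTLHITCTK"
matches = count_identities(query, window)
mismatches = len(query) - matches
```

Under that assumption the count agrees with the 7 matches and 4 mismatches the report lists for this hit.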





SEARCH STATISTICS 

Scores:   Mean           Median          Standard Deviation 
          2              3               0.89 

Times:    CPU            Total Elapsed 
          00:00:50.96    00:00:58.00 

Number of residues: 12496420 
Number of sequences searched: 36000 
Number of scores above cutoff: 3932 

Cut-off raised to 2. 
Cut-off raised to 3. 
Cut-off raised to 4. 
Cut-off raised to 5. 

The scores below are sorted by initial score. 
Significance is calculated based on initial score. 

A 100% identical sequence to the query sequence was not found. 
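
The report says significance is based on the initial score and groups hits by standard deviations above the mean. A plausible reading, offered here as an assumption rather than as FastDB's documented formula, is a plain z-score. The sketch uses the printed mean (2) and standard deviation (0.89); the function name `significance` is hypothetical.

```python
def significance(score, mean, stdev):
    """Return how many standard deviations a score lies above the mean."""
    if stdev <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / stdev


# Printed search statistics: mean 2, standard deviation 0.89.
sig_for_7 = significance(7, 2, 0.89)
sig_for_6 = significance(6, 2, 0.89)
```

With these rounded inputs the results come out near 5.62 and 4.49, close to but not identical with the 5.65 and 4.52 printed in the list of best scores. The gap would be consistent with the program using an unrounded mean slightly below 2.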



The list of best scores is: 



                                                      Init.  Opt. 
    Sequence Name  Description                 Length Score  Score  Sig.  Frame 

**** 5 standard deviations above mean **** 

 1. VG02_VZVD   HYPOTHETICAL GENE 2 PROTEIN.      238   7     7     5.65   0 
 2. FAS_CHICK   FATTY ACID SYNTHASE (EC 2.3.1    2446   7     7     5.65   0 

**** 4 standard deviations above mean **** 

 3. YIFM_YEAST  HYPOTHETICAL PROTEIN IN IFM1      125   6     ?     4.52   0 
 4. FMC1_ECOLI  CFA/I FIMBRIAL SUBUNIT B PREC     170   6     6     4.52   0 
 5. SSBR_ECOLI  SINGLE-STRAND BINDING PROTEIN     174   6     ?     4.52   0 
 6. SSBP_ECOLI  SINGLE-STRAND BINDING PROTEIN     174   6     ?     4.52   0 
 7. PRR3_BOVIN  PLACENTAL PROLACTIN-RELATED P     213   6     ?     4.52   0 
 8. RNPH_BACSU  RIBONUCLEASE PH (EC 2.7.7.56)     245   6     ?     4.52   0 
 9. PSBP_WHEAT  OXYGEN-EVOLVING ENHANCER PROT     258   6     ?     4.52   0 
10. PSBP_LYCES  OXYGEN-EVOLVING ENHANCER PROT     258   6     ?     4.52   0 
11. PSBP_PEA    OXYGEN-EVOLVING ENHANCER PROT     259   6     ?     4.52   0 
12. PSBP_SINAL  OXYGEN-EVOLVING ENHANCER PROT     260   6     ?     4.52   0 
13. PSBP_TOBAC  OXYGEN-EVOLVING ENHANCER PROT     265   6     ?     4.52   0 
14. IBP3_HUMAN  INSULIN-LIKE GROWTH FACTOR BI     291   6     ?     4.52   0 
15. IBP3_RAT    INSULIN-LIKE GROWTH FACTOR BI     292   6     ?     4.52   0 



240 250 

> 0 < 

0| |0 IntelliGenetics 

> 0 < 

FastDB - Fast Pairwise Comparison of Sequences 
Release 5.4 

Results file u249_8s.res Made by on Thu 22 Sep 94 10:19:13-PDT. 



Query sequence being compared: US-08-249-182-8 (1-11) 
Number of sequences searched: 36000 
Number of scores above cutoff: 3932 

Results of the initial comparison of US-08-249-182-8 (1-11) with: 
Data bank : Swiss-Prot 28, all entries 

[Histogram not recoverable from the scan: NUMBER OF SEQUENCES on a logarithmic axis (5 to 100000).] 



KEYWORDS        chloroplast; photosynthesis; photosystem II; thylakoid 

FEATURE 
   1-73         #domain transit peptide (chloroplast) #status predicted 
                #label TNP\ 
   74-258       #protein photosystem II oxygen-evolving complex protein 
                2 #status predicted #label MAT 
SUMMARY         #length 258 #molecular-weight 27792 #checksum 9784 

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X 10 
GGQPLWITATK 

I! 

QAYFGKTDSEGGFESGAVATRNLLEASSATVGGKEYYYLSVLTRTADGDEGGKHQLITATVNDGKLVICKA8 
170 180 190 200 210 X 220 X 230 



AGDKRWFKGAKKFVENAATSFSIA 
240 250 



15. US-08-249-182-8 (1-11) 

JS0771 photosystem II oxygen-evolving complex protein 2 p 

ENTRY           JS0771 #type complete 
TITLE           photosystem II oxygen-evolving complex protein 2 precursor - 
                garden pea 
ALTERNATE_NAMES photosystem II 23K protein; photosystem II psbP protein 
ORGANISM        #formal_name Pisum sativum #common_name garden pea 
DATE            15-Jan-1993 #sequence_revision 15-Jan-1993 #text_change 
                30-Sep-1993 
ACCESSIONS      JS0771 
REFERENCE       JS0771 
   #authors     Konishi, T.; Haruta, Y.; Murase, M.; Shinohara, K.; Watanabe, 
                A. 
   #submission  submitted to JIPID, October 1992 
   #contents    Strain Alaska 
   #accession   JS0771 
      ##molecule_type mRNA 
      ##residues 1-259 ##label KON 
      ##cross-references DDBJ:D13296 
      ##note    the translation of the nucleotide sequence is not given 
                in this paper 
GENETICS 
   #gene        psbP 
CLASSIFICATION  #superfamily photosystem II oxygen-evolving complex protein 2 
KEYWORDS        chloroplast; photosynthesis; photosystem II; thylakoid 

FEATURE 
   1-73         #domain transit peptide (chloroplast) #label TNP\ 
   74-259       #protein photosystem II oxygen-evolving complex protein 
                2 #status experimental #label MAT 
SUMMARY         #length 259 #molecular-weight 28047 #checksum 3807 

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 



X 10 
GGQPLWITATK 



3AFFG0TDSEGGFDTNAVAVANILESSAPVIGGKQYYNISVLTRTADGDEGGKH9LITATVKDGKLYICKA8 
170 180 190 200 210 X 220 X 230 



ENTRY           S22763 #type complete 
TITLE           photosystem II oxygen-evolving complex protein 2 precursor - 
                wheat 
ALTERNATE_NAMES oxygen-evolving complex 23K protein 
ORGANISM        #formal_name Triticum aestivum #common_name common wheat 
DATE            12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 
                03-Feb-1994 
ACCESSIONS      S22763 
REFERENCE       S22763 
   #authors     James, H.E.; Robinson, C. 
   #journal     Plant Mol. Biol. (1991) 17:179-182 
   #title       Nucleotide sequence of cDNA encoding the precursor of the 23 
                kDa protein of the photosynthetic oxygen-evolving complex 
                from wheat. 
   #cross-references MUID:91329731 
   #accession   S22763 
      ##molecule_type mRNA 
      ##residues 1-258 ##label JAM 
      ##cross-references EMBL:X57407 
CLASSIFICATION  #superfamily photosystem II oxygen-evolving complex protein 2 
KEYWORDS        chloroplast; photosynthesis; photosystem II; thylakoid 

FEATURE 
   1-73         #domain transit peptide (chloroplast) #status predicted 
                #label TNP\ 
   74-258       #protein photosystem II oxygen-evolving complex protein 
                2 #status predicted #label MAT 
SUMMARY         #length 258 #molecular-weight 27269 #checksum 5649 

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
I! 1111 

QSYGGKTDSEGGFESDAVATANVLESSAPVVDGKflYYSITVLTRTADGDEGGKHQLITATVADGKLYVCKAS 
170 180 190 200 210 X 220 X 230 



RDKRWFKGAKKFVENAAGSFSVA 
240 250 



14. US-08-249-182-8 (1-11) 

F2T0X2 photosystem II oxygen-evolving complex protein 2 p 

ENTRY           F2T0X2 #type complete 
TITLE           photosystem II oxygen-evolving complex protein 2 precursor - 
                tomato 
ALTERNATE_NAMES photosystem II oxygen-evolving complex 23K protein 
ORGANISM        #formal_name Lycopersicon esculentum #common_name tomato 
DATE            31-Dec-1992 #sequence_revision 31-Dec-1992 #text_change 
                30-Jun-1993 
ACCESSIONS      S20872 
REFERENCE       S20872 
   #authors     Betts, S.; Pichersky, E. 
   #journal     Plant Mol. Biol. (1992) 18:995-996 
   #title       Nucleotide sequence of cDNA encoding the precursor of the 23 
                kDa photosystem II protein of tomato. 
   #cross-references MUID:92256823 
   #accession   S20872 
      ##molecule_type mRNA 
      ##residues 1-258 ##label BET 
      ##cross-references EMBL:X63007 
CLASSIFICATION  #superfamily photosystem II oxygen-evolving complex protein 2 



C48652 transfer protein spdA - Streptomyces ambofaciens p 

ENTRY           C48652 #type complete 
TITLE           transfer protein spdA - Streptomyces ambofaciens plasmid 
                pSAM2 
ORGANISM        #formal_name Streptomyces ambofaciens 
DATE            03-May-1994 #sequence_revision 03-May-1994 #text_change 
                03-May-1994 
ACCESSIONS      C48652 
REFERENCE       A48652 
   #authors     Hagege, J.; Pernodet, J.L.; Sezonov, G.; Gerbaud, C.; 
                Friedmann, A.; Guerineau, M. 
   #journal     J. Bacteriol. (1993) 175:5529-5538 
   #title       Transfer functions of the conjugative integrating element 
                pSAM2 from Streptomyces ambofaciens: characterization of a 
                kil-kor system associated with transfer. 
   #accession   C48652 
      ##status  preliminary 
      ##molecule_type DNA 
      ##residues 1-224 ##label HAG 
      ##cross-references EMBL:Z19593 
GENETICS 
   #genome      plasmid 
SUMMARY         #length 224 #molecular-weight 23575 #checksum 9089 

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
I! I III 

GPSRLAWSHFVIALVASLGANVATAGLLDLNDVPAMLRILVAAWPALAFMGGTLLAHTATHHEPEAPAPTSP 
70 80 90 100 110 X 120 X 130 

APEPPAFTDEHDLVRVDDTEEPPELPAPGLQ8APAPPAV 
140 150 160 170 



11. US-08-249-182-8 (1-11) 

A44914 phosphate-dependent exoribonuclease - Bacillus sub 

ENTRY           A44914 #type complete 
TITLE           phosphate-dependent exoribonuclease - Bacillus subtilis 
ORGANISM        #formal_name Bacillus subtilis 
DATE            17-Feb-1994 #sequence_revision 17-Feb-1994 #text_change 
                17-Feb-1994 
ACCESSIONS      A44914 
REFERENCE       A44914 
   #authors     Craven, M.G.; Henner, D.J.; Alessi, D.; Schauer, A.T.; Ost, 
                K.A.; Deutscher, M.P.; Friedman, D.I. 
   #journal     J. Bacteriol. (1992) 174:4727-4735 
   #title       Identification of the rph (RNase PH) gene of Bacillus 
                subtilis: evidence for suppression of cold-sensitive 
                mutations in Escherichia coli. 
   #cross-references MUID:92325065 
   #accession   A44914 
      ##status  preliminary 
      ##molecule_type DNA 
      ##residues 1-245 ##label CRA 
      ##cross-references NCBIP:108178 
      ##note    sequence extracted from NCBI backbone 
GENETICS 
   #gene        rph 



SEQUENCE 



inengin aw smotecuiar-ueigni, ttow »cnecKsun /vy<; 



Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
II III) 

HDGRGHDELRPI TFDLDF I SHPEGSVL I TAGNTKVI CNASVEDR VPPFLRGGGKGH I TAEYSMLPRATNGRT 
10 20 30 40 50 X 60 X 70 

IRESSKGKISGRTMEI8RLIGRALRAVVDLEKLGERTIW 
80 90 100 110 



12. US-08-249-182-8 (1-11) 

S03888 photosystem II oxygen-evolving complex protein 2 p 

ENTRY           S03888 #type fragment 
TITLE           photosystem II oxygen-evolving complex protein 2 precursor - 
                white mustard (fragment) 
ALTERNATE_NAMES photosystem II extrinsic membrane protein 23K chain 
ORGANISM        #formal_name Sinapis alba #common_name white mustard 
DATE            01-Dec-1989 #sequence_revision 01-Dec-1989 #text_change 
                31-Dec-1993 
ACCESSIONS      S03888 
REFERENCE       S03888 
   #authors     Wenng, A.; Ehmann, B.; Schaefer, E. 
   #journal     FEBS Lett. (1989) 246:140-144 
   #title       The 23 kDa polypeptide of the photosynthetic oxygen-evolving 
                complex from mustard seedlings (Sinapis alba L.). 
                Nucleotide sequence of cDNA and evidence for phytochrome 
                control of its mRNA abundance. 
   #cross-references MUID:89211386 
   #accession   S03888 
      ##molecule_type mRNA 
      ##residues 1-248 ##label WEN 
      ##cross-references EMBL:Y07498 
CLASSIFICATION  #superfamily photosystem II oxygen-evolving complex protein 2 
KEYWORDS        chloroplast; membrane protein; photosynthesis; photosystem 
                II; thylakoid 

FEATURE 
   1-62         #domain transit peptide (fragment) #label TNP\ 
   63-248       #protein photosystem II oxygen-evolving complex protein 
                2 #label MAT 
SUMMARY         #length 248 #checksum 446 

SEQUENCE 

Initial Score = 6 Optimized Score = 6 Significance = 4.08 
Residue Identity = 54% Matches = 6 Mismatches = 5 

Gaps = 0 Conservative Substitutions = 0 

X 10 
GGQPLWITATK 
II llll 

QAYFGETASEGGFDNNAVATANILETNIQDVGGKPYYYLSVLTRTADGDEGGKHQLITATVNGGKLYICKA6 
160 170 180 190 200 X 210 X 220 



AGDKRWFKGANKFVEKAATSFSVA 
230 240 



13. US-08-249-182-8 (1-11) 

S22763 photosystem II oxygen-evolving complex protein 2 



      ##status  preliminary 
      ##molecule_type protein 
      ##residues 1-114 ##label STR 
      ##cross-references NCBIP:78526; NCBIP:78523; NCBIP:78521; NCBIP:78518; 
                NCBIP:78515; NCBIP:78512; NCBIP:78511; NCBIP:78510; 
                NCBIP:78509; NCBIP:78508; NCBIP:78503 
      ##note    sequence extracted from NCBI backbone 

SUMMARY         #length 114 #checksum 7335 

SEQUENCE 

Initial Score = 16 Optimized Score = 16 Significance = 13.09 
Residue Identity = 100% Matches = 16 Mismatches = 0 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 

mmmiimii 

TEFLSNYLTNVDD I TLVPGTLGRD I EHLTSLDFFRVNSHQT VFVGYGPTFKGG8PLH I TATKSPPFENI NLY 
10 20 30 X 40 50 60 70 

YDVPWNET I PEE VTXPNYL6AEVS YPAFK 
80 90 100 



2. US-08-249-182-9 (1-16) 

A39216 plasma cell membrane protein PC-1 - human 

ENTRY           A39216 #type complete 
TITLE           plasma cell membrane protein PC-1 - human 
ORGANISM        #formal_name Homo sapiens #common_name man 
DATE            23-Aug-1991 #sequence_revision 23-Aug-1991 #text_change 
                31-Dec-1993 
ACCESSIONS      A39216 
REFERENCE       A39216 
   #authors     Buckley, M.F.; Loveland, K.A.; McKinstry, W.J.; Garson, O.M.; 
                Goding, J.W. 
   #journal     J. Biol. Chem. (1990) 265:17506-17511 
   #title       Plasma cell membrane glycoprotein PC-1. cDNA cloning of the 
                human molecule, amino acid sequence, and chromosomal 
                location. 
   #cross-references MUID:91009202 
   #accession   A39216 
      ##status  preliminary 
      ##molecule_type mRNA 
      ##residues 1-925 ##label BUC 
      ##cross-references GB:J05654 
KEYWORDS        membrane protein 

SUMMARY         #length 925 #molecular-weight 104924 #checksum 7446 

SEQUENCE 

Initial Score = 10 Optimized Score = 11 Significance = 7.48 
Residue Identity = 70% Matches = 12 Mismatches = 4 

Gaps = 1 Conservative Substitutions = 0 



X 10 X 

VNS-MQTVFVGYGPTFK 
I I II llllll II 

YLKHFLPKRLHFAKSDRIEPLTFYLDPQWQLALNPSERKYCGSGFHGSDNVFSNMQALFVGYGPGFKHGIEA 
490 500 510 520 530 540 550 X 560 



DTFENIEVYNLMCDLLNLTPAPNNGTHGSLNHLLKNPVYTPKHPK 
570 580 590 600 



3. US-08-249-182-9 (1-16) 

S21706 nucleotide pyrophosphatase - human 

ENTRY           S21706 #type complete 
TITLE           nucleotide pyrophosphatase - human 
ALTERNATE_NAMES plasma cell membrane protein PC-1 
ORGANISM        #formal_name Homo sapiens #common_name man 
DATE            12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 
                18-Jun-1993 
ACCESSIONS      S21706; S23587 
REFERENCE       S21706 
   #authors     Funakoshi, I.; Kato, H.; Horie, K.; Yano, T.; Hori, Y.; 
                Kobayashi, H.; Inoue, T.; Suzuki, H.; Fukui, S.; Tsukahara, 
                H.; Kajii, T.; Yamashina, I. 
   #journal     Arch. Biochem. Biophys. (1992) 295:180-187 
   #title       Molecular cloning of cDNAs for human fibroblast nucleotide 
                pyrophosphatase. 
   #cross-references MUID:92246539 
   #accession   S21706 
      ##molecule_type mRNA 
      ##residues 1-925 ##label FUN1 
      ##note    sequence not compared to nucleotide translation 
   #accession   S23587 
      ##molecule_type protein 

••residues" 116-121 ; 247-27 1 , ' X r , 273-275 ; 279-280 r * X r * 282-283 ; 303-3 1 6 f 
362-364 J 449-465 ! 482-525 J 529-534 , ' X ' , 536-55 1 , 9 X ' , 553 , 
' X ' , 555-556 ; 597-606 \ 9 X ' , 727-730 i 775-782 ? 840-846 ,'XX T , 
849-852^X^854-859 ##label FUN2 
##note it is uncertain whether Met-1 or Met-53 is the initiator 

GENETICS 

   #map_position 6q22-q23 
KEYWORDS        glycoprotein; membrane protein 

FEATURE 
   77-97        #domain transmembrane #status predicted #label TMM\ 
   179,285,341,477, 
   578,585,643,700, 
   731,748      #binding_site carbohydrate (Asn) (covalent) #status 
                predicted 

SUMMARY         #length 925 #molecular-weight 104924 #checksum 7446 

SEQUENCE 

Initial Score = 10 Optimized Score = 11 Significance = 7.48 
Residue Identity = 70% Matches = 12 Mismatches = 4 

Gaps = 1 Conservative Substitutions = 0 

X 10 X 

VNS-MQTVFVGYGPTFK 
I I II HUM II 

YLKHFLPKRLHFAKSDRIEPLTFYLDP8H8LALNPSERKYCGSGFHGSDNVFSNM8ALFVGYGPGFKHGIEA 
490 500 510 520 530 540 550 X 560 

DTFENIEVYNLMCDLLNLTPAPNNGTHGSLNHLLKNPVYTPKHPK 
570 580 590 600 
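
The Residue Identity figures in these alignment summaries appear to follow from the match, mismatch, and gap counts printed beside them. The sketch below assumes the percentage is matches divided by all aligned columns (matches plus mismatches plus gaps), truncated to a whole number. That rule is inferred from the numbers in this report, not taken from FastDB documentation, and the function name is hypothetical.

```python
def residue_identity_percent(matches, mismatches, gaps=0):
    """Whole-number percent identity over all aligned columns, truncated."""
    aligned = matches + mismatches + gaps
    if aligned <= 0:
        raise ValueError("alignment must contain at least one column")
    # Integer division truncates, which reproduces the printed values.
    return (100 * matches) // aligned


# Counts taken from alignment summaries in this report.
gapped_hit = residue_identity_percent(12, 4, gaps=1)   # PC-1 hits with one gap
ungapped_hit = residue_identity_percent(6, 5)          # 11-residue query hits
```

Counting the gap column in the denominator is what separates 70% from the 75% that 12 matches out of 16 query residues would otherwise give.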



4. US-08-249-182-9 (1-16) 



A27410 plasma cell membrane protein PC-1 - mouse 

ENTRY           A27410 #type complete 
TITLE           plasma cell membrane protein PC-1 - mouse 
ORGANISM        #formal_name Mus musculus #common_name house mouse 
DATE            15-Dec-1988 #sequence_revision 15-Dec-1988 #text_change 
                31-Dec-1993 
ACCESSIONS      A27410 
REFERENCE       A27410 
   #authors     van Driel, I.R.; Goding, J.W. 
   #journal     J. Biol. Chem. (1987) 262:4882-4887 
   #title       Plasma cell membrane glycoprotein PC-1. Primary structure 
                deduced from cDNA clones. 
   #cross-references MUID:87165906 
   #accession   A27410 
      ##molecule_type mRNA 
      ##residues 1-905 ##label VAN 
      ##note    the authors translated the codon CAG for residue 24 as 
                Glu 
KEYWORDS        membrane protein 

SUMMARY         #length 905 #molecular-weight 102880 #checksum 1749 

SEQUENCE 

Initial Score = 9 Optimized Score = 9 Significance = 6.55 
Residue Identity = 56% Matches = 9 Mismatches = 7 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 
II I llll II 

LKPFLPKRLHFAKSDRIEPLTFYLDP8WQLALNPSERKYCGSGFHGSDNLFSNM9ALFIGYGPAFKHGAEVD 
480 490 500 510 520 X 530 540 



SFENIEVYNLMCDLLGLIPAPNNGSHGSLNHLLKKPIYNPSHPK 
550 560 570 580 



5. US-08-249-182-9 (1-16) 

RKAALC ribulose-bisphosphate carboxylase (EC 4.1.1.39) la 

ENTRY           RKAALC #type complete 
TITLE           ribulose-bisphosphate carboxylase (EC 4.1.1.39) large chain 
                precursor - alfalfa chloroplast 
ORGANISM        #formal_name chloroplast Medicago sativa #common_name alfalfa 
DATE            30-Sep-1992 #sequence_revision 30-Sep-1992 #text_change 
                30-Jun-1993 
ACCESSIONS      A25578 
REFERENCE       A25578 
   #authors     Aldrich, J.; Cherney, B.; Merlin, E.; Palmer, J. 
   #journal     Nucleic Acids Res. (1986) 14:9535 
   #title       Sequence of the rbcL gene for the large subunit of ribulose 
                bisphosphate carboxylase-oxygenase from alfalfa. 
   #cross-references MUID:87091586 
   #accession   A25578 
      ##molecule_type protein 
      ##residues 1-474 ##label ALD 
      ##cross-references GB:X04975 
COMMENT         In addition to Lys-201, another lysine, it is not certain which, 
                may be the site of autocatalytic carbamylation. 
GENETICS 
   #gene        rbcL 
   #genome      chloroplast 
CLASSIFICATION  #superfamily ribulose-bisphosphate carboxylase large chain 
KEYWORDS        Calvin cycle; carbon dioxide fixation; carbon-carbon lyase; 
                carboxy-lyase; chloroplast; monooxygenase; 
                photorespiration; photosynthesis 

FEATURE 
   3-474        #protein ribulose-bisphosphate carboxylase large chain 
                #status predicted #label MAT\ 
   175,334      #active_site Lys (ribulose-bisphosphate-binding) #status 
                predicted\ 
   201          #binding_site carbon dioxide (Lys) (covalent) (by 
                Rubisco activase) #status predicted 
   203          #binding_site magnesium (Asp) #status predicted 
SUMMARY         #length 474 #molecular-weight 52625 #checksum 2374 

SEQUENCE 

Initial Score = 8 Optimized Score = 8 Significance = 5.61 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 
II I I II II 

TGTWTTVWTDGLTSLDRYKGRCYHIEPVAGEETSFIAYVAYPLDLFEEGSVNYNFTSIVGNVFGFKALRALR 
70 80 90 100 110 X 120 130 

LEDLRIPAAYVKTFSGPPflGIGVERDKLNKYGRPLLGCTIKPKL 
140 150 160 170 



6. US-08-249- 182-9 (1-16) 

DSNVAC superoxide dismutase (EC 1.15.1.1) (Cu-Zn) - Autog 

ENTRY           DSNVAC #type complete 
TITLE           superoxide dismutase (EC 1.15.1.1) (Cu-Zn) - Autographa 
                californica nuclear polyhedrosis virus (strain L1) 
ORGANISM        #formal_name Autographa californica nuclear polyhedrosis 
                virus, AcMNPV 
DATE            30-Jun-1992 #sequence_revision 30-Jun-1992 #text_change 
                24-Feb-1994 
ACCESSIONS      A40564 
REFERENCE       A40564 
   #authors     Tomalski, M.D.; Eldridge, R.; Miller, L.K. 
   #journal     Virology (1991) 184:149-161 
   #title       A baculovirus homolog of a Cu/Zn superoxide dismutase gene. 
   #cross-references MUID:91335744 
   #accession   A40564 
      ##molecule_type DNA 
      ##residues 1-151 ##label TOM 
      ##cross-references GB:M68862 
CLASSIFICATION  #superfamily superoxide dismutase (Cu-Zn) 
KEYWORDS        oxidoreductase 

SUMMARY         #length 151 #molecular-weight 16182 #checksum 4639 

SEQUENCE 

Initial Score = 7 Optimized Score = 7 Significance = 4.68 
Residue Identity = 43% Matches = 7 Mismatches = 9 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 
II I I III 

HEYGDTSNGCTSAGEHFNPTNEDHGAPDAEIRHVGDLGNIKSAGYNSLTEVNMMDNVHSLYGPHNIIGRSLV 
50 60 70 80 90 X 100 110 



VHTDKDDLGLTDHPLSKTTGNSGGRLGCGIIAICK 
120 130 140 150 



7. US-08-249-182-9 (1-16) 



A35216 FPD4 protein - fowlpox virus (strain FP-1) 

ENTRY           A35216 #type complete 
TITLE           FPD4 protein - fowlpox virus (strain FP-1) 
ORGANISM        #formal_name fowlpox virus 
DATE            23-Aug-1991 #sequence_revision 23-Aug-1991 #text_change 
                30-Sep-1993 
ACCESSIONS      A35216 
REFERENCE       A35216 
   #authors     Tartaglia, J.; Winslow, J.; Goebel, S.; Johnson, G.P.; 
                Taylor, J.; Paoletti, E. 
   #journal     J. Gen. Virol. (1990) 71:1517-1524 
   #title       Nucleotide sequence analysis of a 10.5 kbp HindIII fragment 
                of fowlpox virus: relatedness to the central portion of the 
                vaccinia virus HindIII D region. 
   #cross-references MUID:90324937 
   #accession   A35216 
      ##status  preliminary 
      ##molecule_type DNA 
      ##residues 1-218 ##label TAR 
      ##cross-references GB:X17202 
CLASSIFICATION  #superfamily vaccinia virus D4 protein 
SUMMARY         #length 218 #molecular-weight 25563 #checksum 5203 

SEQUENCE 

Initial Score = 7 Optimized Score = 8 Significance = 4.68 
Residue Identity = 50% Matches = 8 Mismatches = 8 

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK 
II II III I 

NYYLSCREGEAKSHKIFUERLADVFINHIAAYVSVFYFLGKSDFSNFRSILNSPTTVVVGYHPAARNRQFDT 
120 130 140 150 160 170 180 X 190 

DETFEIVNTLLELKNEPRINWVQGFEI 
200 210 



8. US-08-249-182-9 (1-16) 

A24637 T-cell surface glycoprotein CD8 precursor - rat 

ENTRY           A24637 #type complete 
TITLE           T-cell surface glycoprotein CD8 precursor - rat 
ALTERNATE_NAMES MRC OX-8 antigen 
ORGANISM        #formal_name Rattus norvegicus #common_name Norway rat 
DATE            17-Sep-1987 #sequence_revision 17-Sep-1987 #text_change 
                31-Dec-1993 
ACCESSIONS      A24637 
REFERENCE       A24637 
   #authors     Johnson, P.; Gagnon, J.; Barclay, A.N.; Williams, A.F. 
   #journal     EMBO J. (1985) 4:2539-2545 
   #title       Purification, chain separation and sequence of the MRC OX-8 
                antigen, a marker of rat cytotoxic T lymphocytes. 
   #cross-references MUID:86030231 
   #accession   A24637 
      ##molecule_type mRNA 
      ##residues 1-236 ##label JOH 
CLASSIFICATION  #superfamily immunoglobulin V region; immunoglobulin homology 
KEYWORDS        glycoprotein 

SUMMARY         #length 236 #molecular-weight 26196 #checksum 3912 

SEQUENCE 

Initial Score = 7 Optimized Score = 7 Significance = 4.68 
Residue Identity = 43% Matches = 7 Mismatches = 9 

Gaps = 0 Conservative Substitutions = 0 



X 10 X 

VNSMQTVFVGYGPTFK 
III I I II 

DPNLFSARKENNKYILTLSKFSTKN9GYYFCSITSNSVMYFSPLVPVFQKVNSIITKPVTRAPTPVPPPTGT 
90 100 110 120 130 140 150 X 160 



PRPLRPEACRPGASGSVEGMGLGFACDIYIWAPLAGICAVLLLS 
170 180 190 200 



9. US-08-249-132-9 (1-16) 

A45442 Secl3p=endoplasnic reticulum vesicle formation pro 



ENTRY A45442 #type complete 

TITLE Secl3p=endoplasnic reticulum vesicle formation protein - 

yeast (Saccharomyces cerevisiae) 
ORGANISM #formal_name Saccharonyces cerevisiae 

DATE 21-Sep-1993; #sequence_revision 21-Sep-1993» #text_change 

21-Sep-1993 
ACCESSIONS A45442 
REFERENCE A45442 

^authors Pryer, N.K.; Salanar N.R.; Schekman. R.? Kaiser, C.A. 
tjournal J. Cell Biol. (1993) 120:865-875 

#title Cytosolic Secl3p complex is required for vesicle formation 

from the endoplasmic reticulum in vitro, 
#cross-references HUID:93163112 
taccession A45442 

##status preliminary 
#tmolecule_type nucleic acid 
^residues" 1-297 tSlabel PRY 
Stcross-references NCBIN? 124845; NCBIP: 124846 
##note sequence extracted from NCBI backbone 

SUMMARY #length 297 #molecular-ueight 33043 ichecksum 717 

SEOUENCE 



Initial Score = 7 Optimized Score = 7 Significance = 4.68
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0



X 10 X 

VNSMQTVFVGYGPTFK
III I III

HEGPVWRVDWAHPKFGTILASCSYDGKVLIWKEENGRWSQIAVHAVHSASVNSVQWAPHEYGPLLLVASSDG
60 70 80 90 100 X 110 120

KVSVVEFKENGTTSPIIIDAHAIGVNSASWAPATIEEDGEHNGT
130 140 150 160 



10. US-08-249-182-9 (1-16)

S30803 SEC13 protein - yeast (Saccharomyces cerevisiae)

ENTRY S30803 #type complete
TITLE SEC13 protein - yeast (Saccharomyces cerevisiae)
ORGANISM #formal_name Saccharomyces cerevisiae
DATE 28-May-1993 #sequence_revision 28-May-1993 #text_change
06-May-1994
ACCESSIONS S30803
REFERENCE S30803
#authors Pryer, N.K.; Salama, N.R.; Schekman, R.; Kaiser, C.A.
#submission submitted to the EMBL Data Library, November 1992
#description Cytosolic Sec13p complex is required for vesicle formation
from the endoplasmic reticulum in vitro.
#accession S30803
##molecule_type DNA
##residues 1-297 ##label PRY
##cross-references EMBL:L05929
GENETICS
#gene LISTA:SEC13
SUMMARY #length 297 #molecular-weight 33043 #checksum 717

SEQUENCE

Initial Score = 7 Optimized Score = 7 Significance = 4.68
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0



X 10 X 

VNSMQTVFVGYGPTFK
III I III

HEGPVWRVDWAHPKFGTILASCSYDGKVLIWKEENGRWSQIAVHAVHSASVNSVQWAPHEYGPLLLVASSDG
60 70 80 90 100 X 110 120 

KVSVVEFKENGTTSPIIIDAHAIGVNSASWAPATIEEDGEHNGT 
130 140 150 160 



11. US-08-249-182-9 (1-16)

S06938 glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - uni

ENTRY S06938 #type complete
TITLE glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - unidentified
bacterium
ORGANISM #formal_name unidentified bacterium
DATE 07-Sep-1990 #sequence_revision 07-Sep-1990 #text_change
31-Dec-1993
ACCESSIONS S06938
REFERENCE S06938
#authors Cock, J.M.; Schmidt, R.R.
#journal Nucleic Acids Res. (1989) 17:10500
#title A glutamate dehydrogenase gene sequence.
#cross-references MUID:90098893
#accession S06938
##molecule_type DNA
##residues 1-446 ##label COC
##cross-references EMBL:X16399
##note the translation of the nucleotide sequence is not given
in this paper
CLASSIFICATION #superfamily glutamate dehydrogenase (NAD(P)+)
KEYWORDS NADP; oxidoreductase

SUMMARY #length 446 #molecular-weight 48490 #checksum 8711

SEQUENCE 



Initial Score = 7 Optimized Score = 7 Significance = 4.68
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0



X 10 X 

VNSMQTVFVGYGPTFK
II I I III

LERLVEPERIIQFRVSWVDDRGQVQVNRAFRVQFNSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120

PMGGGKGGSDFDPKGKSQGRIMRFCQALMTELYRHLGPDTDVPA
130 140 150 160 



12. US-08-249-182-9 (1-16) 

A33504 glutamate dehydrogenase (EC 1.4.1.2) - Salmonella

ENTRY A33504 #type complete
TITLE glutamate dehydrogenase (EC 1.4.1.2) - Salmonella typhimurium
ORGANISM #formal_name Salmonella typhimurium
DATE 08-Dec-1989 #sequence_revision 08-Dec-1989 #text_change
30-Sep-1993
ACCESSIONS A33504
REFERENCE A33504
#authors Bansal, A.; Dayton, M.A.; Zalkin, H.; Colman, R.F.
#journal J. Biol. Chem. (1989) 264:9827-9835
#title Affinity labeling of a glutamyl peptide in the coenzyme
binding site of NADP(+)-specific glutamate dehydrogenase of
Salmonella typhimurium by 2-[(4-bromo-2,
3-dioxobutyl)thio]-1,N(6)-ethenoadenosine 2',
5'-bisphosphate.
#cross-references MUID:89255551
#accession A33504
##status preliminary
##molecule_type DNA
##residues 1-447 ##label BAN
##cross-references GB:M24021; GB:J04814
CLASSIFICATION #superfamily glutamate dehydrogenase (NAD(P)+)
KEYWORDS oxidoreductase

SUMMARY #length 447 #molecular-weight 48574 #checksum 5173

SEQUENCE 



Initial Score = 7 Optimized Score = 7 Significance = 4.68
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0



X 10 X 

VNSMQTVFVGYGPTFK 
II I I III 

LERLVEPERVIQFRVVWLDDKNQVQVNRAWRVQFNSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120 



PMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGPDTDVPA 
130 140 150 160 



13. US-08-249-182-9 (1-16)

A22413 glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - Esc

ENTRY A22413 #type complete
TITLE glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - Escherichia
coli
ALTERNATE_NAMES NADP-specific glutamate dehydrogenase
ORGANISM #formal_name Escherichia coli
DATE 30-Jun-1988 #sequence_revision 30-Jun-1988 #text_change
31-Dec-1993
ACCESSIONS A22413
REFERENCE A22413
#authors Valle, F.; Becerril, B.; Chen, E.; Seeburg, P.; Heyneker, H.;
Bolivar, F.
#journal Gene (1984) 27:193-199
#title Complete nucleotide sequence of the glutamate dehydrogenase
gene from Escherichia coli K-12.
#cross-references MUID:84209849
#contents Strain K12
#accession A22413
##molecule_type DNA
##residues 1-447 ##label VAL
CLASSIFICATION #superfamily glutamate dehydrogenase (NAD(P)+)
KEYWORDS NADP; oxidoreductase

SUMMARY #length 447 #molecular-weight 48581 #checksum 2843

SEQUENCE 



Initial Score = 7 Optimized Score = 7 Significance = 4.68 
Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0 



X 10 X 

VNSMQTVFVGYGPTFK 
II I I III 

LERLVEPERVIQFRVVWVDDRNQIQVNRAWRVQFSSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120 



PMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGADTDVPA 
130 140 150 160 



14. US-08-249-182-9 (1-16) 



DEECEN glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - Esc

ENTRY DEECEN #type complete
TITLE glutamate dehydrogenase (NADP+) (EC 1.4.1.4) - Escherichia
coli
ALTERNATE_NAMES NADP-specific glutamate dehydrogenase
ORGANISM #formal_name Escherichia coli
DATE 03-Aug-1984 #sequence_revision 20-Sep-1984 #text_change
31-Dec-1993
ACCESSIONS A00382
REFERENCE A00382
#authors McPherson, M.J.; Wootton, J.C.
#journal Nucleic Acids Res. (1983) 11:5257-5266
#title Complete nucleotide sequence of the Escherichia coli gdhA
gene.
#cross-references MUID:83272967
#accession A00382
##molecule_type DNA
##residues 1-447 ##label MCP
COMMENT This enzyme is a hexamer of identical chains.
GENETICS
#gene gdhA
#map_position 27 min
CLASSIFICATION #superfamily glutamate dehydrogenase (NAD(P)+)
KEYWORDS oxidoreductase

SUMMARY #length 447 #molecular-weight 48581 #checksum 2843

SEQUENCE 



Initial Score = 7 Optimized Score = 7 Significance = 4.68
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0



X 10 X 

VNSMQTVFVGYGPTFK
II I I III

LERLVEPERVIQFRVVWVDDRNQIQVNRAWRVQFSSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120

PMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGADTDVPA
130 140 150 160 



15. US-08-249-182-9 (1-16) 

APBOL leucyl aminopeptidase (EC 3.4.11.1) - bovine

ENTRY APBOL #type complete
TITLE leucyl aminopeptidase (EC 3.4.11.1) - bovine
ALTERNATE_NAMES cytosol aminopeptidase
ORGANISM #formal_name Bos primigenius taurus #common_name cattle
DATE 31-Aug-1980 #sequence_revision 31-Aug-1980 #text_change
31-Dec-1993
ACCESSIONS A00907
REFERENCE A92380
#authors Cuypers, H.T.; van Loon-Klaassen, L.A.H.; Vree Egberts,
W.T.M.; de Jong, W.W.; Bloemendal, H.
#journal J. Biol. Chem. (1982) 257:7077-7085
#title The primary structure of leucine aminopeptidase from bovine
eye lens.
#cross-references MUID:82213853
#accession A00907
##molecule_type protein
##residues 1-478 ##label CUY
REFERENCE A92381
#authors Cuypers, H.T.; van Loon-Klaassen, L.A.H.; Vree Egberts,
W.T.M.; de Jong, W.W.; Bloemendal, H.
#journal J. Biol. Chem. (1982) 257:7086-7091
#title Sulfhydryl content of bovine eye lens leucine aminopeptidase.
Determination of the reactivity of the sulfhydryl groups of
the zinc metalloenzyme, of the enzyme activated by Mg(2+),
Mn(2+), and Co(2+), and of the metal-free apoenzyme.
#cross-references MUID:82213854
#contents annotation
#note no disulfide bonds are present
COMMENT This protein, isolated from calf lens, is a hexamer of identical
chains.
CLASSIFICATION #superfamily cytosol aminopeptidase
KEYWORDS alpha-aminoacylpeptide hydrolase

SUMMARY #length 478 #molecular-weight 51691 #checksum 5529

SEQUENCE 

Initial Score = 7 Optimized Score = 7 Significance = 4.68 
Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMQTVFVGYGPTFK
IIII I II

NLKSASIKTDVFIRPKSWIEEQEMGSFLSVAKGSEEPPVFLEIHYKGSPNASEPPLVFVGKGITFDSGGISI
190 200 210 220 230 240 250 X 260

KAAANMDLMRADMGGAATICSAIVSAAKLDLPINIVGLAPLCEN
270 280 290 300 
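
The summary figures printed with each alignment (Matches, Mismatches, Gaps, Residue Identity) can be re-derived from the two aligned strings. The sketch below is only an illustration and is not FastDB's own code: the function name `alignment_stats` is hypothetical, and two conventions are assumed because they reproduce the printed values, namely that identity is the number of matches divided by the number of alignment columns and that the percentage is truncated to a whole number. The 16-residue subject window is read off the bovine leucyl aminopeptidase alignment above.

```python
def alignment_stats(query: str, subject: str, gap: str = "-") -> dict:
    """Count matches, mismatches and gap columns in a pairwise alignment.

    Both strings must already be aligned, i.e. have equal length, with
    `gap` marking inserted positions.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")

    matches = mismatches = gaps = 0
    for q, s in zip(query, subject):
        if q == gap or s == gap:
            gaps += 1
        elif q == s:
            matches += 1
        else:
            mismatches += 1

    # Assumption: identity = matches / alignment columns, truncated.
    identity_pct = (100 * matches) // len(query)
    return {
        "matches": matches,
        "mismatches": mismatches,
        "gaps": gaps,
        "identity_pct": identity_pct,
    }


if __name__ == "__main__":
    # Query US-08-249-182-9 (1-16) against the APBOL window it aligns to.
    print(alignment_stats("VNSMQTVFVGYGPTFK", "ASEPPLVFVGKGITFD"))
```

For this pair the function returns 7 matches, 9 mismatches, 0 gaps and 43% identity, which agrees with the figures printed for hit 15.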

IntelliGenetics

FastDB - Fast Pairwise Comparison of Sequences
Release 5.4

Results file u249_9s.res made by on Thu 22 Sep 94 10:40:14-PDT.



Query sequence being compared: US-08-249-182-9 (1-16)
Number of sequences searched: 36000
Number of scores above cutoff: 4370

Results of the initial comparison of US-08-249-182-9 (1-16) with:
Data bank : Swiss-Prot 28, all entries

[Histogram: NUMBER OF SEQUENCES (logarithmic scale, 0 to 100000) versus SCORE (0 to 10) and STDEV (-1 to 7)]



PARAMETERS

Similarity matrix         Unitary   K-tuple             2
Mismatch penalty          1         Joining penalty     20
Gap penalty               1.00      Window size         5
Gap size penalty          0.05
Cutoff score              0
Randomization group       0

Initial scores to save    40        Alignments to save  15
Optimized scores to save  0         Display context     50

SEARCH STATISTICS

Scores:  Mean   Median   Standard Deviation
         2      3        0.96

Times:   CPU            Total Elapsed
         00:00:51.91    00:00:58.00

Number of residues:             12496420
Number of sequences searched:   36000
Number of scores above cutoff:  4370

Cut-off raised to 2.
Cut-off raised to 3.
Cut-off raised to 4.
Cut-off raised to 5.



The scores below are sorted by initial score.
Significance is calculated based on initial score. 
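
The Sig. column appears to be the number of standard deviations by which an initial score exceeds the mean score of the search. The sketch below is a hedged illustration under that assumption, not FastDB's documented formula; the function names `significance` and `sd_band` are hypothetical. Because the report prints the mean (2) and standard deviation (0.96) in rounded form, the computed values only approximate the printed ones (for example 8.29 for a score of 10).

```python
import math


def significance(score: float, mean: float, sd: float) -> float:
    """Standard deviations by which `score` lies above `mean`."""
    if sd <= 0:
        raise ValueError("standard deviation must be positive")
    return (score - mean) / sd


def sd_band(sig: float) -> int:
    """Whole number of standard deviations above the mean.

    Assumed to correspond to the '**** N standard deviations above
    mean ****' separators in the list of best scores.
    """
    return math.floor(sig)


if __name__ == "__main__":
    MEAN, SD = 2.0, 0.96  # rounded values from the search statistics
    for score in (10, 9, 8, 7, 6):
        sig = significance(score, MEAN, SD)
        print(f"score {score:2d}  sig ~ {sig:.2f}  band {sd_band(sig)}")
```

With the rounded inputs, scores of 10, 9, 8, 7 and 6 fall into bands 8, 7, 6, 5 and 4, the same grouping used in the list of best scores.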



A 100% identical sequence to the query sequence was not found. 



The list of best scores is:

                                                       Init.  Opt.
     Sequence Name Description                  Length Score Score   Sig. Frame

               **** 8 standard deviations above mean ****
  1. PC1_HUMAN  PLASMA-CELL MEMBRANE GLYCOPRO     873    10    11   8.29     0
               **** 7 standard deviations above mean ****
  2. PC1_MOUSE  PLASMA-CELL MEMBRANE GLYCOPRO     871     9     9   7.26     0
               **** 6 standard deviations above mean ****
  3. RBL_MEDSA  RIBULOSE BISPHOSPHATE CARBOXY     474     8     8   6.22     0
               **** 5 standard deviations above mean ****
  4. SODC_NPVAC PUTATIVE SUPEROXIDE DISMUTASE     151     7     7   5.18     0
  5. UNG_FOWP1  URACIL-DNA GLYCOSYLASE (EC 3.     218     7     8   5.18     0
  6. CD8A_RAT   T-CELL SURFACE GLYCOPROTEIN C     236     7     7   5.18     0
  7. SC13_YEAST PROTEIN TRANSPORT PROTEIN SEC     297     7     7   5.18     0
  8. DHE4_UNKP  NADP-SPECIFIC GLUTAMATE DEHYD     446     7     7   5.18     0
  9. DHE4_SALTY NADP-SPECIFIC GLUTAMATE DEHYD     447     7     7   5.18     0
 10. DHE4_ECOLI NADP-SPECIFIC GLUTAMATE DEHYD     447     7     7   5.18     0
 11. AMPL_BOVIN CYTOSOL AMINOPEPTIDASE (EC 3.     478     7     7   5.18     0
 12. HEMA_PI3B  HEMAGGLUTININ-NEURAMINIDASE (     572     7     7   5.18     0
 13. NAK1_HUMAN EARLY RESPONSE PROTEIN NAK1.      598     7     7   5.18     0
 14. TOXA_PSEAE EXOTOXIN A PRECURSOR (NAD-DEP     638     7     7   5.18     0
 15. FEPA_ECOLI FERRIC ENTEROCHELIN RECEPTOR      745     7     7   5.18     0
 16. VG43_HSVI1 HYPOTHETICAL GENE 43 PROTEIN.     891     7     7   5.18     0
 17. VGL2_IBVM  E2 GLYCOPROTEIN PRECURSOR (SP    1162     7     7   5.18     0
 18. VGL2_IBVK  E2 GLYCOPROTEIN PRECURSOR (SP    1162     7     7   5.18     0
 19. TYK2_HUMAN NON-RECEPTOR TYROSINE-PROTEIN    1187     7     7   5.18     0
 20. VOR1_FXMV  152 KD PROTEIN (ORF 1).          1335     7     7   5.18     0
 21. MPRI_BOVIN CATION-INDEPENDENT MANNOSE-6-    2499     7     7   5.18     0
               **** 4 standard deviations above mean ****
 22. Y601_BPT4  HYPOTHETICAL 8.5 KD PROTEIN I      70     6           4.15     0
 23. YVDB_VACCV HYPOTHETICAL 8.5 KD PROTEIN.       80     6           4.15     0
 24. YVDB_VACCC HYPOTHETICAL 8.5 KD PROTEIN.       80     6           4.15     0
 25. Y242_BPT4  HYPOTHETICAL 11.0 KD PROTEIN       92     6           4.15     0
 26. PLAS_PETCR PLASTOCYANINS A AND B.             97     6     7   4.15     0
 27. PLAS_ORYSA PLASTOCYANIN.                      97     6           4.15     0
 28. PLAS_CUCSA PLASTOCYANIN.                      99     6     7   4.15     0
 29. ISS_ECOLI  HYPOTHETICAL ISS PROTEIN.         102     6           4.15     0
 30. K2M1_SHEEP KERATIN, TYPE II MICROFIBRIL      109     6           4.15     0
 31. HV35_MOUSE IG HEAVY CHAIN V-III REGION (     111     6           4.15     0
 32. HV34_MOUSE IG HEAVY CHAIN V REGION (AH PC    113     6           4.15     0
 33. HV31_MOUSE IG HEAVY CHAIN V-III REGION (     113     6           4.15     0
 34. HV30_MOUSE IG HEAVY CHAIN V-III REGION (     113     6           4.15     0
 35. HV29_MOUSE IG HEAVY CHAIN V-III REGION (     113     6           4.15     0
 36. HV28_MOUSE IG HEAVY CHAIN V-III REGION (     113     6           4.15     0
 37. HV27_MOUSE IG HEAVY CHAIN V-III REGION (     113     6           4.15     0
 38. HV33_MOUSE IG HEAVY CHAIN V-III REGION (     115     6           4.15     0
 39. HV32_MOUSE IG HEAVY CHAIN V-III REGION (     115     6           4.15     0
 40. SFP1_BOVIN SEMINAL PLASMA PROTEIN PDC-10     134     6           4.15     0



1. US-08-249-182-9 (1-16) 

PC1_HUMAN PLASMA-CELL MEMBRANE GLYCOPROTEIN PC-1 (ALKALINE P

ID PC1_HUMAN STANDARD; PRT; 873 AA.
AC P22413;
DT 01-AUG-1991 (REL. 19, CREATED)
DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE PLASMA-CELL MEMBRANE GLYCOPROTEIN PC-1 (ALKALINE PHOSPHODIESTERASE I
DE (EC 3.1.4.1) / NUCLEOTIDE PYROPHOSPHATASE (EC 3.6.1.9)).
OS HOMO SAPIENS (HUMAN).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; PRIMATES.
RN [1]
RP SEQUENCE FROM N.A.
RM 91009202
RA BUCKLEY M.F., LOVELAND K.A., MCKINSTRY W.J., GARSON O.M., GODING J.W.;
RL J. BIOL. CHEM. 265:17506-17511(1990).
CC -!- FUNCTION: MAY HAVE A ROLE IN THE REGULATION OF N-GLYCOSYLATION.
CC -!- CATALYTIC ACTIVITY: HYDROLYTICALLY REMOVES 5'-NUCLEOTIDES
CC SUCCESSIVELY FROM THE 3'-HYDROXY TERMINI OF 3'-HYDROXY-TERMINATED
CC OLIGONUCLEOTIDES.
CC -!- CATALYTIC ACTIVITY: A DINUCLEOTIDE + H(2)O = 2 MONONUCLEOTIDE.
CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED.
CC -!- SUBCELLULAR LOCATION: TYPE II MEMBRANE PROTEIN.
CC -!- TISSUE SPECIFICITY: EXPRESSED IN PLASMA CELLS AND ALSO IN A NUMBER
CC OF NON-LYMPHOID TISSUES, INCLUDING THE DISTAL CONVOLUTED TUBULE
CC OF THE KIDNEY, CHONDROCYTES, AND EPIDIDYMIS.
CC -!- SIMILARITY: CONTAINS TWO TANDEM COPIES OF A SOMATOMEDIN-B TYPE
CC DOMAIN.
DR EMBL; M57736; HSPC1Q1.
DR PIR; A39216; A39216.
DR MIM; 173335; TENTH EDITION.
DR PROSITE; PS00524; SOMATOMEDIN_B.
KW GLYCOPROTEIN; TRANSMEMBRANE; DUPLICATION; SIGNAL-ANCHOR; HYDROLASE.



FT DOMAIN 1 24 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 25 45 SIGNAL-ANCHOR (TYPE-II MEMBRANE PROTEIN).
FT DOMAIN 46 873 EXTRACELLULAR (POTENTIAL).
FT DOMAIN 52 92 SOMATOMEDIN-B LIKE.
FT DOMAIN 93 136 SOMATOMEDIN-B LIKE.
FT CARBOHYD 127 127 POTENTIAL.
FT CARBOHYD 233 233 POTENTIAL.
FT CARBOHYD 289 289 POTENTIAL.
FT CARBOHYD 425 425 POTENTIAL.
FT CARBOHYD 533 533 POTENTIAL.
FT CARBOHYD 591 591 POTENTIAL.
FT CARBOHYD 648 648 POTENTIAL.
FT CARBOHYD 679 679 POTENTIAL.
FT CARBOHYD 696 696 POTENTIAL.
SQ SEQUENCE 873 AA; 99929 MW; 4095996 CN;

Initial Score = 10 Optimized Score = 11 Significance = 8.29
Residue Identity = 70% Matches = 12 Mismatches = 4
Gaps = 1 Conservative Substitutions = 0

X 10 X

VNS-MQTVFVGYGPTFK
I I II IIIIII II

YLKHFLPKRLHFAKSDRIEPLTFYLDPQWQLALNPSERKYCGSGFHGSDNVFSNMQALFVGYGPGFKHGIEA
440 450 460 470 480 490 500 X 

DTFENIEVYNLMCDLLNLTPAPNNGTHGSLNHLLKNPVYTPKHPK 
510 520 530 540 550 



2. US-08-249-182-9 (1-16) 

PC1_MOUSE PLASMA-CELL MEMBRANE GLYCOPROTEIN PC-1 (ALKALINE P

ID PC1_MOUSE STANDARD; PRT; 871 AA.
AC P06802;
DT 01-JAN-1988 (REL. 06, CREATED)
DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE PLASMA-CELL MEMBRANE GLYCOPROTEIN PC-1 (ALKALINE PHOSPHODIESTERASE I
DE (EC 3.1.4.1) / NUCLEOTIDE PYROPHOSPHATASE (EC 3.6.1.9)) (LY-41).
OS MUS MUSCULUS (MOUSE).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RM 87165906
RA VAN DRIEL I.R., GODING J.W.;
RL J. BIOL. CHEM. 262:4882-4887(1987).
RN [2]
RP PARTIAL SEQUENCE.
RM 85056299
RA STEARNE P.A., VAN DRIEL I.R., GREGO B., SIMPSON R.J., GODING J.W.;
RL J. IMMUNOL. 134:443-448(1985).
RN [3]
RP FUNCTION, AND SEQUENCE FROM N.A.
RM 91271356
RA REBBE N.F., TONG B.D., FINLEY E.M., HICKMAN S.;
RL PROC. NATL. ACAD. SCI. U.S.A. 88:5192-5196(1991).
CC -!- FUNCTION: MAY HAVE A ROLE IN THE REGULATION OF N-GLYCOSYLATION.
CC -!- CATALYTIC ACTIVITY: HYDROLYTICALLY REMOVES 5'-NUCLEOTIDES
CC SUCCESSIVELY FROM THE 3'-HYDROXY TERMINI OF 3'-HYDROXY-TERMINATED
CC OLIGONUCLEOTIDES.
CC -!- CATALYTIC ACTIVITY: A DINUCLEOTIDE + H(2)O = 2 MONONUCLEOTIDE.
CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED.
CC -!- SUBCELLULAR LOCATION: TYPE II MEMBRANE PROTEIN.
CC -!- TISSUE SPECIFICITY: SELECTIVELY EXPRESSED ON THE SURFACE OF
CC ANTIBODY-SECRETING CELLS.
CC -!- SIMILARITY: CONTAINS TWO TANDEM COPIES OF A SOMATOMEDIN-B TYPE
CC DOMAIN.
DR EMBL; J02700; MNPC1B.
DR PIR; A27410; A27410.
DR PROSITE; PS00524; SOMATOMEDIN_B.
KW GLYCOPROTEIN; TRANSMEMBRANE; DUPLICATION; SIGNAL-ANCHOR; HYDROLASE.
FT MOD_RES ?1 ?1 BLOCKED.
FT DOMAIN 1 24 CYTOPLASMIC (POTENTIAL).
FT TRANSMEM 25 45 SIGNAL-ANCHOR (TYPE-II MEMBRANE
FT DOMAIN 46 871 EXTRACELLULAR (POTENTIAL).
FT DOMAIN 52 92 SOMATOMEDIN-B LIKE.
FT DOMAIN 93 136 SOMATOMEDIN-B LIKE.
FT CARBOHYD 127 127 POTENTIAL.
FT CARBOHYD 233 233 POTENTIAL.
FT CARBOHYD 289 289 POTENTIAL.
FT CARBOHYD 425 425 POTENTIAL.
FT CARBOHYD 533 533 POTENTIAL.
FT CARBOHYD 590 590 POTENTIAL.
SQ SEQUENCE 871 AA; 99487 MW; 4094440 CN;

Initial Score = 9 Optimized Score = 9 Significance = 7.26
Residue Identity = 56% Matches = 9 Mismatches = 7
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
II I IIII II

LKPFLPKRLHFAKSDRIEPLTFYLDPQWQLALNPSERKYCGSGFHGSDNLFSNMQALFIGYGPAFKHGAEVD
440 450 460 470 480 490 500 X 510

SFENIEVYNLMCDLLGLIPAPNNGSHGSLNHLLKKPIYNPSHPK
520 530 540 550




3. US-08-249-182-9 (1-16) 

RBL_MEDSA RIBULOSE BISPHOSPHATE CARBOXYLASE LARGE CHAIN PREC

ID RBL_MEDSA STANDARD; PRT; 474 AA.
AC P04991;
DT 13-AUG-1987 (REL. 05, CREATED)
DT 13-AUG-1987 (REL. 05, LAST SEQUENCE UPDATE)
DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE RIBULOSE BISPHOSPHATE CARBOXYLASE LARGE CHAIN PRECURSOR (EC 4.1.1.39).
GN RBCL.
OS MEDICAGO SATIVA (ALFALFA).
OG CHLOROPLAST.
OC EUKARYOTA; PLANTA; EMBRYOPHYTA; ANGIOSPERMAE; DICOTYLEDONEAE; FABALES;
OC FABACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=CV. REGEN S;
RM 87091586
RA ALDRICH J., CHERNEY B., MERLIN E., PALMER J.;
RL NUCLEIC ACIDS RES. 14:9535-9535(1986).
CC -!- FUNCTION: RUBISCO, A MAJOR COMPONENT OF LEAF PROTEIN, CATALYSES
CC TWO REACTIONS: THE CARBOXYLATION OF D-RIBULOSE 1,5-BISPHOSPHATE,
CC THE PRIMARY EVENT IN PHOTOSYNTHETIC CARBON DIOXIDE FIXATION, AS
CC WELL AS THE OXIDATIVE FRAGMENTATION OF THE PENTOSE SUBSTRATE IN
CC THE PHOTORESPIRATION PROCESS. BOTH REACTIONS OCCUR SIMULTANEOUSLY
CC AND IN COMPETITION AT THE SAME ACTIVE SITE.
CC -!- CATALYTIC ACTIVITY: D-RIBULOSE 1,5-BISPHOSPHATE + CO(2) =
CC 2 3-PHOSPHO-D-GLYCERATE.
CC -!- CATALYTIC ACTIVITY: D-RIBULOSE 1,5-BISPHOSPHATE + O(2) =
CC 3-PHOSPHO-D-GLYCERATE + 2-PHOSPHOGLYCOLATE.
CC -!- SUBUNIT: 8 LARGE CHAINS + 8 SMALL CHAINS.
CC -!- SUBCELLULAR LOCATION: CHLOROPLAST.
DR EMBL; X04975; CHMSRBCL.
DR PIR; A25578; RKAALC.
DR PROSITE; PS00157; RUBISCO_LARGE.
KW PHOTOSYNTHESIS; CARBON DIOXIDE FIXATION; PHOTORESPIRATION;
KW LYASE; OXIDOREDUCTASE; MONOOXYGENASE; CHLOROPLAST; ACETYLATION.
FT PROPEP 1 2 BY SIMILARITY.
FT CHAIN 3 474 RUBISCO LARGE CHAIN.
FT MOD_RES 3 3 ACETYLATION (BY SIMILARITY).
FT ACT_SITE 201 201 BINDING OF CO(2) ACTIVATES THE ENZYME.
SQ SEQUENCE 474 AA; 52626 MW; 1066493 CN;

Initial Score = 8 Optimized Score = 8 Significance = 6.22
Residue Identity = 50% Matches = 8 Mismatches = 8
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
II II II II 

TGTWTTVWTDGLTSLDRYKGRCYHIEPVAGEETSFIAYVAYPLDLFEEGSVNYMFTSIVGNVFGFKALRALR 
70 80 90 100 110 X 120 130 

LEDLRIPAAYVKTFQGPPQGIQVERDKLNKYGRPLLGCTIKPKL 
140 150 160 170 



4. US-08-249-182-9 (1-16)

SODC_NPVAC PUTATIVE SUPEROXIDE DISMUTASE (CU-ZN) (EC 1.15.1.1

ID SODC_NPVAC STANDARD; PRT; 151 AA.
AC P24705;
DT 01-MAR-1992 (REL. 21, CREATED)
DT 01-MAR-1992 (REL. 21, LAST SEQUENCE UPDATE)
DT 01-AUG-1992 (REL. 23, LAST ANNOTATION UPDATE)
DE PUTATIVE SUPEROXIDE DISMUTASE (CU-ZN) (EC 1.15.1.1).
GN SOD.
OS AUTOGRAPHA CALIFORNICA NUCLEAR POLYHEDROSIS VIRUS (ACMNPV).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; BACULOVIRIDAE; EUBACULOVIRINAE.
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=L1;
RM 91335744
RA TOMALSKI M.D., ELDRIDGE R., MILLER L.K.;

CC -!- FUNCTION: NONESSENTIAL FOR NORMAL VIRUS REPLICATION. COULD BE
CC EITHER NON-FUNCTIONAL OR WITH A LOW ACTIVITY.
CC -!- CATALYTIC ACTIVITY: 2 PEROXIDE RADICAL + 2 H(+) = O(2) + H(2)O(2).
CC -!- SIMILARITY: TO CU-ZN SUPEROXIDE DISMUTASES.
DR EMBL; M68862; 8ANPAS0D.
DR PIR; A40564; DSNVAC.
DR PROSITE; PS00087; SOD_CU_ZN_1.
DR PROSITE; PS00332; SOD_CU_ZN_2.
KW LATE PROTEIN; OXIDOREDUCTASE; COPPER; ZINC.



FT METAL 43 43 COPPER (BY SIMILARITY).
FT METAL 45 45 COPPER (BY SIMILARITY).
FT METAL 60 60 COPPER AND ZINC (BY SIMILARITY).
FT METAL 68 68 ZINC (BY SIMILARITY).
FT METAL 77 77 ZINC (BY SIMILARITY).
FT METAL 80 80 ZINC (BY SIMILARITY).
FT METAL 118 118 COPPER (BY SIMILARITY).
FT DISULFID 54 144 BY SIMILARITY.
SQ SEQUENCE 151 AA; 16182 MW; 113718 CN;



Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
II I I III 

HEYGDTSNGCTSAGEHFNPTNEDHGAPDAEIRHVGDLGNIKSAGYNSLTEVNHNDNVNSLYGPHNIIGRSLV 
50 60 70 80 90 X 100 110 

VHTDKDDLGLTDHPLSKTTGNSGGRLGCGIIAICK 
120 130 140 150 



5. US-08-249-182-9 (1-16) 

UNG_FOWP1 URACIL-DNA GLYCOSYLASE (EC 3.2.2.-).

ID UNG_FOWP1 STANDARD; PRT; 218 AA.
AC P21968;
DT 01-AUG-1991 (REL. 19, CREATED)
DT 01-AUG-1991 (REL. 19, LAST SEQUENCE UPDATE)
DT 01-OCT-1993 (REL. 27, LAST ANNOTATION UPDATE)
DE URACIL-DNA GLYCOSYLASE (EC 3.2.2.-).
GN FPD4.
OS FOWLPOX VIRUS (STRAIN FP-1).
OC VIRIDAE; DS-DNA ENVELOPED VIRUSES; POXVIRIDAE; CHORDOPOXVIRINAE;
OC AVIPOXVIRUSES.
RN [1]
RP SEQUENCE FROM N.A.
RM 90324937
RA TARTAGLIA J., WINSLOW J., GOEBEL S., JOHNSON G.P., TAYLOR J.,
RA PAOLETTI E.;
RL J. GEN. VIROL. 71:1517-1524(1990).
CC -!- FUNCTION: EXCISES URACIL RESIDUES FROM THE DNA WHICH CAN ARISE
CC AS A RESULT OF MISINCORPORATION OF DUMP RESIDUES BY DNA
CC POLYMERASE OR DUE TO DEAMINATION OF CYTOSINE.
CC -!- SIMILARITY: DISTANTLY, BUT SIGNIFICANTLY RELATED TO ALL OTHER
CC SPECIES UNG.
DR EMBL; X17202; POFPHIND.
DR PIR; A35216; A35216.
DR PROSITE; PS00130; U_DNA_GLYCOSYLASE.
KW HYDROLASE; GLYCOSIDASE; DNA REPAIR.
SQ SEQUENCE 218 AA; 25563 MW; 252913 CN;



Initial Score = 7 Optimized Score = 8 Significance = 5.18
Residue Identity = 50% Matches = 8 Mismatches = 8
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
II II III I

NYYLSCREGEAKSHKIFWERLADVFINHIAAYVSVFYFLGKSDFSNFRSILNSPTTVVVGYHPAARNRQFDT
120 130 140 150 160 170 180 X 190 



DETFEIVNTLLELKNEPRINWVQGFEI 
200 210 



6. US-08-249-182-9 (1-16) 

CD8A_RAT T-CELL SURFACE GLYCOPROTEIN CD8 ALPHA CHAIN PRECUR

ID CD8A_RAT STANDARD; PRT; 236 AA.
AC P07725;
DT 01-APR-1988 (REL. 07, CREATED)
DT 01-APR-1988 (REL. 07, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE T-CELL SURFACE GLYCOPROTEIN CD8 ALPHA CHAIN PRECURSOR (CD8 ANTIGEN 32
DE KD CHAIN) (OX-8 MEMBRANE ANTIGEN).
OS RATTUS NORVEGICUS (RAT).
OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;
OC EUTHERIA; RODENTIA.
RN [1]
RP SEQUENCE FROM N.A.
RM 86030231
RA JOHNSON P., GAGNON J., BARCLAY A.N., WILLIAMS A.F.;
RL EMBO J. 4:2539-2545(1985).
CC -!- FUNCTION: IDENTIFIES CYTOTOXIC/SUPPRESSOR T CELLS THAT INTERACT
CC WITH MHC CLASS I BEARING TARGETS. CD8 IS THOUGHT TO PLAY A ROLE IN
CC THE PROCESS OF T CELL MEDIATED KILLING. CD8 ALPHA CHAINS BINDS TO
CC CLASS MHC MOLECULES ALPHA-3 DOMAINS.
CC -!- SUBCELLULAR LOCATION: TYPE I MEMBRANE PROTEIN.
CC -!- SUBUNIT: IN GENERAL HETERODIMER OF AN ALPHA AND A BETA CHAIN
CC LINKED BY TWO DISULFIDE BONDS. CAN ALSO FORMS HOMODIMERS.
CC -!- SIMILARITY: BELONGS TO THE IMMUNOGLOBULIN SUPERFAMILY.
DR EMBL; X03015; RNANT0X8.
DR PIR; A24637; A24637.
KW IMMUNOGLOBULIN FOLD; TRANSMEMBRANE; T-CELL; ANTIGEN; GLYCOPROTEIN;
KW MHC; SIGNAL.
FT SIGNAL 1 26 POTENTIAL.
FT CHAIN 27 236 T-CELL SURFACE GLYCOPROTEIN CD8 ALPHA
FT CHAIN.



FT DOMAIN 27 187 EXTRACELLULAR (POTENTIAL).
FT TRANSMEM 188 212 POTENTIAL.
FT DOMAIN 213 236 CYTOPLASMIC (POTENTIAL).
FT DISULFID 47 119 POTENTIAL.
FT CARBOHYD 63 63 PROBABLE.
SQ SEQUENCE 236 AA; 26196 MW; 307073 CN;



Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
III I I II

DPNLFSARKENNKYILTLSKFSTKNQGYYFCSITSNSVMYFSPLVPVFQKVNSIITKPVTRAPTPVPPPTGT
90 100 110 120 130 140 150 X 160

PRPLRPEACRPGASGSVEGMGLGFACDIYIWAPLAGICAVLLLS
170 180 190 200 



7. US-08-249-182-9 (1-16)

SC13_YEAST PROTEIN TRANSPORT PROTEIN SEC13.

ID SC13_YEAST STANDARD; PRT; 297 AA.
AC Q04491;
DT 01-OCT-1993 (REL. 27, CREATED)
DT 01-OCT-1993 (REL. 27, LAST SEQUENCE UPDATE)
DT 01-FEB-1994 (REL. 28, LAST ANNOTATION UPDATE)
DE PROTEIN TRANSPORT PROTEIN SEC13.
GN SEC13.
OS SACCHAROMYCES CEREVISIAE (BAKER'S YEAST).
OC EUKARYOTA; FUNGI; ASCOMYCOTINA; HEMIASCOMYCETES.
RN [1]
RP SEQUENCE FROM N.A., AND MUTAGENESIS.
RM 93163112
RA PRYER N.K., SALAMA N.R., SCHEKMAN R., KAISER C.A.;
RL J. CELL BIOL. 120:865-875(1993).
CC -!- FUNCTION: REQUIRED IN VESICLE BIOGENESIS AT A STEP BEFORE OR
CC CONCURRENT WITH THE RELEASE OF TRANSPORT VESICLES FROM THE ER
CC MEMBRANE. REQUIRED FOR GERMINATION AND/OR GROWTH AT 24 DEGREE C.
CC MIGHT INTERACT WITH PROTEINS SEC23 AND SAR1.
CC -!- SUBUNIT: FORMS AN ACTIVE 700 KD LARGE COMPLEX WITH OTHER PROTEINS.
CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC. PERIPHERALLY ASSOCIATED WITH
CC MEMBRANES.
CC -!- SIMILARITY: SOME SIMILARITY TO THE BETA TRANSDUCIN FAMILY TRP-ASP
CC DOMAINS.
DR EMBL; L05929; SCSEC13P.
DR PIR; S30803; S30803.
DR PIR; A45442; A45442.

KW TRANSPORT; PROTEIN TRANSPORT; MEMBRANE; ENDOPLASMIC RETICULUM;
KW REPEAT.
FT DOMAIN 8 281 6 X APPROXIMATE REPEATS.
FT REPEAT 8 35 1.
FT REPEAT 52 82 2.
FT REPEAT 98 127 3.
FT REPEAT 144 185 4.
FT REPEAT 203 234 5.
FT REPEAT 257 281 6.
FT MUTAGEN 224 224 S-
FT MUTAGEN 262 262 H-
FT MUTAGEN 266 266 G->D: GROWTH INHIBITED ABOVE 34 C.
SQ SEQUENCE 297 AA; 33043 MW; 480631 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
III I III

HEGPVWRVDWAHPKFGTILASCSYDGKVLIWKEENGRWSQIAVHAVHSASVNSVQWAPHEYGPLLLVASSDG
60 70 80 90 100 X 110 120

KVSVVEFKENGTTSPIIIDAHAIGVNSASWAPATIEEDGEHNGT
130 140 150 160




8. US-08-249-182-9 (1-16) 

DHE4_UNKP NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4) 

ID DHE4_UNKP STANDARD; PRT; 446 AA.
AC P14657;
DT 01-APR-1990 (REL. 14, CREATED)
DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4) (NADP-GDH).
OS UNKNOWN PROKARYOTIC ORGANISM.
OC PROKARYOTA; NOT YET CLASSIFIED.
RN [1]
RP SEQUENCE FROM N.A.
RM 90098893
RA COCK J.M., SCHMIDT R.R.;
RL NUCLEIC ACIDS RES. 17:10500-10500(1989).
CC -!- SEQUENCE ORIGINATES FROM AN ORGANISM CONTAMINATING A CHLORELLA
CC SOROKINIANA CULTURE. PROBABLY A BACTERIUM.
CC -!- CATALYTIC ACTIVITY: L-GLUTAMATE + H(2)O + NADP(+) = 2-OXOGLUTARATE
CC + NH(3) + NADPH.
CC -!- SUBUNIT: HOMOHEXAMER.
CC -!- SIMILARITY: TO OTHER GLUTAMATE DEHYDROGENASES (EC 1.4.1.2 AND
CC EC 1.4.1.3), AND TO LEUCINE AND PHENYLALANINE DEHYDROGENASES.
DR EMBL; X16399; XXGDH.
DR PIR; S06938; S06938.
DR PROSITE; PS00074; GLF_DEHYDROGENASE.
KW OXIDOREDUCTASE; NADP.
FT ACT_SITE 128 128 BY SIMILARITY.

SQ SEQUENCE 446 AA; 48490 MW; 922941 CN; 
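
Each Swiss-Prot entry in this printout is a series of lines that start with a two-letter code (ID, AC, DT, DE, OS, RA, DR, KW, FT, SQ and so on). The sketch below is an illustrative reader for such lines and is not part of FastDB or of any Swiss-Prot tool: the function names `parse_entry` and `sequence_summary` and the dictionary keys are hypothetical. It groups line contents by code and pulls the three numbers out of the SQ line; the CN figure is returned as printed, without interpreting it.

```python
import re
from collections import defaultdict


def parse_entry(text: str) -> dict:
    """Group the lines of one entry by their two-letter line code."""
    fields = defaultdict(list)
    for line in text.splitlines():
        code, _, rest = line.strip().partition(" ")
        # Keep only lines that begin with a two-letter upper-case code.
        if len(code) == 2 and code.isalpha() and code.isupper():
            fields[code].append(rest.strip())
    return dict(fields)


def sequence_summary(fields: dict):
    """Extract length, molecular weight and CN value from the SQ line."""
    sq = " ".join(fields.get("SQ", []))
    m = re.search(r"SEQUENCE\s+(\d+)\s+AA;\s+(\d+)\s+MW;\s+(\d+)\s+CN;", sq)
    if m is None:
        return None
    return {
        "length": int(m.group(1)),
        "mol_weight": int(m.group(2)),
        "cn": int(m.group(3)),
    }


if __name__ == "__main__":
    sample = """ID DHE4_UNKP STANDARD; PRT; 446 AA.
AC P14657;
KW OXIDOREDUCTASE; NADP.
SQ SEQUENCE 446 AA; 48490 MW; 922941 CN;"""
    entry = parse_entry(sample)
    print(entry["AC"], sequence_summary(entry))
```

Keeping each code's lines in a list preserves the order of repeated lines such as DT, CC and FT, which matters when an annotation continues over several lines.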

Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9
Gaps = 0 Conservative Substitutions = 0

X 10 X

VNSMQTVFVGYGPTFK
II I I III

LERLVEPERIIQFRVSWVDDRGQVQVNRAFRVQFNSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120

PMGGGKGGSDFDPKGKSQGRIMRFCQALMTELYRHLGPDTDVPA
130 140 150 160 



9. US-08-249-182-9 (1-16) 

DHE4_SALTY NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4)

ID DHE4_SALTY STANDARD; PRT; 447 AA.
AC P15111;
DT 01-APR-1990 (REL. 14, CREATED)
DT 01-APR-1990 (REL. 14, LAST SEQUENCE UPDATE)
DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE)
DE NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4) (NADP-GDH).
GN GDHA.
OS SALMONELLA TYPHIMURIUM.
OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS;
OC ENTEROBACTERIACEAE.
RN [1]
RP SEQUENCE FROM N.A.
RM 89255551
RA BANSAL A., DAYTON M.A., ZALKIN H., COLMAN R.F.;
RL J. BIOL. CHEM. 264:9827-9835(1989).
CC -!- CATALYTIC ACTIVITY: L-GLUTAMATE + H(2)O + NADP(+) = 2-OXOGLUTARATE
CC + NH(3) + NADPH.
CC -!- SUBUNIT: HOMOHEXAMER.
CC -!- SIMILARITY: TO OTHER GLUTAMATE DEHYDROGENASES (EC 1.4.1.2 AND
CC EC 1.4.1.3), AND TO LEUCINE AND PHENYLALANINE DEHYDROGENASES.
DR EMBL; M24021; STGDHA.
DR PROSITE; PS00074; GLF_DEHYDROGENASE.
KW OXIDOREDUCTASE; NADP.
FT ACT_SITE 128 128 BY SIMILARITY.
SQ SEQUENCE 447 AA; 48560 MW; 955116 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9



X 10 X

VNSMQTVFVGYGPTFK
II I I III

LERLVEPERVIQFRVVWLDDKNQVQVNRAWRVQFNSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTL
60 70 80 90 100 110 X 120 

PMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGPDTDVPA 
130 140 150 160 



10. US-08-249-182-9 (1-16) 

DHE4.EC0LI NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4) 

ID DHE4_EC0LI STANDARD; PRT? 447 AA. 

AC P00370; 

DT 21-JUL-1986 (REL. 01, CREATED) 

DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE) 

DT 01-DEC-1992 (REL. 24, LAST ANNOTATION UPDATE) 

DE NADP-SPECIFIC GLUTAMATE DEHYDROGENASE (EC 1.4.1.4) (NADP-GDH) . 

GN GDHA. 

OS ESCHERICHIA COLI. 

OC PROKARYOTA; GRACILICUTES; SCOTOBACTERIA; FACULTATIVELY ANAEROBIC RODS; 

OC ENTEROBACTERIACEAE.

RN [1]

RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-36. 

RC STRAIN=K12; 

RM 83272967 

RA MCPHERSON M.J., WOOTTON J.C.; 

RL NUCLEIC ACIDS RES. 11:5257-5266(1983). 

RN [2]

RP SEQUENCE FROM N.A. 

RM 84209849 

RA VALLE F., BECERRIL B., CHEN E., SEEBURG P.H., HEYNEKER H., BOLIVAR F.;

RL GENE 27:193-199(1984). 

CC -!- CATALYTIC ACTIVITY: L-GLUTAMATE + H(2)O + NADP(+) = 2-OXOGLUTARATE

CC + NH(3) + NADPH. 

CC -!- SUBUNIT: HOMOHEXAMER. 

CC -!- SIMILARITY: TO OTHER GLUTAMATE DEHYDROGENASES (EC 1.4.1.2 AND

CC EC 1.4.1.3), AND TO LEUCINE AND PHENYLALANINE DEHYDROGENASES. 

DR EMBL; X00988; ECGDHA.

DR EMBL; K02499; ECGDHAK.

DR PIR; A00382; DEECEN.

DR PIR; A22413; A22413.

DR ECO2DBASE; G043.6; 5TH EDITION.

DR ECO2DBASE; G043.7; 5TH EDITION.

DR ECOGENE; EG10372; GDHA.

DR PROSITE; PS00074; GLF_DEHYDROGENASE.

KW OXIDOREDUCTASE; NADP. 

FT ACT_SITE 128 128 BY SIMILARITY. 

FT CONFLICT 385 385 A -> P (IN REF. 2).

SQ SEQUENCE 447 AA; 48581 MW; 947292 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSMSTVFVGYGPTFK 
II I I III 

LERLVEPERVIQFRVVWVDDRNQIQVNRAWRVQFSSAIGPYKGGNRFHPSVNLSILKFLGFEQTFKNALTTL 
60 70 80 90 100 110 X 120 



PMGGGKGGSDFDPKGKSEGEVMRFCQALHTELYRHLGADTDVPA 
130 140 150 160 



11. US-08-249-182-9 (1-16) 

AMPL_BOVIN CYTOSOL AMINOPEPTIDASE (EC 3.4.11.1) (LEUCINE AMIN

ID AMPL_BOVIN STANDARD; PRT; 478 AA.

AC P00727; 

DT 21-JUL-1986 (REL. 01, CREATED) 

DT 21-JUL-1986 (REL. 01, LAST SEQUENCE UPDATE)

DT 01-JUL-1993 (REL. 26, LAST ANNOTATION UPDATE) 

DE CYTOSOL AMINOPEPTIDASE (EC 3.4.11.1) (LEUCINE AMINOPEPTIDASE) (LAP)

DE (LEUCYL AMINOPEPTIDASE) (PROLINE AMINOPEPTIDASE (EC 3.4.11.5) (PROLYL

DE AMINOPEPTIDASE)).

OS BOS TAURUS (BOVINE).

OC EUKARYOTA; METAZOA; CHORDATA; VERTEBRATA; TETRAPODA; MAMMALIA;

OC EUTHERIA; ARTIODACTYLA.

RN [1] 

RP SEQUENCE. 

RC TISSUE=LENS;

RM 82213853

RA CUYPERS H.T., VAN LOON-KLAASSEN L.A.H., VREE EGBERTS W.T.M.,

RA DE JONG W.W., BLOEMENDAL H.;

RL J. BIOL. CHEM. 257:7077-7085(1982).

RN [2] 

RP SEQUENCE. 

RM 82213854

RA CUYPERS H.T., VAN LOON-KLAASSEN L.A.H., VREE EGBERTS W.T.M.,

RA DE JONG W.W., BLOEMENDAL H.;

RL J. BIOL. CHEM. 257:7086-7091(1982).

RN [3]

RP X-RAY CRYSTALLOGRAPHY (2.7 ANGSTROMS).

RM 90370887

RA BURLEY S.K., DAVID P.R., TAYLOR A., LIPSCOMB W.N.;

RL PROC. NATL. ACAD. SCI. U.S.A. 87:6878-6882(1990).

RN [4]

RP X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS).

RM 92194311

RA BURLEY S.K., DAVID P.R., SWEET R.M., TAYLOR A., LIPSCOMB W.N.;

RL J. MOL. BIOL. 224:113-140(1992).

CC -!- FUNCTION: PRESUMABLY INVOLVED IN THE PROCESSING AND REGULAR
CC TURNOVER OF INTRACELLULAR PROTEINS. CATALYZES THE REMOVAL OF

CC UNSUBSTITUTED AMINO-TERMINAL AMINO ACIDS FROM VARIOUS PEPTIDES.

CC -!- CATALYTIC ACTIVITY: AMINOACYL-PEPTIDE + H(2)O = AMINO ACID +
CC PEPTIDE.

CC -!- COFACTOR: BINDS TWO ZINC IONS PER SUBUNIT. ONE ZINC ION IS TIGHTLY

CC BOUND AND ESSENTIAL FOR ENZYME ACTIVITY, WHILE THE SECOND METAL

CC COORDINATION SITE CAN BE OCCUPIED BY ZINC, MAGNESIUM OR MANGANESE

CC TO GIVE ENZYMES OF DIFFERENT ACTIVITIES.

CC -!- ENZYME REGULATION: INHIBITED BY BESTATIN.

CC -!- SUBUNIT: HOMOHEXAMER.

CC -!- SUBCELLULAR LOCATION: CYTOPLASMIC.

CC -!- SIMILARITY: BELONGS TO THE CYTOSOL AMINOPEPTIDASE FAMILY.

DR PIR; A00907; APBOL.

DR PDB; 1LAP; 15-OCT-91.

DR PDB; 1BPH; 15-JUL-93.

DR PDB; 1BPN; 15-JUL-93.

DR PROSITE; PS00631; CYTOSOL_AP.

KW HYDROLASE; AMINOPEPTIDASE; ACETYLATION; ZINC; 3D-STRUCTURE; MAGNESIUM;

KW MANGANESE.

FT MOD_RES 1 1 ACETYLATION.

FT METAL 250 250 ZINC (2).

FT METAL 255 255 ZINC (1 AND 2).

FT METAL 273 273 ZINC (2).

FT METAL 332 332 ZINC (1).

FT METAL 334 334 ZINC (1 AND 2).

FT ACT_SITE 262 262 POTENTIAL.



ri oinniiu toj too 

FT TURN 469 470 

FT HELIX 471 472 

SQ SEQUENCE 478 AA; 51692 MW; 1107347 CN;

Initial Score = 7 Optimized Score = 7 Significance = 5.18

Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0 

X 10 X 

VNSHQTVFVGYGPTFK 
Mil I II 

NLKSAS I KTDVF IRPKSWI EE8EHGSFLSVAKGSEEPP VFLE IHYKGSPNASEPPLVFVGKGI TFDSGG I S I 
190 200 210 220 230 240 250 X 260 

KAAANMDLHRADMGGAATICSAIVSAAKLDLPINIVGLAPLCEN

270 280 290 300 



12. US-08-249-182-9 (1-16) 

HEMA_PI3B HEMAGGLUTININ-NEURAMINIDASE (EC 3.2.1.18).

ID HEMA_PI3B STANDARD; PRT; 572 AA.

AC P06167; 

DT 01-JAN-1988 (REL. 06, CREATED) 

DT 01-JAN-1988 (REL. 06, LAST SEQUENCE UPDATE) 

DT 01-MAY-1992 (REL. 22, LAST ANNOTATION UPDATE) 

DE HEMAGGLUTININ-NEURAMINIDASE (EC 3.2.1.18). 

GN HN. 

OS BOVINE PARAINFLUENZA 3 VIRUS. 

OC VIRIDAE; SS-RNA ENVELOPED VIRUSES, NEGATIVE-STRAND; PARAMYXOVIRIDAE;

OC PARAMYXOVIRUSES.

RN [1]

RP SEQUENCE FROM N.A. 

RM 87174819 

RA SUZU S., SAKAI Y., SHIODA T., SHIBUTA H.;

RL NUCLEIC ACIDS RES. 15:2945-2958(1987). 

CC -!- FUNCTION: HEMAGGLUTININ IS RESPONSIBLE FOR ATTACHING THE VIRUS 

CC TO CELL RECEPTORS AND FOR INITIATING INFECTION. NEURAMINIDASE

CC ACTIVITY HELPS THE EFFICIENT SPREAD OF THE VIRUS BY DISSOCIATING 

CC THE MATURE VIRIONS FROM THE NEURAMINIC ACID CONTAINING 

CC GLYCOPROTEINS. 

CC -!- SUBCELLULAR LOCATION: EXTERNAL, ANCHORED TO THE ENVELOPE BY ITS 

CC N-TERMINAL HYDROPHOBIC SEQUENCE. 

CC -!- SIMILARITY: TO HEMAGGLUTININ-NEURAMINIDASES FROM OTHER 

CC PARAMYXOVIRUSES. 

DR EMBL; Y00114; PAMBPIV3. 

DR PIR; B27218; HNNZB3. 

KW HYDROLASE; HEMAGGLUTININ; ENVELOPE PROTEIN; GLYCOPROTEIN; 

KW TRANSMEMBRANE.



FT DOMAIN 1 30 CYTOPLASMIC (POTENTIAL).

FT TRANSMEM 31 53 POTENTIAL.

FT DOMAIN 54 572 EXTRACELLULAR (POTENTIAL).

FT CARBOHYD 308 308 POTENTIAL.

FT CARBOHYD 351 351 POTENTIAL.

FT CARBOHYD 448 448 POTENTIAL.

FT CARBOHYD 523 523 POTENTIAL.

FT CARBOHYD 570 570 POTENTIAL.

SQ SEQUENCE 572 AA; 64590 MW; 1764491 CN;



Initial Score = 7 Optimized Score = 7 Significance = 5.18
Residue Identity = 43% Matches = 7 Mismatches = 9

Gaps = 0 Conservative Substitutions = 0 



X 10 X 

VNSHQTVFVGYGPTFK 



flu_ai it 


Jjb 


4it> 


STRAND 3 8

STRAND 20 20

HELIX 22 30


TURN 


31 
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